default search action

combined dblp search
author search
venue search
publication search

ask others

Kentaro Mitsui

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HonoMZMWS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HonoMZMWS24
Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada:
Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition. ACL (Findings) 2024: 13289-13305
[c6]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/SawadaZSMKHWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/SawadaZSMKHWM24
Kei Sawada, Tianyu Zhao, Makoto Shing, Kentaro Mitsui, Akio Kaga, Yukiya Hono, Toshiaki Wakatsuki, Koh Mitsuda:
Release of Pre-Trained Models for the Japanese Language. LREC/COLING 2024: 13898-13905
[c5]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/MitsuiMWHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/MitsuiMWHS24
Kentaro Mitsui, Koh Mitsuda, Toshiaki Wakatsuki, Yukiya Hono, Kei Sawada:
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems. EMNLP (Findings) 2024: 2692-2700
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-01657
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-01657
Kei Sawada, Tianyu Zhao, Makoto Shing, Kentaro Mitsui, Akio Kaga, Yukiya Hono, Toshiaki Wakatsuki, Koh Mitsuda:
Release of Pre-Trained Models for the Japanese Language. CoRR abs/2404.01657 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-12428
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-12428
Kentaro Mitsui, Koh Mitsuda, Toshiaki Wakatsuki, Yukiya Hono, Kei Sawada:
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems. CoRR abs/2406.12428 (2024)
2023
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MitsuiHS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MitsuiHS23
Kentaro Mitsui, Yukiya Hono, Kei Sawada:
UniFLG: Unified Facial Landmark Generator from Text or Speech. INTERSPEECH 2023: 5501-5505
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-06883
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-06883
AprilPyone MaungMaung, Makoto Shing, Kentaro Mitsui, Kei Sawada, Fumio Okura:
Text-Guided Scene Sketch-to-Photo Synthesis. CoRR abs/2302.06883 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-14337
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-14337
Kentaro Mitsui, Yukiya Hono, Kei Sawada:
UniFLG: Unified Facial Landmark Generator from Text or Speech. CoRR abs/2302.14337 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-01088
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-01088
Kentaro Mitsui, Yukiya Hono, Kei Sawada:
Towards human-like spoken dialogue generation between AI agents from written dialogue. CoRR abs/2310.01088 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-03668
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-03668
Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada:
An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition. CoRR abs/2312.03668 (2023)
2022
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MitsuiS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MitsuiS22
Kentaro Mitsui, Kei Sawada:
MSR-NV: Neural Vocoder Using Multiple Sampling Rates. INTERSPEECH 2022: 798-802
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MitsuiZSHNT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MitsuiZSHNT22
Kentaro Mitsui, Tianyu Zhao, Kei Sawada, Yukiya Hono, Yoshihiko Nankaku, Keiichi Tokuda:
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue. INTERSPEECH 2022: 2328-2332
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-12040
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-12040
Kentaro Mitsui, Tianyu Zhao, Kei Sawada, Yukiya Hono, Yoshihiko Nankaku, Keiichi Tokuda:
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue. CoRR abs/2206.12040 (2022)
2021
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MitsuiKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MitsuiKS21
Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Deep Gaussian process based multi-speaker speech synthesis with latent speaker representation. Speech Commun. 132: 132-145 (2021)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2109-13714
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-13714
Kentaro Mitsui, Kei Sawada:
MSR-NV: Neural Vocoder Using Multiple Sampling Rates. CoRR abs/2109.13714 (2021)
2020
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MitsuiKS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MitsuiKS20
Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Multi-Speaker Text-to-Speech Synthesis Using Deep Gaussian Processes. INTERSPEECH 2020: 2032-2036
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-02950
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-02950
Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari:
Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes. CoRR abs/2008.02950 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1908-06248
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-06248
Shinnosuke Takamichi, Kentaro Mitsui, Yuki Saito, Tomoki Koriyama, Naoko Tanji, Hiroshi Saruwatari:
JVS corpus: free Japanese multi-speaker voice corpus. CoRR abs/1908.06248 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.