default search action
Ryuichi Yamamoto
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c43]Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda:
Electrolaryngeal Speech Intelligibility Enhancement through Robust Linguistic Encoders. ICASSP 2024: 10961-10965 - [c42]Hyun-Wook Yoon, Jin-Seob Kim, Ryuichi Yamamoto, Ryo Terashima, Chan-Ho Song, Jae-Min Kim, Eunwoo Song:
Enhancing Multilingual TTS with Voice Conversion Based Data Augmentation and Posterior Embedding. ICASSP 2024: 12186-12190 - [c41]Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana:
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions. ICASSP 2024: 12672-12676 - [i27]You Zhang, Yongyi Zang, Jiatong Shi, Ryuichi Yamamoto, Jionghao Han, Yuxun Tang, Tomoki Toda, Zhiyao Duan:
SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan. CoRR abs/2405.05244 (2024) - [i26]Yongyi Zang, Jiatong Shi, You Zhang, Ryuichi Yamamoto, Jionghao Han, Yuxun Tang, Shengyuan Xu, Wenxiao Zhao, Jing Guo, Tomoki Toda, Zhiyao Duan:
CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection. CoRR abs/2406.02438 (2024) - [i25]Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari:
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark. CoRR abs/2406.07254 (2024) - [i24]Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari:
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment. CoRR abs/2406.07280 (2024) - [i23]Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana:
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning. CoRR abs/2406.07969 (2024) - [i22]You Zhang, Yongyi Zang, Jiatong Shi, Ryuichi Yamamoto, Tomoki Toda, Zhiyao Duan:
SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge. CoRR abs/2408.16132 (2024) - [i21]Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana:
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control. CoRR abs/2409.17452 (2024) - 2023
- [c40]Ryuichi Yamamoto, Reo Yoneyama, Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda:
A Comparative Study of Voice Conversion Models With Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023. ASRU 2023: 1-6 - [c39]Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana:
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform. ICASSP 2023: 1-5 - [c38]Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis. ICASSP 2023: 1-5 - [c37]Ryuichi Yamamoto, Reo Yoneyama, Tomoki Toda:
NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit. ICASSP 2023: 1-5 - [c36]Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana:
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs. ICASSP 2023: 1-5 - [i20]Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana:
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions. CoRR abs/2309.08140 (2023) - [i19]Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda:
Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders. CoRR abs/2309.09627 (2023) - [i18]Ryuichi Yamamoto, Reo Yoneyama, Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda:
A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023. CoRR abs/2310.05203 (2023) - 2022
- [c35]Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto:
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning. INTERSPEECH 2022: 793-797 - [c34]Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech. INTERSPEECH 2022: 1931-1935 - [c33]Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim:
TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder. INTERSPEECH 2022: 1941-1945 - [c32]Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana:
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. INTERSPEECH 2022: 3018-3022 - [c31]Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang:
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems. INTERSPEECH 2022: 4596-4600 - [i17]Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto:
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning. CoRR abs/2203.15683 (2022) - [i16]Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana:
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. CoRR abs/2204.10020 (2022) - [i15]Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim:
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder. CoRR abs/2206.14984 (2022) - [i14]Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang:
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems. CoRR abs/2206.15067 (2022) - [i13]Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana:
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs. CoRR abs/2210.15887 (2022) - [i12]Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis. CoRR abs/2210.15964 (2022) - [i11]Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana:
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform. CoRR abs/2210.15975 (2022) - [i10]Ryuichi Yamamoto, Reo Yoneyama, Tomoki Toda:
NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit. CoRR abs/2210.15987 (2022) - 2021
- [c30]Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim:
Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators. ICASSP 2021: 6039-6043 - [c29]Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis. ICASSP 2021: 6598-6602 - [c28]Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model. Interspeech 2021: 2227-2231 - [c27]Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis. Interspeech 2021: 3126-3130 - [c26]Katsuya Tanaka, Masami Mukai, Ryuichi Yamamoto, Naoki Mihara:
Data-Sharing Gateway System Design for Large-Scale Medical Information Collection with Distributed EMR Storage. MedInfo 2021: 205-209 - [c25]Eunwoo Song, Ryuichi Yamamoto, Min-Jae Hwang, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim:
Improved Parallel Wavegan Vocoder with Perceptually Weighted Spectrogram Loss. SLT 2021: 470-476 - [i9]Eunwoo Song, Ryuichi Yamamoto, Min-Jae Hwang, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim:
Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss. CoRR abs/2101.07412 (2021) - [i8]Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana:
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis. CoRR abs/2104.12395 (2021) - [i7]Tomoki Hayashi, Ryuichi Yamamoto, Takenori Yoshimura, Peter Wu, Jiatong Shi, Takaaki Saeki, Yooncheol Ju, Yusuke Yasuda, Shinnosuke Takamichi, Shinji Watanabe:
ESPnet2-TTS: Extending the Edge of TTS Research. CoRR abs/2110.07840 (2021) - 2020
- [c24]Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
Parallel Wavegan: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram. ICASSP 2020: 6199-6203 - [c23]Min-Jae Hwang, Eunwoo Song, Ryuichi Yamamoto, Frank K. Soong, Hong-Goo Kang:
Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network. ICASSP 2020: 7219-7223 - [c22]Katsuki Inoue, Sunao Hara, Masanobu Abe, Tomoki Hayashi, Ryuichi Yamamoto, Shinji Watanabe:
Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models. ICASSP 2020: 7634-7638 - [c21]Tomoki Hayashi, Ryuichi Yamamoto, Katsuki Inoue, Takenori Yoshimura, Shinji Watanabe, Tomoki Toda, Kazuya Takeda, Yu Zhang, Xu Tan:
Espnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit. ICASSP 2020: 7654-7658 - [c20]Eunwoo Song, Min-Jae Hwang, Ryuichi Yamamoto, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim:
Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder. INTERSPEECH 2020: 3570-3574 - [p2]Katsuya Tanaka, Ryuichi Yamamoto:
Health Test Bed Group. Security Infrastructure Technology for Integrated Utilization of Big Data 2020: 133-166 - [i6]Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech Synthesis. CoRR abs/2010.13421 (2020) - [i5]Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim:
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators. CoRR abs/2010.14151 (2020)
2010 – 2019
- 2019
- [c19]Shigeki Karita, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto:
A Comparative Study on Transformer vs RNN in Speech Applications. ASRU 2019: 449-456 - [c18]Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation. INTERSPEECH 2019: 699-703 - [c17]Katsuya Tanaka, Ryuichi Yamamoto:
Assessment of Traceability Implementation of a Cross-Institutional Secure Data Collection System Based on Distributed Standardized EMR Storage. MedInfo 2019: 1373-1377 - [c16]Mayumi Yoshida, Ryuichi Yamamoto:
A Survey on Health Care and Health Concerning Workers for Considering Appropriate Personal Health Record Service. MedInfo 2019: 1819-1820 - [i4]Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation. CoRR abs/1904.04472 (2019) - [i3]Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang:
A Comparative Study on Transformer vs RNN in Speech Applications. CoRR abs/1909.06317 (2019) - [i2]Tomoki Hayashi, Ryuichi Yamamoto, Katsuki Inoue, Takenori Yoshimura, Shinji Watanabe, Tomoki Toda, Kazuya Takeda, Yu Zhang, Xu Tan:
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit. CoRR abs/1910.10909 (2019) - [i1]Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim:
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram. CoRR abs/1910.11480 (2019) - 2018
- [c15]Katsuya Tanaka, Ryuichi Yamamoto, Kazuhisa Nakasho, Atsuko Miyaji:
Development of a Secure Cross-Institutional Data Collection System Based on Distributed Standardized EMR Storage. EFMI-STC 2018: 35-39 - 2014
- [c14]Shinji Sako, Ryuichi Yamamoto, Tadashi Kitamura:
Ryry: A Real-Time Score-Following Automatic Accompaniment Playback System Capable of Real Performances with Errors, Repeats and Jumps. AMT 2014: 134-145 - [c13]Katsuya Tanaka, Takashi Noguchi, Ryuichi Yamamoto, Kengo Miyo, Kazuhiko Ohe:
A Nationwide Remote EHR Backup Project in anticipation of Large-scale Disaster in Japan. MIE 2014: 1241 - [e1]Shu-Heng Chen, Takao Terano, Ryuichi Yamamoto, Chung-Ching Tai:
Advances in Computational Social Science, The Fourth World Congress [Post-Conference Proceedings of the World Congress on Social Simulation, WCSS 2012, Taipei, Taiwan, Sepemtember 4-7, 2012]. Agent-Based Social Systems 11, Springer 2014, ISBN 978-4-431-54846-1 [contents] - 2013
- [c12]Ryuichi Yamamoto, Shinji Sako, Tadashi Kitamura:
Robust on-line algorithm for real-time audio-to-score alignment based on a delayed decision and anticipation framework. ICASSP 2013: 191-195
2000 – 2009
- 2009
- [c11]Kesami Sano, Satoko Tsuru, Mariko Matsuki, Shogo Kato, Ryuichi Yamamoto, Sawako Kawamura, Masaharu Ito, Mari Kimata, Masahiko Munechika, Yoshinori Iizuka:
Trial to Structuralize and IT-systematize Home-visit Nursing Based on PCAPS for Quality Improvement. Nursing Informatics 2009: 793 - [c10]Mariko Matsuki, Satoko Tsuru, Kesami Sano, Shogo Kato, Junko Yamazaki, Akemi Izumiyama, Satoko Yamaji, Satsuki Tanahashi, Ryuichi Yamamoto, Sawako Kawamura:
Electronic Standard Care Plans of Home-visit Nursing for Patients of Euronal Intractable Diseases Based on PCAPS. Nursing Informatics 2009: 797-798 - 2007
- [c9]Yasuyuki Hirose, Ryuichi Yamamoto, Shinichiro Ueda:
The Nodes Focusing Tool for Clinical Course Data of Hypergraph Structure in the Ontological Framework CSX Output from POMR-based EMR system. MedInfo 2007: 741-745 - [c8]Katsuya Tanaka, Mayumi Yoshida, Ryuichi Yamamoto:
Secure Remote Access for Web Based Clinical Information System Using Policy Control of PCs Healthcare PKI Authentication. MedInfo 2007: 1480 - 2005
- [p1]Mitja Lenic, Peter Kokol, Milan Zorman, Petra Povalej, Bruno Stiglic, Ryuichi Yamamoto:
Improved Knowledge Mining with the Multimethod Approach. Foundations of Data Mining and knowledge Discovery 2005: 305-318 - 2002
- [c7]Gou Masuda, Norihiro Sakamoto, Ryuichi Yamamoto:
A Framework for Dynamic Evidence Based Medicine using Data Mining. CBMS 2002: 117-122 - [c6]Milan Zorman, Gou Masuda, Peter Kokol, Ryuichi Yamamoto, Bruno Stiglic:
Mining Diabetes Database With Decision Trees and Association Rules. CBMS 2002: 134- - [c5]Mitja Lenic, Peter Kokol, Ryuichi Yamamoto:
IFSIMS -Internet Frame ork Service for Intelligent Medical Systems. CBMS 2002: 295- - 2001
- [c4]Gou Masuda, Norihiro Sakamoto, Rumi Sakai, Ryuichi Yamamoto:
An Exchange Format for Use-cases of Hospital Information Systems. MedInfo 2001: 109-113 - [c3]Matej Sprogar, Peter Kokol, Milan Zorman, Vili Podgorelec, Ryuichi Yamamoto, Gou Masuda, Norihiro Sakamoto:
Supporting Medical Decisions with Vector Decision Trees. MedInfo 2001: 552-556 - [c2]Kenji Hatano, Kazuhiko Ohe, Ryuichi Yamamoto:
Development of the Set of Data Identifiers for Medical Record Information Exchange. MedInfo 2001: 706 - 2000
- [j1]Hiroshi Takeda, Yasushi Matsumura, Shigeki Kuwata, Hirohiko Nakano, Norihiro Sakamoto, Ryuichi Yamamoto:
Architecture for networked electronic patient record systems. Int. J. Medical Informatics 60(2): 161-167 (2000)
1990 – 1999
- 1998
- [c1]Hiroshi Mizushima, Eiko Uchiyama, Masanori Akiyama, Ryuichi Yamamoto, Hiroyuki Tatsumi:
Medical Internet Exchange Project in JAPAN. MedInfo 1998: 417-419
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-22 21:18 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint