Ryo Masumura

2020 – today

2024
[c118]
persistent URL: https://dblp.org/rec/conf/icassp/MizunoHSSISTKKM24
Saki Mizuno, Nobukatsu Hojo, Kazutoshi Shinoda, Keita Suzuki, Mana Ihori, Hiroshi Sato, Tomohiro Tanaka, Naotaka Kawata, Satoshi Kobashikawa, Ryo Masumura:
Talking Face Generation for Impression Conversion Considering Speech Semantics. ICASSP 2024: 8411-8415
[c117]
persistent URL: https://dblp.org/rec/conf/icpr/MasumuraTSO24
Ryo Masumura, Akihiko Takashima, Satoshi Suzuki, Shota Orihashi:
Born-Again Multi-task Self-training for Multi-task Facial Emotion Recognition. ICPR (16) 2024: 94-108
[i25]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-18910
Atsushi Ando, Takafumi Moriya, Shota Horiguchi, Ryo Masumura:
Factor-Conditioned Speaking-Style Captioning. CoRR abs/2406.18910 (2024)
[i24]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-20301
Takafumi Moriya, Shota Horiguchi, Marc Delcroix, Ryo Masumura, Takanori Ashihara, Hiroshi Sato, Kohei Matsuura, Masato Mimura:
Alignment-Free Training for Transducer-based Multi-Talker ASR. CoRR abs/2409.20301 (2024)
[i23]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-20313
Takafumi Moriya, Takanori Ashihara, Masato Mimura, Hiroshi Sato, Kohei Matsuura, Ryo Masumura, Taichi Asami:
Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding. CoRR abs/2409.20313 (2024)
2023
[c116]
persistent URL: https://dblp.org/rec/conf/eusipco/MasumuraMITTO23
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Text-to-Text Pre-Training with Paraphrasing for Improving Transformer-Based Image Captioning. EUSIPCO 2023: 516-520
[c115]
persistent URL: https://dblp.org/rec/conf/icassp/HojoMKM23
Nobukatsu Hojo, Saki Mizuno, Satoshi Kobashikawa, Ryo Masumura:
Modeling Lead-Lag Structure in Facial Expression Synchrony for Social-Psychological Outcome Prediction from Negotiation Interaction. ICASSP Workshops 2023: 1-5
[c114]
persistent URL: https://dblp.org/rec/conf/icassp/MatsuuraAMTODM23
Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Tomohiro Tanaka, Atsunori Ogawa, Marc Delcroix, Ryo Masumura:
Leveraging Large Text Corpora For End-To-End Speech Summarization. ICASSP 2023: 1-5
[c113]
persistent URL: https://dblp.org/rec/conf/icassp/MizunoHKM23
Saki Mizuno, Nobukatsu Hojo, Satoshi Kobashikawa, Ryo Masumura:
Next-Speaker Prediction Based on Non-Verbal Information in Multi-Party Video Conversation. ICASSP 2023: 1-5
[c112]
persistent URL: https://dblp.org/rec/conf/icassp/MoriyaASMTM23
Takafumi Moriya, Takanori Ashihara, Hiroshi Sato, Kohei Matsuura, Tomohiro Tanaka, Ryo Masumura:
Improving Scheduled Sampling for Neural Transducer-Based ASR. ICASSP 2023: 1-5
[c111]
persistent URL: https://dblp.org/rec/conf/icassp/TanakaMISYAMM23
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Hiroshi Sato, Taiga Yamane, Takanori Ashihara, Kohei Matsuura, Takafumi Moriya:
Leveraging Language Embeddings for Cross-Lingual Self-Supervised Speech Representation Learning. ICASSP 2023: 1-5
[c110]
persistent URL: https://dblp.org/rec/conf/iccv/SuzukiYTKMAM23
Satoshi Suzuki, Shin'ya Yamaguchi, Shoichiro Takeda, Sekitoshi Kanai, Naoki Makishima, Atsushi Ando, Ryo Masumura:
Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff. ICCV 2023: 4367-4378
[c109]
persistent URL: https://dblp.org/rec/conf/icip/UchidaOTYM23
Mihiro Uchida, Shota Orihashi, Akihiko Takashima, Yoshihiro Yamazaki, Ryo Masumura:
Open-Set Recognition for Facial-Expression Recognition. ICIP 2023: 780-784
[c108]
persistent URL: https://dblp.org/rec/conf/icip/SuzukiYMSAM23
Satoshi Suzuki, Taiga Yamane, Naoki Makishima, Keita Suzuki, Atsushi Ando, Ryo Masumura:
OnDA-DETR: Online Domain Adaptation for Detection Transformers with Self-Training Framework. ICIP 2023: 1780-1785
[c107]
persistent URL: https://dblp.org/rec/conf/icip/OrihashiYUTM23
Shota Orihashi, Yoshihiro Yamazaki, Mihiro Uchida, Akihiko Takashima, Ryo Masumura:
Distilling Knowledge of Bidirectional Language Model for Scene Text Recognition. ICIP 2023: 2165-2169
[c106]
persistent URL: https://dblp.org/rec/conf/inlg/IhoriSTM23
Mana Ihori, Hiroshi Sato, Tomohiro Tanaka, Ryo Masumura:
Retrieval, Masking, and Generation: Feedback Comment Generation using Masked Comment Examples. INLG (Generation Challenges) 2023: 60-67
[c105]
persistent URL: https://dblp.org/rec/conf/interspeech/IhoriSTMMH23
Mana Ihori, Hiroshi Sato, Tomohiro Tanaka, Ryo Masumura, Saki Mizuno, Nobukatsu Hojo:
Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model. INTERSPEECH 2023: 461-465
[c104]
persistent URL: https://dblp.org/rec/conf/interspeech/SatoMODMASMITH23
Hiroshi Sato, Ryo Masumura, Tsubasa Ochiai, Marc Delcroix, Takafumi Moriya, Takanori Ashihara, Kentaro Shinayama, Saki Mizuno, Mana Ihori, Tomohiro Tanaka, Nobukatsu Hojo:
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss. INTERSPEECH 2023: 854-858
[c103]
persistent URL: https://dblp.org/rec/conf/interspeech/MoriyaSODAMTMOA23
Takafumi Moriya, Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takanori Ashihara, Kohei Matsuura, Tomohiro Tanaka, Ryo Masumura, Atsunori Ogawa, Taichi Asami:
Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data. INTERSPEECH 2023: 899-903
[c102]
persistent URL: https://dblp.org/rec/conf/interspeech/KitagishiTOMA23
Yuki Kitagishi, Naohiro Tawara, Atsunori Ogawa, Ryo Masumura, Taichi Asami:
What are differences? Comparing DNN and Human by Their Performance and Characteristics in Speaker Age Estimation. INTERSPEECH 2023: 1873-1877
[c101]
persistent URL: https://dblp.org/rec/conf/interspeech/HojoMKMIST23
Nobukatsu Hojo, Saki Mizuno, Satoshi Kobashikawa, Ryo Masumura, Mana Ihori, Hiroshi Sato, Tomohiro Tanaka:
Audio-Visual Praise Estimation for Conversational Video based on Synchronization-Guided Multimodal Transformer. INTERSPEECH 2023: 2663-2667
[c100]
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraMYYMIUS23
Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Target and Non-Target Speakers ASR. INTERSPEECH 2023: 2903-2907
[c99]
persistent URL: https://dblp.org/rec/conf/interspeech/MakishimaSSAM23
Naoki Makishima, Keita Suzuki, Satoshi Suzuki, Atsushi Ando, Ryo Masumura:
Joint Autoregressive Modeling of End-to-End Multi-Talker Overlapped Speech Recognition and Utterance-level Timestamp Prediction. INTERSPEECH 2023: 2913-2917
[c98]
persistent URL: https://dblp.org/rec/conf/mmasia/SuzukiSMAM23
Keita Suzuki, Satoshi Suzuki, Ryo Masumura, Atsushi Ando, Naoki Makishima:
Multi-region CNN-Transformer for Micro-gesture Recognition in Face and Upper Body. MMAsia 2023: 89:1-89:5
[i22]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-00978
Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Tomohiro Tanaka, Atsunori Ogawa, Marc Delcroix, Ryo Masumura:
Leveraging Large Text Corpora for End-to-End Speech Summarization. CoRR abs/2303.00978 (2023)
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-14723
Hiroshi Sato, Ryo Masumura, Tsubasa Ochiai, Marc Delcroix, Takafumi Moriya, Takanori Ashihara, Kentaro Shinayama, Saki Mizuno, Mana Ihori, Tomohiro Tanaka, Nobukatsu Hojo:
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss. CoRR abs/2305.14723 (2023)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-02273
Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Target and Non-Target Speakers ASR. CoRR abs/2306.02273 (2023)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-16454
Satoshi Suzuki, Shin'ya Yamaguchi, Shoichiro Takeda, Sekitoshi Kanai, Naoki Makishima, Atsushi Ando, Ryo Masumura:
Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff. CoRR abs/2308.16454 (2023)
2022
[j10]
persistent URL: https://dblp.org/rec/journals/access/SuzukiTMAMS22
Satoshi Suzuki, Shoichiro Takeda, Naoki Makishima, Atsushi Ando, Ryo Masumura, Hayaru Shouno:
Knowledge Transferred Fine-Tuning: Convolutional Neural Network Is Born Again With Anti-Aliasing Even in Data-Limited Situations. IEEE Access 10: 68384-68396 (2022)
[c97]
persistent URL: https://dblp.org/rec/conf/coling/IhoriSTM22
Mana Ihori, Hiroshi Sato, Tomohiro Tanaka, Ryo Masumura:
Multi-Perspective Document Revision. COLING 2022: 6128-6138
[c96]
persistent URL: https://dblp.org/rec/conf/icassp/MoriyaAASTMMDS22
Takafumi Moriya, Takanori Ashihara, Atsushi Ando, Hiroshi Sato, Tomohiro Tanaka, Kohei Matsuura, Ryo Masumura, Marc Delcroix, Takahiro Shinozaki:
Hybrid RNN-T/Attention-Based Streaming ASR with Triggered Chunkwise Attention and Dual Internal Language Model Integration. ICASSP 2022: 8282-8286
[c95]
persistent URL: https://dblp.org/rec/conf/icassp/AndoMMSMMAS22
Atsushi Ando, Yumiko Murata, Ryo Masumura, Satoshi Suzuki, Naoki Makishima, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato:
Customer Satisfaction Estimation Using Unsupervised Representation Learning with Multi-Format Prediction Loss. ICASSP 2022: 8497-8501
[c94]
persistent URL: https://dblp.org/rec/conf/icip/OrihashiYUTM22
Shota Orihashi, Yoshihiro Yamazaki, Mihiro Uchida, Akihiko Takashima, Ryo Masumura:
Fully Shareable Scene Text Recognition Modeling for Horizontal and Vertical Writing. ICIP 2022: 2636-2640
[c93]
persistent URL: https://dblp.org/rec/conf/interspeech/MakishimaSAM22
Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura:
Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data. INTERSPEECH 2022: 526-530
[c92]
persistent URL: https://dblp.org/rec/conf/interspeech/SatoODKMMITM22
Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Takafumi Moriya, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Ryo Masumura:
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations. INTERSPEECH 2022: 996-1000
[c91]
persistent URL: https://dblp.org/rec/conf/interspeech/TanakaMSIMAM22
Tomohiro Tanaka, Ryo Masumura, Hiroshi Sato, Mana Ihori, Kohei Matsuura, Takanori Ashihara, Takafumi Moriya:
Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks. INTERSPEECH 2022: 1066-1070
[c90]
persistent URL: https://dblp.org/rec/conf/interspeech/NiheiINNMFN22
Fumio Nihei, Ryo Ishii, Yukiko I. Nakano, Kyosuke Nishida, Ryo Masumura, Atsushi Fukayama, Takao Nakamura:
Dialogue Acts Aided Important Utterance Detection Based on Multiparty and Multimodal Information. INTERSPEECH 2022: 1086-1090
[c89]
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraYMMIUST22
Ryo Masumura, Yoshihiro Yamazaki, Saki Mizuno, Naoki Makishima, Mana Ihori, Mihiro Uchida, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Shota Orihashi, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training. INTERSPEECH 2022: 3218-3222
[c88]
persistent URL: https://dblp.org/rec/conf/interspeech/NakataKTSIMS22
Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis. INTERSPEECH 2022: 4551-4555
[c87]
persistent URL: https://dblp.org/rec/conf/interspeech/TakashimaMAYUO22
Akihiko Takashima, Ryo Masumura, Atsushi Ando, Yoshihiro Yamazaki, Mihiro Uchida, Shota Orihashi:
Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition. INTERSPEECH 2022: 4740-4744
[c86]
persistent URL: https://dblp.org/rec/conf/lrec/HojoKMM22
Nobukatsu Hojo, Satoshi Kobashikawa, Saki Mizuno, Ryo Masumura:
Multimodal Negotiation Corpus with Various Subjective Assessments for Social-Psychological Outcome Prediction from Non-Verbal Cues. LREC 2022: 6794-6801
[c85]
persistent URL: https://dblp.org/rec/conf/slt/AndoMTSMSMAS22
Atsushi Ando, Ryo Masumura, Akihiko Takashima, Satoshi Suzuki, Naoki Makishima, Keita Suzuki, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato:
On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis. SLT 2022: 739-746
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-09979
Yoshihiro Yamazaki, Shota Orihashi, Ryo Masumura, Mihiro Uchida, Akihiko Takashima:
Audio Visual Scene-Aware Dialog Generation with Transformer-based Video Representations. CoRR abs/2202.09979 (2022)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-08174
Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Takafumi Moriya, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Ryo Masumura:
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations. CoRR abs/2206.08174 (2022)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2207-04659
Naoki Makishima, Satoshi Suzuki, Atsushi Ando, Ryo Masumura:
Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data. CoRR abs/2207.04659 (2022)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-15937
Atsushi Ando, Ryo Masumura, Akihiko Takashima, Satoshi Suzuki, Naoki Makishima, Keita Suzuki, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato:
On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis. CoRR abs/2210.15937 (2022)
2021
[j9]
persistent URL: https://dblp.org/rec/journals/csl/TanakaMO21
Tomohiro Tanaka, Ryo Masumura, Takanobu Oba:
Neural candidate-aware language models for speech recognition. Comput. Speech Lang. 66: 101157 (2021)
[j8]
persistent URL: https://dblp.org/rec/journals/jip/MasumuraAOS21
Ryo Masumura, Taichi Asami, Takanobu Oba, Sumitaka Sakauchi:
Hierarchical Latent Words Language Models for Automatic Speech Recognition. J. Inf. Process. 29: 360-369 (2021)
[c84]
persistent URL: https://dblp.org/rec/conf/asru/OrihashiYMITTM21
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Hierarchical Knowledge Distillation for Dialogue Sequence Labeling. ASRU 2021: 433-440
[c83]
persistent URL: https://dblp.org/rec/conf/icassp/MoriyaATOSAIMS21
Takafumi Moriya, Takanori Ashihara, Tomohiro Tanaka, Tsubasa Ochiai, Hiroshi Sato, Atsushi Ando, Yusuke Ijima, Ryo Masumura, Yusuke Shinohara:
Simpleflat: A Simple Whole-Network Pre-Training Approach for RNN Transducer-Based End-to-End Speech Recognition. ICASSP 2021: 5664-5668
[c82]
persistent URL: https://dblp.org/rec/conf/icassp/MasumuraMITTO21
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation. ICASSP 2021: 5879-5883
[c81]
persistent URL: https://dblp.org/rec/conf/icassp/AndoMSMAIT21
Atsushi Ando, Ryo Masumura, Hiroshi Sato, Takafumi Moriya, Takanori Ashihara, Yusuke Ijima, Tomoki Toda:
Speech Emotion Recognition Based on Listener Adaptive Models. ICASSP 2021: 6274-6278
[c80]
persistent URL: https://dblp.org/rec/conf/icassp/MakishimaITTOM21
Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura:
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss. ICASSP 2021: 6673-6677
[c79]
persistent URL: https://dblp.org/rec/conf/icassp/IhoriMTTOM21
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training. ICASSP 2021: 7563-7567
[c78]
persistent URL: https://dblp.org/rec/conf/interspeech/MakishimaITTOM21
Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
Enrollment-Less Training for Personalized Voice Activity Detection. Interspeech 2021: 346-350
[c77]
persistent URL: https://dblp.org/rec/conf/interspeech/IhoriMTTOM21
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks Using Switching Tokens. Interspeech 2021: 776-780
[c76]
persistent URL: https://dblp.org/rec/conf/interspeech/MoriyaTAOSAMDA21
Takafumi Moriya, Tomohiro Tanaka, Takanori Ashihara, Tsubasa Ochiai, Hiroshi Sato, Atsushi Ando, Ryo Masumura, Marc Delcroix, Taichi Asami:
Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture. Interspeech 2021: 1787-1791
[c75]
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraOMITTO21
Ryo Masumura, Daiki Okamura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation. Interspeech 2021: 2591-2595
[c74]
persistent URL: https://dblp.org/rec/conf/interspeech/TanakaMITMAOM21
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Takanori Ashihara, Shota Orihashi, Naoki Makishima:
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition. Interspeech 2021: 4059-4063
[c73]
persistent URL: https://dblp.org/rec/conf/interspeech/TanakaMITOM21
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Shota Orihashi, Naoki Makishima:
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning. Interspeech 2021: 4458-4462
[c72]
persistent URL: https://dblp.org/rec/conf/mmasia/OrihashiYMITTM21
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages. MMAsia 2021: 41:1-41:5
[c71]
persistent URL: https://dblp.org/rec/conf/slt/MasumuraMITTO21
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Large-Context Conversational Representation Learning: Self-Supervised Learning For Conversational Documents. SLT 2021: 1012-1019
[c70]
persistent URL: https://dblp.org/rec/conf/ssw/NakataKTTIMS21
Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Naoko Tanji, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Audiobook Speech Synthesis Conditioned by Cross-Sentence Context-Aware Word Embeddings. SSW 2021: 211-215
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-07380
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
MAPGN: MAsked Pointer-Generator Network for sequence-to-sequence pre-training. CoRR abs/2102.07380 (2021)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-07935
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Hierarchical Transformer-based Large-Context End-to-end ASR with Large-Context Knowledge Distillation. CoRR abs/2102.07935 (2021)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-08147
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Large-Context Conversational Representation Learning: Self-Supervised Learning for Conversational Documents. CoRR abs/2102.08147 (2021)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-08154
Ryo Masumura, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Takanori Ashihara:
End-to-End Automatic Speech Recognition with Deep Mutual Learning. CoRR abs/2102.08154 (2021)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2103-01463
Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura:
Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss. CoRR abs/2103.01463 (2021)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-12131
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens. CoRR abs/2106.12131 (2021)
[i8]
dblp key: journals/corr/abs-2106-12132
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-12132
Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura:
Enrollment-less training for personalized voice activity detection. CoRR abs/2106.12132 (2021)
[i7]
dblp key: journals/corr/abs-2107-01549
persistent URL: https://dblp.org/rec/journals/corr/abs-2107-01549
Ryo Masumura, Daiki Okamura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation. CoRR abs/2107.01549 (2021)
[i6]
dblp key: journals/corr/abs-2107-01569
persistent URL: https://dblp.org/rec/journals/corr/abs-2107-01569
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Takanori Ashihara, Shota Orihashi, Naoki Makishima:
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition. CoRR abs/2107.01569 (2021)
[i5]
dblp key: journals/corr/abs-2107-05382
persistent URL: https://dblp.org/rec/journals/corr/abs-2107-05382
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Shota Orihashi, Naoki Makishima:
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning. CoRR abs/2107.05382 (2021)
[i4]
dblp key: journals/corr/abs-2111-10957
persistent URL: https://dblp.org/rec/journals/corr/abs-2111-10957
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Hierarchical Knowledge Distillation for Dialogue Sequence Labeling. CoRR abs/2111.10957 (2021)
[i3]
dblp key: journals/corr/abs-2111-12276
persistent URL: https://dblp.org/rec/journals/corr/abs-2111-12276
Shota Orihashi, Yoshihiro Yamazaki, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Ryo Masumura:
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages. CoRR abs/2111.12276 (2021)
2020
[j7]
dblp key: journals/taslp/AndoMKKAT20
persistent URL: https://dblp.org/rec/journals/taslp/AndoMKKAT20
Atsushi Ando, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono, Tomoki Toda:
Customer Satisfaction Estimation in Contact Center Calls Based on a Hierarchical Multi-Task Model. IEEE ACM Trans. Audio Speech Lang. Process. 28: 715-728 (2020)
[c69]
dblp key: conf/apsipa/ImaizumiMSK20
persistent URL: https://dblp.org/rec/conf/apsipa/ImaizumiMSK20
Ryo Imaizumi, Ryo Masumura, Sayaka Shiota, Hitoshi Kiya:
Dialect-Aware Modeling for End-to-End Japanese Dialect Speech Recognition. APSIPA 2020: 297-301
[c68]
dblp key: conf/apsipa/MasumuraITTA20
persistent URL: https://dblp.org/rec/conf/apsipa/MasumuraITTA20
Ryo Masumura, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Takanori Ashihara:
End-to-End Automatic Speech Recognition with Deep Mutual Learning. APSIPA 2020: 632-637
[c67]
dblp key: conf/apsipa/TakashimaMITOM20
persistent URL: https://dblp.org/rec/conf/apsipa/TakashimaMITOM20
Akihiko Takashima, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura:
Unsupervised Domain Adversarial Training in Angular Space for Facial Expression Recognition. APSIPA 2020: 1054-1059
[c66]
dblp key: conf/gcce/ImaizumiMSK20
persistent URL: https://dblp.org/rec/conf/gcce/ImaizumiMSK20
Ryo Imaizumi, Ryo Masumura, Sayaka Shiota, Hitoshi Kiya:
Sequence-To-One Neural Networks for Japanese Dialect Speech Classification. GCCE 2020: 933-935
[c65]
dblp key: conf/icassp/MoriyaSTAMS20
persistent URL: https://dblp.org/rec/conf/icassp/MoriyaSTAMS20
Takafumi Moriya, Hiroshi Sato, Tomohiro Tanaka, Takanori Ashihara, Ryo Masumura, Yusuke Shinohara:
Distilling Attention Weights for CTC-Based ASR Systems. ICASSP 2020: 6894-6898
[c64]
dblp key: conf/icassp/MasumuraITMAS20
persistent URL: https://dblp.org/rec/conf/icassp/MasumuraITMAS20
Ryo Masumura, Mana Ihori, Akihiko Takashima, Takafumi Moriya, Atsushi Ando, Yusuke Shinohara:
Sequence-Level Consistency Training for Semi-Supervised End-to-End Automatic Speech Recognition. ICASSP 2020: 7054-7058
[c63]
dblp key: conf/icassp/IhoriTM20
persistent URL: https://dblp.org/rec/conf/icassp/IhoriTM20
Mana Ihori, Akihiko Takashima, Ryo Masumura:
Large-Context Pointer-Generator Networks for Spoken-to-Written Style Conversion. ICASSP 2020: 8189-8193
[c62]
dblp key: conf/inlg/IhoriMMTTO20
persistent URL: https://dblp.org/rec/conf/inlg/IhoriMMTTO20
Mana Ihori, Ryo Masumura, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi:
Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model. INLG 2020: 1-6
[c61]
dblp key: conf/interspeech/MoriyaOKSTAMSD20
persistent URL: https://dblp.org/rec/conf/interspeech/MoriyaOKSTAMSD20
Takafumi Moriya, Tsubasa Ochiai, Shigeki Karita, Hiroshi Sato, Tomohiro Tanaka, Takanori Ashihara, Ryo Masumura, Yusuke Shinohara, Marc Delcroix:
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. INTERSPEECH 2020: 546-550
[c60]
dblp key: conf/interspeech/OrihashiITM20
persistent URL: https://dblp.org/rec/conf/interspeech/OrihashiITM20
Shota Orihashi, Mana Ihori, Tomohiro Tanaka, Ryo Masumura:
Unsupervised Domain Adaptation for Dialogue Sequence Labeling Based on Hierarchical Adversarial Training. INTERSPEECH 2020: 1575-1579
[c59]
dblp key: conf/interspeech/KoizumiMNYS20
persistent URL: https://dblp.org/rec/conf/interspeech/KoizumiMNYS20
Yuma Koizumi, Ryo Masumura, Kyosuke Nishida, Masahiro Yasuda, Shoichiro Saito:
A Transformer-Based Audio Captioning Model with Keyword Estimation. INTERSPEECH 2020: 1977-1981
[c58]
dblp key: conf/interspeech/MasumuraMITTO20
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraMITTO20
Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi:
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition. INTERSPEECH 2020: 2822-2826
[c57]
dblp key: conf/interspeech/YamashitaKSTIMS20
persistent URL: https://dblp.org/rec/conf/interspeech/YamashitaKSTIMS20
Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis. INTERSPEECH 2020: 3201-3205
[c56]
dblp key: conf/lrec/KodamaHMMANAK20
persistent URL: https://dblp.org/rec/conf/lrec/KodamaHMMANAK20
Takashi Kodama, Ryuichiro Higashinaka, Koh Mitsuda, Ryo Masumura, Yushi Aono, Ryuta Nakamura, Noritake Adachi, Hidetoshi Kawabata:
Generating Responses that Reflect Meta Information in User-Generated Question Answer Pairs. LREC 2020: 5433-5441
[c55]
dblp key: conf/lrec/IhoriTM20
persistent URL: https://dblp.org/rec/conf/lrec/IhoriTM20
Mana Ihori, Akihiko Takashima, Ryo Masumura:
Parallel Corpus for Japanese Spoken-to-Written Style Conversion. LREC 2020: 6346-6353
[c54]
dblp key: conf/lrec/YamashitaKSTIMS20
persistent URL: https://dblp.org/rec/conf/lrec/YamashitaKSTIMS20
Yuki Yamashita, Tomoki Koriyama, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Ryo Masumura, Hiroshi Saruwatari:
DNN-based Speech Synthesis Using Abundant Tags of Spontaneous Speech Corpus. LREC 2020: 6438-6443
[i2]
dblp key: journals/corr/abs-2007-00222
persistent URL: https://dblp.org/rec/journals/corr/abs-2007-00222
Yuma Koizumi, Ryo Masumura, Kyosuke Nishida, Masahiro Yasuda, Shoichiro Saito:
A Transformer-based Audio Captioning Model with Keyword Estimation. CoRR abs/2007.00222 (2020)
[i1]
dblp key: journals/corr/abs-2010-15437
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-15437
Mana Ihori, Ryo Masumura, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi:
Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model. CoRR abs/2010.15437 (2020)

2010 – 2019

2019
[j6]
dblp key: journals/csl/AsamiMAS19
persistent URL: https://dblp.org/rec/journals/csl/AsamiMAS19
Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda:
Recurrent out-of-vocabulary word detection based on distribution of features. Comput. Speech Lang. 58: 247-259 (2019)
[j5]
dblp key: journals/ieicet/MasumuraAOSI19
persistent URL: https://dblp.org/rec/journals/ieicet/MasumuraAOSI19
Ryo Masumura, Taichi Asami, Takanobu Oba, Sumitaka Sakauchi, Akinori Ito:
Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition. IEICE Trans. Inf. Syst. 102-D(12): 2557-2567 (2019)
[j4]
dblp key: journals/jip/MasumuraAOMS19
persistent URL: https://dblp.org/rec/journals/jip/MasumuraAOMS19
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi:
Viterbi Approximation of Latent Words Language Models for Automatic Speech Recognition. J. Inf. Process. 27: 168-176 (2019)
[c53]
dblp key: conf/apsipa/SatoMSMFMAYA19
persistent URL: https://dblp.org/rec/conf/apsipa/SatoMSMFMAYA19
Hiroshi Sato, Takafumi Moriya, Yusuke Shinohara, Ryo Masumura, Takaaki Fukutomi, Kiyoaki Matsui, Takanori Ashihara, Yoshikazu Yamaguchi, Yushi Aono:
Revisiting Dynamic Adjustment of Language Model Scaling Factor for Automatic Speech Recognition. APSIPA 2019: 186-191
[c52]
dblp key: conf/apsipa/MasumuraIKOA19
persistent URL: https://dblp.org/rec/conf/apsipa/MasumuraIKOA19
Ryo Masumura, Yusuke Ijima, Satoshi Kobashikawa, Takanobu Oba, Yushi Aono:
Can We Simulate Generative Process of Acoustic Modeling Data? Towards Data Restoration for Acoustic Modeling. APSIPA 2019: 655-661
[c51]
dblp key: conf/apsipa/KamiyamaAMKA19
persistent URL: https://dblp.org/rec/conf/apsipa/KamiyamaAMKA19
Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono:
Likability Estimation of Call-center Agents by Suppressing Annotator Variability. APSIPA 2019: 911-916
[c50]
dblp key: conf/apsipa/KamiyamaAMKA19a
persistent URL: https://dblp.org/rec/conf/apsipa/KamiyamaAMKA19a
Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono:
Urgent Voicemail Detection Focused on Long-term Temporal Variation. APSIPA 2019: 917-921
[c49]
dblp key: conf/apsipa/TanakaMMOA19
persistent URL: https://dblp.org/rec/conf/apsipa/TanakaMMOA19
Tomohiro Tanaka, Ryo Masumura, Takafumi Moriya, Takanobu Oba, Yushi Aono:
Disfluency Detection Based on Speech-Aware Token-by-Token Sequence Labeling with BLSTM-CRFs and Attention Mechanisms. APSIPA 2019: 1009-1013
[c48]
dblp key: conf/asru/MasumuraITSNO19
persistent URL: https://dblp.org/rec/conf/asru/MasumuraITSNO19
Ryo Masumura, Mana Ihori, Tomohiro Tanaka, Itsumi Saito, Kyosuke Nishida, Takanobu Oba:
Generalized Large-Context Language Models Based on Forward-Backward Hierarchical Recurrent Encoder-Decoder Models. ASRU 2019: 554-561
[c47]
dblp key: conf/asru/MasumuraITAIOH19
persistent URL: https://dblp.org/rec/conf/asru/MasumuraITAIOH19
Ryo Masumura, Mana Ihori, Tomohiro Tanaka, Atsushi Ando, Ryo Ishii, Takanobu Oba, Ryuichiro Higashinaka:
Improving Speech-Based End-of-Turn Detection Via Cross-Modal Representation Learning with Punctuated Text Data. ASRU 2019: 1062-1069
[c46]
dblp key: conf/eusipco/MasumuraMKFOA19
persistent URL: https://dblp.org/rec/conf/eusipco/MasumuraMKFOA19
Ryo Masumura, Kiyoaki Matsui, Yuma Koizumi, Takaaki Fukutomi, Takanobu Oba, Yushi Aono:
Context-Aware Neural Voice Activity Detection Using Auxiliary Networks for Phoneme Recognition, Speech Enhancement and Acoustic Scene Classification. EUSIPCO 2019: 1-5
[c45]
dblp key: conf/icassp/MasumuraTMSOA19
persistent URL: https://dblp.org/rec/conf/icassp/MasumuraTMSOA19
Ryo Masumura, Tomohiro Tanaka, Takafumi Moriya, Yusuke Shinohara, Takanobu Oba, Yushi Aono:
Large Context End-to-end Automatic Speech Recognition via Extension of Hierarchical Recurrent Encoder-decoder Models. ICASSP 2019: 5661-5665
[c44]
dblp key: conf/interspeech/MasumuraTAKOKA19
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraTAKOKA19
Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Hosana Kamiyama, Takanobu Oba, Satoshi Kobashikawa, Yushi Aono:
Improving Conversation-Context Language Models with Multiple Spoken Language Understanding Models. INTERSPEECH 2019: 834-838
[c43]
dblp key: conf/interspeech/MasumuraSTMIO19
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraSTMIO19
Ryo Masumura, Hiroshi Sato, Tomohiro Tanaka, Takafumi Moriya, Yusuke Ijima, Takanobu Oba:
End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders. INTERSPEECH 2019: 1606-1610
[c42]
dblp key: conf/interspeech/TanakaMMOA19
persistent URL: https://dblp.org/rec/conf/interspeech/TanakaMMOA19
Tomohiro Tanaka, Ryo Masumura, Takafumi Moriya, Takanobu Oba, Yushi Aono:
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge. INTERSPEECH 2019: 2210-2214
[c41]
dblp key: conf/interspeech/AndoMKKA19
persistent URL: https://dblp.org/rec/conf/interspeech/AndoMKKA19
Atsushi Ando, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono:
Speech Emotion Recognition Based on Multi-Label Emotion Existence Model. INTERSPEECH 2019: 2818-2822
[c40]
dblp key: conf/interspeech/MoriyaWTMSYA19
persistent URL: https://dblp.org/rec/conf/interspeech/MoriyaWTMSYA19
Takafumi Moriya, Jian Wang, Tomohiro Tanaka, Ryo Masumura, Yusuke Shinohara, Yoshikazu Yamaguchi, Yushi Aono:
Joint Maximization Decoder with Neural Converters for Fully Neural Network-Based Japanese Speech Recognition. INTERSPEECH 2019: 4410-4414
[c39]
dblp key: conf/slte/KobashikawaONME19
persistent URL: https://dblp.org/rec/conf/slte/KobashikawaONME19
Satoshi Kobashikawa, Atushi Odakura, Takao Nakamura, Takeshi Mori, Kimitaka Endo, Takafumi Moriya, Ryo Masumura, Yushi Aono, Nobuaki Minematsu:
Does Speaking Training Application with Speech Recognition Motivate Junior High School Students in Actual Classroom? - A Case Study. SLaTE 2019: 119-123
2018
[j3]
dblp key: journals/ieicet/MasumuraAOMSI18
persistent URL: https://dblp.org/rec/journals/ieicet/MasumuraAOMSI18
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition. IEICE Trans. Inf. Syst. 101-D(6): 1581-1590 (2018)
[c38]
dblp key: conf/apsipa/TanakaMMA18
persistent URL: https://dblp.org/rec/conf/apsipa/TanakaMMA18
Tomohiro Tanaka, Ryo Masumura, Takafumi Moriya, Yushi Aono:
Neural Speech-to-Text Language Models for Rescoring Hypotheses of DNN-HMM Hybrid Automatic Speech Recognition Systems. APSIPA 2018: 196-200
[c37]
dblp key: conf/apsipa/MasumuraYTAKA18
persistent URL: https://dblp.org/rec/conf/apsipa/MasumuraYTAKA18
Ryo Masumura, Setsuo Yamada, Tomohiro Tanaka, Atsushi Ando, Hosana Kamiyama, Yushi Aono:
Online Call Scene Segmentation of Contact Center Dialogues based on Role Aware Hierarchical LSTM-RNNs. APSIPA 2018: 811-815
[c36]
dblp key: conf/apsipa/MoriyaMASDYA18
persistent URL: https://dblp.org/rec/conf/apsipa/MoriyaMASDYA18
Takafumi Moriya, Ryo Masumura, Taichi Asami, Yusuke Shinohara, Marc Delcroix, Yoshikazu Yamaguchi, Yushi Aono:
Progressive Neural Network-based Knowledge Transfer in Acoustic Models. APSIPA 2018: 998-1002
[c35]
dblp key: conf/apsipa/MasumuraKMKYA18
persistent URL: https://dblp.org/rec/conf/apsipa/MasumuraKMKYA18
Ryo Masumura, Suguru Kabashima, Takafumi Moriya, Satoshi Kobashikawa, Yoshikazu Yamaguchi, Yushi Aono:
Relevant Phonetic-aware Neural Acoustic Models using Native English and Japanese Speech for Japanese-English Automatic Speech Recognition. APSIPA 2018: 1435-1439
[c34]
dblp key: conf/coling/MasumuraTHMA18
persistent URL: https://dblp.org/rec/conf/coling/MasumuraTHMA18
Ryo Masumura, Tomohiro Tanaka, Ryuichiro Higashinaka, Hirokazu Masataki, Yushi Aono:
Multi-task and Multi-lingual Joint Learning of Neural Lexical Utterance Classification based on Partially-shared Modeling. COLING 2018: 3586-3596
[c33]
dblp key: conf/emnlp/MasumuraSHA18
persistent URL: https://dblp.org/rec/conf/emnlp/MasumuraSHA18
Ryo Masumura, Yusuke Shinohara, Ryuichiro Higashinaka, Yushi Aono:
Adversarial Training for Multi-task and Multi-lingual Joint Modeling of Utterance Intent Classification. EMNLP 2018: 633-639
[c32]
dblp key: conf/icassp/AndoKKMIA18
persistent URL: https://dblp.org/rec/conf/icassp/AndoKKMIA18
Atsushi Ando, Satoshi Kobashikawa, Hosana Kamiyama, Ryo Masumura, Yusuke Ijima, Yushi Aono:
Soft-Target Training with Ambiguous Emotional Utterances for DNN-Based Speech Emotion Classification. ICASSP 2018: 4964-4968
[c31]
dblp key: conf/icassp/MasumuraIAMH18
persistent URL: https://dblp.org/rec/conf/icassp/MasumuraIAMH18
Ryo Masumura, Yusuke Ijima, Taichi Asami, Hirokazu Masataki, Ryuichiro Higashinaka:
Neural Confnet Classification: Fully Neural Network Based Spoken Utterance Classification Using Word Confusion Networks. ICASSP 2018: 6039-6043
[c30]
dblp key: conf/interspeech/TanakaMMA18
persistent URL: https://dblp.org/rec/conf/interspeech/TanakaMMA18
Tomohiro Tanaka, Ryo Masumura, Hirokazu Masataki, Yushi Aono:
Neural Error Corrective Language Models for Automatic Speech Recognition. INTERSPEECH 2018: 401-405
[c29]
dblp key: conf/interspeech/MasumuraTAMA18
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraTAMA18
Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Hirokazu Masataki, Yushi Aono:
Role Play Dialogue Aware Language Models Based on Conditional Hierarchical Recurrent Encoder-Decoder. INTERSPEECH 2018: 1259-1263
[c28]
dblp key: conf/interspeech/AndoAMKKA18
persistent URL: https://dblp.org/rec/conf/interspeech/AndoAMKKA18
Atsushi Ando, Reine Asakawa, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono:
Automatic Question Detection from Acoustic and Phonetic Features Using Feature-wise Pre-training. INTERSPEECH 2018: 1731-1735
[c27]
dblp key: conf/sigdial/MasumuraTAIHA18
persistent URL: https://dblp.org/rec/conf/sigdial/MasumuraTAIHA18
Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Ryo Ishii, Ryuichiro Higashinaka, Yushi Aono:
Neural Dialogue Context Online End-of-Turn Detection. SIGDIAL Conference 2018: 224-228
2017
[c26]
dblp key: conf/apsipa/MasumuraAMA17
persistent URL: https://dblp.org/rec/conf/apsipa/MasumuraAMA17
Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono:
Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling. APSIPA 2017: 1588-1591
[c25]
dblp key: conf/icassp/AsamiMYMA17
persistent URL: https://dblp.org/rec/conf/icassp/AsamiMYMA17
Taichi Asami, Ryo Masumura, Yoshikazu Yamaguchi, Hirokazu Masataki, Yushi Aono:
Domain adaptation of DNN acoustic models using knowledge distillation. ICASSP 2017: 5185-5189
[c24]
dblp key: conf/icassp/MasumuraAMA17
persistent URL: https://dblp.org/rec/conf/icassp/MasumuraAMA17
Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono:
Parallel phonetically aware DNNs and LSTM-RNNS for frame-by-frame discriminative modeling of spoken language identification. ICASSP 2017: 5260-5264
[c23]
dblp key: conf/ijcnlp/MasumuraAMSNH17
persistent URL: https://dblp.org/rec/conf/ijcnlp/MasumuraAMSNH17
Ryo Masumura, Taichi Asami, Hirokazu Masataki, Kugatsu Sadamitsu, Kyosuke Nishida, Ryuichiro Higashinaka:
Hyperspherical Query Likelihood Models with Word Embeddings. IJCNLP(2) 2017: 210-216
[c22]
dblp key: conf/ijcnlp/SaitoSNSKMMT17
persistent URL: https://dblp.org/rec/conf/ijcnlp/SaitoSNSKMMT17
Itsumi Saito, Jun Suzuki, Kyosuke Nishida, Kugatsu Sadamitsu, Satoshi Kobashikawa, Ryo Masumura, Yuji Matsumoto, Junji Tomita:
Improving Neural Text Normalization with Data Augmentation at Character- and Morphological Levels. IJCNLP(2) 2017: 257-262
[c21]
dblp key: conf/interspeech/IjimaHMA17
persistent URL: https://dblp.org/rec/conf/interspeech/IjimaHMA17
Yusuke Ijima, Nobukatsu Hojo, Ryo Masumura, Taichi Asami:
Prosody Aware Word-Level Encoder Based on BLSTM-RNNs for DNN-Based Speech Synthesis. INTERSPEECH 2017: 764-768
[c20]
dblp key: conf/interspeech/MasumuraAMIH17
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraAMIH17
Ryo Masumura, Taichi Asami, Hirokazu Masataki, Ryo Ishii, Ryuichiro Higashinaka:
Online End-of-Turn Detection from Speech Based on Stacked Time-Asynchronous Sequential Networks. INTERSPEECH 2017: 1661-1665
[c19]
dblp key: conf/interspeech/AndoMKKA17
persistent URL: https://dblp.org/rec/conf/interspeech/AndoMKKA17
Atsushi Ando, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono:
Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls. INTERSPEECH 2017: 1716-1720
[c18]
dblp key: conf/interspeech/SawadaMN17
persistent URL: https://dblp.org/rec/conf/interspeech/SawadaMN17
Naoki Sawada, Ryo Masumura, Hiromitsu Nishizaki:
Parallel Hierarchical Attention Networks with Shared Memory Reader for Multi-Stream Conversational Document Classification. INTERSPEECH 2017: 3311-3315
2016
[j2]
dblp key: journals/ieicet/MasumuraAOMSI16
persistent URL: https://dblp.org/rec/journals/ieicet/MasumuraAOMSI16
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Investigation of Combining Various Major Language Model Technologies including Data Expansion and Adaptation. IEICE Trans. Inf. Syst. 99-D(10): 2452-2461 (2016)
[j1]
dblp key: journals/ieicet/MasumuraAOMST16
persistent URL: https://dblp.org/rec/journals/ieicet/MasumuraAOMST16
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Satoshi Takahashi:
N-gram Approximation of Latent Words Language Models for Domain Robust Automatic Speech Recognition. IEICE Trans. Inf. Syst. 99-D(10): 2462-2470 (2016)
[c17]
dblp key: conf/humanoids/KaminagaKYSMKIM16
persistent URL: https://dblp.org/rec/conf/humanoids/KaminagaKYSMKIM16
Hiroshi Kaminaga, Tianyi Ko, Satoshi Yorita, Shunsuke Sato, Ryo Masumura, Mitsuo Komagata, Tatsuya Ishikawa, Taira Miyatake, Yoshihiko Nakamura:
Enhancement of mechanical strength, computational power, and heat management for fieldwork humanoid robots. Humanoids 2016: 786-793
[c16]
dblp key: conf/interspeech/AsamiMAS16
persistent URL: https://dblp.org/rec/conf/interspeech/AsamiMAS16
Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda:
Recurrent Out-of-Vocabulary Word Detection Using Distribution of Features. INTERSPEECH 2016: 1320-1324
[c15]
dblp key: conf/interspeech/MasumuraAMAS16
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraAMAS16
Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono, Sumitaka Sakauchi:
Language Identification Based on Generative Modeling of Posteriorgram Sequences Extracted from Frame-by-Frame DNNs and LSTM-RNNs. INTERSPEECH 2016: 3275-3279
[c14]
dblp key: conf/iser/KaminagaKMKSYN16
persistent URL: https://dblp.org/rec/conf/iser/KaminagaKMKSYN16
Hiroshi Kaminaga, Tianyi Ko, Ryo Masumura, Mitsuo Komagata, Shunsuke Sato, Satoshi Yorita, Yoshihiko Nakamura:
Mechanism and Control of Whole-Body Electro-Hydrostatic Actuator Driven Humanoid Robot Hydra. ISER 2016: 656-665
2015
[c13]
dblp key: conf/emnlp/MasumuraAOMSI15
persistent URL: https://dblp.org/rec/conf/emnlp/MasumuraAOMSI15
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Hierarchical Latent Words Language Models for Robust Modeling to Out-Of Domain Tasks. EMNLP 2015: 1896-1901
[c12]
dblp key: conf/interspeech/MasumuraAOMSI15
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraAOMSI15
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition. INTERSPEECH 2015: 463-467
[c11]
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraAOMSI15a
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Latent words recurrent neural network language models. INTERSPEECH 2015: 2380-2384
[c10]
persistent URL: https://dblp.org/rec/conf/interspeech/AsamiMMOS15
Taichi Asami, Ryo Masumura, Hirokazu Masataki, Manabu Okamoto, Sumitaka Sakauchi:
Training data selection for acoustic modeling via submodular optimization of joint Kullback-Leibler divergence. INTERSPEECH 2015: 3645-3649
[c9]
persistent URL: https://dblp.org/rec/conf/paclic/OtsukaHMMHMM15
Atsushi Otsuka, Toru Hirano, Chiaki Miyazaki, Ryo Masumura, Ryuichiro Higashinaka, Toshiro Makino, Yoshihiro Matsuo:
Discourse Relation Recognition by Comparing Various Units of Sentence Expression with Recursive Neural Network. PACLIC 2015
2014
[c8]
persistent URL: https://dblp.org/rec/conf/icassp/MasumuraOMYT14
Ryo Masumura, Takanobu Oba, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi:
Role play dialogue topic model for language model adaptation in multi-party conversation speech recognition. ICASSP 2014: 4873-4877
[c7]
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraAOMS14
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi:
Mixture of latent words language models for domain adaptation. INTERSPEECH 2014: 1425-1429
[c6]
persistent URL: https://dblp.org/rec/conf/interspeech/AsamiMMS14
Taichi Asami, Ryo Masumura, Hirokazu Masataki, Sumitaka Sakauchi:
Read and spontaneous speech classification based on variance of GMM supervectors. INTERSPEECH 2014: 2375-2379
2013
[c5]
persistent URL: https://dblp.org/rec/conf/icassp/MasumuraMOYT13
Ryo Masumura, Hirokazu Masataki, Takanobu Oba, Osamu Yoshioka, Satoshi Takahashi:
Use of latent words language models in ASR: A sampling-based implementation. ICASSP 2013: 8445-8449
[c4]
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraMOYT13
Ryo Masumura, Hirokazu Masataki, Takanobu Oba, Osamu Yoshioka, Satoshi Takahashi:
Viterbi decoding for latent words language models using Gibbs sampling. INTERSPEECH 2013: 3429-3433
2011
[c3]
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraHI11
Ryo Masumura, Seongjun Hahm, Akinori Ito:
Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition. INTERSPEECH 2011: 1465-1468
[c2]
persistent URL: https://dblp.org/rec/conf/interspeech/MasumuraHI11a
Ryo Masumura, Seongjun Hahm, Akinori Ito:
Language Model Expansion Using Webdata for Spoken Document Retrieval. INTERSPEECH 2011: 2133-2136
2010
[c1]
persistent URL: https://dblp.org/rec/conf/nlpke/MasumuraIUIM10
Ryo Masumura, Akinori Ito, Yu Uno, Masashi Ito, Shozo Makino:
Document expansion using relevant web documents for spoken document retrieval. NLPKE 2010: 1-8
