


Masato Mimura
2020 – today
- 2024
- [j8] Hao Shi, Masato Mimura, Tatsuya Kawahara:
Waveform-Domain Speech Enhancement Using Spectrogram Encoding for Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3049-3060 (2024)
- [i17] Hiroshi Sato, Takafumi Moriya, Masato Mimura, Shota Horiguchi, Tsubasa Ochiai, Takanori Ashihara, Atsushi Ando, Kentaro Shinayama, Marc Delcroix:
SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling. CoRR abs/2407.01857 (2024)
- [i16] Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Masato Mimura, Takatomo Kano, Atsunori Ogawa, Marc Delcroix:
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation. CoRR abs/2408.00205 (2024)
- [i15] Takafumi Moriya, Shota Horiguchi, Marc Delcroix, Ryo Masumura, Takanori Ashihara, Hiroshi Sato, Kohei Matsuura, Masato Mimura:
Alignment-Free Training for Transducer-based Multi-Talker ASR. CoRR abs/2409.20301 (2024)
- [i14] Takafumi Moriya, Takanori Ashihara, Masato Mimura, Hiroshi Sato, Kohei Matsuura, Ryo Masumura, Taichi Asami:
Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding. CoRR abs/2409.20313 (2024)
- 2023
- [c40] Hao Shi, Masato Mimura, Longbiao Wang, Jianwu Dang, Tatsuya Kawahara:
Time-Domain Speech Enhancement Assisted by Multi-Resolution Frequency Encoder and Decoder. ICASSP 2023: 1-5
- [c39] Jaeyoung Lee, Masato Mimura, Tatsuya Kawahara:
Embedding Articulatory Constraints for Low-resource Speech Recognition Based on Large Pre-trained Model. INTERSPEECH 2023: 1394-1398
- [i13] Hao Shi, Masato Mimura, Longbiao Wang, Jianwu Dang, Tatsuya Kawahara:
Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder. CoRR abs/2303.14593 (2023)
- 2022
- [j7] Sebastian M. Cioaba, Jack H. Koolen, Masato Mimura, Hiroshi Nozaki, Takayuki Okuda:
On the spectrum and linear programming bound for hypergraphs. Eur. J. Comb. 104: 103535 (2022)
- [c38] Heran Zhang, Masato Mimura, Tatsuya Kawahara, Kenkichi Ishizuka:
Selective Multi-Task Learning For Speech Emotion Recognition Using Corpora Of Different Styles. ICASSP 2022: 7707-7711
- [c37] Soky Kak, Sheng Li, Masato Mimura, Chenhui Chu, Tatsuya Kawahara:
Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism. INTERSPEECH 2022: 1362-1366
- [c36] Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM. INTERSPEECH 2022: 3889-3893
- [i12] Hayato Futami, Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Distilling the Knowledge of BERT for CTC-based ASR. CoRR abs/2209.02030 (2022)
- [i11] Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM. CoRR abs/2209.04062 (2022)
- 2021
- [j6] Masato Mimura, Norihide Tokushige:
Solving linear equations in a vector space over a finite field. Discret. Math. 344(12): 112603 (2021)
- [j5] Soky Kak, Masato Mimura, Tatsuya Kawahara, Chenhui Chu, Sheng Li, Chenchen Ding, Sethserey Sam:
TriECCC: Trilingual Corpus of the Extraordinary Chambers in the Courts of Cambodia for Speech Recognition and Translation Studies. Int. J. Asian Lang. Process. 31(3&4): 2250007:1-2250007:21 (2021)
- [c35] Soky Kak, Sheng Li, Masato Mimura, Chenhui Chu, Tatsuya Kawahara:
On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora. APSIPA ASC 2021: 433-437
- [c34] Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
An End-To-End Model from Speech to Clean Transcript for Parliamentary Meetings. APSIPA ASC 2021: 465-470
- [c33] Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Data Augmentation for ASR Using TTS Via a Discrete Representation. ASRU 2021: 68-75
- [c32] Hayato Futami, Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
ASR Rescoring and Confidence Estimation with Electra. ASRU 2021: 380-387
- [c31] Soky Kak, Masato Mimura, Tatsuya Kawahara, Sheng Li, Chenchen Ding, Chenhui Chu, Sethserey Sam:
Khmer Speech Translation Corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC). O-COCOSDA 2021: 122-127
- [i10] Hayato Futami, Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
ASR Rescoring and Confidence Estimation with ELECTRA. CoRR abs/2110.01857 (2021)
- 2020
- [c30] Jeongwoo Woo, Masato Mimura, Kazuyoshi Yoshii, Tatsuya Kawahara:
End-to-end Music-mixed Speech Recognition. APSIPA 2020: 800-804
- [c29] Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
CTC-Synchronous Training for Monotonic Attention Model. INTERSPEECH 2020: 571-575
- [c28] Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
Enhancing Monotonic Multihead Attention for Streaming ASR. INTERSPEECH 2020: 2137-2141
- [c27] Kohei Matsuura, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition. INTERSPEECH 2020: 2737-2741
- [c26] Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR. INTERSPEECH 2020: 3635-3639
- [c25] Kohei Matsuura, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language. LREC 2020: 2622-2628
- [i9] Kohei Matsuura, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language. CoRR abs/2002.06675 (2020)
- [i8] Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
CTC-synchronous Training for Monotonic Attention Model. CoRR abs/2005.04712 (2020)
- [i7] Kohei Matsuura, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition. CoRR abs/2005.09256 (2020)
- [i6] Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
Enhancing Monotonic Multihead Attention for Streaming ASR. CoRR abs/2005.09394 (2020)
- [i5] Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR. CoRR abs/2008.03822 (2020)
- [i4] Sebastian M. Cioaba, Jack H. Koolen, Masato Mimura, Hiroshi Nozaki, Takayuki Okuda:
On the spectrum and linear programming bound for hypergraphs. CoRR abs/2009.03022 (2020)
2010 – 2019
- 2019
- [j4] Masato Mimura:
Amenability versus non-exactness of dense subgroups of a compact group. J. Lond. Math. Soc. 100(2): 592-622 (2019)
- [j3] Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(5): 960-971 (2019)
- [c24] Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Multi-speaker Sequence-to-sequence Speech Synthesis for Data Augmentation in Acoustic-to-word Speech Recognition. ICASSP 2019: 6161-6165
- [i3] Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition. CoRR abs/1903.09341 (2019)
- [i2] Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR. CoRR abs/1909.09993 (2019)
- 2018
- [c23] Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization. ICASSP 2018: 716-720
- [c22] Kazuki Shimada, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition. ICASSP 2018: 5734-5738
- [c21] Sei Ueno, Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
Acoustic-to-Word Attention-Based Model Complemented with Character-Level CTC-Based Model. ICASSP 2018: 5804-5808
- [c20] Hirofumi Inaguma, Masato Mimura, Koji Inoue, Kazuyoshi Yoshii, Tatsuya Kawahara:
An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition. ICASSP 2018: 6214-6218
- [c19] Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Forward-Backward Attention Decoder. INTERSPEECH 2018: 2232-2236
- [c18] Sei Ueno, Takafumi Moriya, Masato Mimura, Shinsuke Sakai, Yusuke Shinohara, Yoshikazu Yamaguchi, Yushi Aono, Tatsuya Kawahara:
Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition. INTERSPEECH 2018: 2424-2428
- [c17] Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR. SLT 2018: 212-218
- [c16] Masato Mimura, Sei Ueno, Hirofumi Inaguma, Shinsuke Sakai, Tatsuya Kawahara:
Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition. SLT 2018: 477-484
- 2017
- [c15] Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Cross-domain speech recognition using nonparallel corpora with cycle-consistent adversarial networks. ASRU 2017: 134-140
- [c14] Sheng Li, Xugang Lu, Shinsuke Sakai, Masato Mimura, Tatsuya Kawahara:
Semi-supervised ensemble DNN acoustic model training. ICASSP 2017: 5270-5274
- [c13] Hirofumi Inaguma, Koji Inoue, Masato Mimura, Tatsuya Kawahara:
Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC. INTERSPEECH 2017: 1691-1695
- [c12] Masato Mimura, Yoshiaki Bando, Kazuki Shimada, Shinsuke Sakai, Kazuyoshi Yoshii, Tatsuya Kawahara:
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition. INTERSPEECH 2017: 2451-2455
- [c11] Masaya Wake, Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Semi-Blind speech enhancement based on recurrent neural network for source separation and dereverberation. MLSP 2017: 1-6
- [i1] Yoshiaki Bando, Masato Mimura, Katsutoshi Itoyama, Kazuyoshi Yoshii, Tatsuya Kawahara:
Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization. CoRR abs/1710.11439 (2017)
- 2016
- [c10] Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Joint Optimization of Denoising Autoencoder and DNN Acoustic Model Based on Multi-Target Learning for Noisy Speech Recognition. INTERSPEECH 2016: 3803-3807
- 2015
- [j2] Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature. EURASIP J. Adv. Signal Process. 2015: 62 (2015)
- [c9] Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Deep autoencoders augmented with phone-class feature for reverberant speech recognition. ICASSP 2015: 4365-4369
- [c8] Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Speech dereverberation using long short-term memory. INTERSPEECH 2015: 2435-2439
- 2014
- [c7] Masato Mimura, Tatsuya Kawahara:
Unsupervised speaker adaptation of DNN-HMM by selecting similar speakers for lecture transcription. APSIPA 2014: 1-4
- [c6] Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Exploring deep neural networks and deep autoencoders in reverberant speech recognition. HSCMA 2014: 197-201
- 2012
- [j1] Graham Neubig, Masato Mimura, Shinsuke Mori, Tatsuya Kawahara:
Bayesian Learning of a Language Model from Continuous Speech. IEICE Trans. Inf. Syst. 95-D(2): 614-625 (2012)
- 2010
- [c5] Yuya Akita, Masato Mimura, Graham Neubig, Tatsuya Kawahara:
Semi-automated update of automatic transcription system for the Japanese national congress. INTERSPEECH 2010: 338-341
- [c4] Graham Neubig, Masato Mimura, Shinsuke Mori, Tatsuya Kawahara:
Learning a language model from continuous speech. INTERSPEECH 2010: 1053-1056
2000 – 2009
- 2009
- [c3] Tatsuya Kawahara, Masato Mimura, Yuya Akita:
Language model transformation applied to lightly supervised training of acoustic model for congress meetings. ICASSP 2009: 3853-3856
- [c2] Yuya Akita, Masato Mimura, Tatsuya Kawahara:
Automatic transcription system for meetings of the Japanese national congress. INTERSPEECH 2009: 84-87
- 2002
- [c1] Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Masato Mimura, Atsushi Yamada, Akinori Ito, Katsunobu Itou, Kiyohiro Shikano:
Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models. LREC 2002
last updated on 2025-01-21 00:02 CET by the dblp team
all metadata released as open data under CC0 1.0 license