default search action
Yoshiki Masuyama
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j9]Yoshiki Masuyama, Kouei Yamaoka, Yuma Kinoshita, Taishi Nakashima, Nobutaka Ono:
Causal and Relaxed-Distortionless Response Beamforming for Online Target Source Extraction. IEEE ACM Trans. Audio Speech Lang. Process. 32: 310-324 (2024) - [j8]Yoshiki Masuyama, Kouei Yamaoka, Takao Kawamura, Nobutaka Ono:
Efficient Joint Optimization of Sampling Rate Offsets Using Entire Multichannel Signal. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1816-1828 (2024) - [c25]Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. ICASSP 2024: 1016-1020 - [i23]Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. CoRR abs/2402.17907 (2024) - [i22]Koichi Miyazaki, Yoshiki Masuyama, Masato Murata:
Exploring the Capability of Mamba in Speech Applications. CoRR abs/2406.16808 (2024) - [i21]Jiatong Shi, Jinchuan Tian, Yihan Wu, Jee-weon Jung, Jia Qi Yip, Yoshiki Masuyama, William Chen, Yuning Wu, Yuxun Tang, Massa Baali, Dareen Alharthi, Dong Zhang, Ruifan Deng, Tejes Srivastava, Haibin Wu, Alexander H. Liu, Bhiksha Raj, Qin Jin, Ruihua Song, Shinji Watanabe:
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech. CoRR abs/2409.15897 (2024) - 2023
- [j7]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing. J. Open Source Softw. 8(91): 5403 (2023) - [j6]Yoshiki Masuyama, Kohei Yatabe, Kento Nagatomo, Yasuhiro Oikawa:
Online Phase Reconstruction via DNN-Based Phase Differences Estimation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 163-176 (2023) - [c24]Kenta Yamada, Yoshiki Masuyama, Kouei Yamaoka, Nobutaka Ono:
Fundamental Frequency Estimation Based on Finite-Order Harmonic Constraint Differential Equation. APSIPA ASC 2023: 868-872 - [c23]Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction. ASRU 2023: 1-8 - [c22]Yoshiaki Bando, Yoshiki Masuyama, Aditya Arie Nugraha, Kazuyoshi Yoshii:
Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation. EUSIPCO 2023: 51-55 - [c21]Samuele Cornell, Zhong-Qiu Wang, Yoshiki Masuyama, Shinji Watanabe, Manuel Pariente, Nobutaka Ono, Stefano Squartini:
Multi-Channel Speaker Extraction with Adversarial Training: The Wavlab Submission to The Clarity ICASSP 2023 Grand Challenge. ICASSP 2023: 1-2 - [c20]Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation. WASPAA 2023: 1-5 - [c19]Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono:
Signal Reconstruction from Mel-Spectrogram Based on Bi-Level Consistency of Full-Band Magnitude and Phase. WASPAA 2023: 1-5 - [d1]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing (espnet-v.202310). Zenodo, 2023 - [i20]Samuele Cornell, Zhong-Qiu Wang, Yoshiki Masuyama, Shinji Watanabe, Manuel Pariente, Nobutaka Ono:
Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge. CoRR abs/2302.07928 (2023) - [i19]Yoshiaki Bando, Yoshiki Masuyama, Aditya Arie Nugraha, Kazuyoshi Yoshii:
Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation. CoRR abs/2306.10240 (2023) - [i18]Samuele Cornell, Matthew Wiesner, Shinji Watanabe, Desh Raj, Xuankai Chang, Paola García, Yoshiki Masuyama, Zhong-Qiu Wang, Stefano Squartini, Sanjeev Khudanpur:
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios. CoRR abs/2306.13734 (2023) - [i17]Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation. CoRR abs/2307.12231 (2023) - [i16]Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono:
Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase. CoRR abs/2307.12232 (2023) - [i15]Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction. CoRR abs/2310.19644 (2023) - 2022
- [c18]Yoshiki Masuyama, Kouei Yamaoka, Nobutaka Ono:
Joint Optimization of Sampling Rate Offsets Based on Entire Signal Relationship Among Distributed Microphones. INTERSPEECH 2022: 704-708 - [c17]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. INTERSPEECH 2022: 5458-5462 - [c16]Yoshiki Masuyama, Xuankai Chang, Samuele Cornell, Shinji Watanabe, Nobutaka Ono:
End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation. SLT 2022: 260-265 - [i14]Yoshiki Masuyama, Kouei Yamaoka, Nobutaka Ono:
Joint Optimization of Sampling Rate Offsets Based on Entire Signal Relationship Among Distributed Microphones. CoRR abs/2206.13014 (2022) - [i13]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. CoRR abs/2207.09514 (2022) - [i12]Yoshiki Masuyama, Xuankai Chang, Samuele Cornell, Shinji Watanabe, Nobutaka Ono:
End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation. CoRR abs/2210.10742 (2022) - [i11]Yoshiki Masuyama, Kohei Yatabe, Kento Nagatomo, Yasuhiro Oikawa:
Online Phase Reconstruction via DNN-based Phase Differences Estimation. CoRR abs/2211.08246 (2022) - 2021
- [j5]Yoshiaki Bando, Yoshiki Masuyama, Yoko Sasaki, Masaki Onishi:
Robust Auditory Functions Based on Probabilistic Integration of MUSIC and CGMM. IEEE Access 9: 38718-38730 (2021) - [j4]Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
Deep Griffin-Lim Iteration: Trainable Iterative Phase Reconstruction Using Neural Network. IEEE J. Sel. Top. Signal Process. 15(1): 37-50 (2021) - [j3]Yoshiaki Bando, Kouhei Sekiguchi, Yoshiki Masuyama, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii:
Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation. IEEE Signal Process. Lett. 28: 1670-1674 (2021) - [c15]Yoshiki Masuyama, Kouei Yamaoka, Yuma Kinoshita, Nobutaka Ono:
Causal Distortionless Response Beamforming by Alternating Direction Method of Multipliers. APSIPA ASC 2021: 585-590 - [c14]Yoshiki Masuyama, Tomoro Tanaka, Kohei Yatabe, Tsubasa Kusano, Yasuhiro Oikawa:
Simultaneous Declipping and Beamforming via Alternating Direction Method of Multipliers. EUSIPCO 2021: 316-320 - 2020
- [j2]Yoshiki Masuyama, Kohei Yatabe, Kento Nagatomo, Yasuhiro Oikawa:
Joint Amplitude and Phase Refinement for Monaural Source Separation. IEEE Signal Process. Lett. 27: 1939-1943 (2020) - [c13]Masahito Togami, Yoshiki Masuyama, Tatsuya Komatsu, Kazuyoshi Yoshii, Tatsuya Kawahara:
Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers. APSIPA 2020: 788-793 - [c12]Masahito Togami, Yoshiki Masuyama, Tatsuya Komatsu, Yu Nakagome:
Unsupervised Training for Deep Speech Source Separation with Kullback-Leibler Divergence Based Probabilistic Loss Function. ICASSP 2020: 56-60 - [c11]Yuma Koizumi, Kohei Yatabe, Marc Delcroix, Yoshiki Masuyama, Daiki Takeuchi:
Speech Enhancement Using Self-Adaptation and Multi-Head Self-Attention. ICASSP 2020: 181-185 - [c10]Yoshiki Masuyama, Masahito Togami, Tatsuya Komatsu:
Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks. ICASSP 2020: 821-825 - [c9]Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
Phase Reconstruction Based On Recurrent Phase Unwrapping With Deep Neural Networks. ICASSP 2020: 826-830 - [c8]Yoshiki Masuyama, Yoshiaki Bando, Kohei Yatabe, Yoko Sasaki, Masaki Onishi, Yasuhiro Oikawa:
Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling. IROS 2020: 4848-4854 - [i10]Yoshiki Masuyama, Masahito Togami, Tatsuya Komatsu:
Consistency-aware multi-channel speech enhancement using deep neural networks. CoRR abs/2002.05831 (2020) - [i9]Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
Phase reconstruction based on recurrent phase unwrapping with deep neural networks. CoRR abs/2002.05832 (2020) - [i8]Yuma Koizumi, Kohei Yatabe, Marc Delcroix, Yoshiki Masuyama, Daiki Takeuchi:
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention. CoRR abs/2002.05873 (2020) - [i7]Yoshiki Masuyama, Yoshiaki Bando, Kohei Yatabe, Yoko Sasaki, Masaki Onishi, Yasuhiro Oikawa:
Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling. CoRR abs/2007.13976 (2020)
2010 – 2019
- 2019
- [j1]Yoshiki Masuyama, Kohei Yatabe, Yasuhiro Oikawa:
Griffin-Lim Like Phase Recovery via Alternating Direction Method of Multipliers. IEEE Signal Process. Lett. 26(1): 184-188 (2019) - [c7]Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
Deep Griffin-Lim Iteration. ICASSP 2019: 61-65 - [c6]Yoshiki Masuyama, Kohei Yatabe, Yasuhiro Oikawa:
Low-rankness of Complex-valued Spectrogram and Its Application to Phase-aware Audio Processing. ICASSP 2019: 855-859 - [c5]Yoshiki Masuyama, Kohei Yatabe, Yasuhiro Oikawa:
Phase-aware Harmonic/percussive Source Separation via Convex Optimization. ICASSP 2019: 985-989 - [c4]Yoshiki Masuyama, Masahito Togami, Tatsuya Komatsu:
Multichannel Loss Function for Supervised Speech Source Separation by Mask-Based Beamforming. INTERSPEECH 2019: 2708-2712 - [i6]Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
Deep Griffin-Lim Iteration. CoRR abs/1903.03971 (2019) - [i5]Yoshiki Masuyama, Kohei Yatabe, Yasuhiro Oikawa:
Phase-aware Harmonic/Percussive Source Separation via Convex Optimization. CoRR abs/1903.05600 (2019) - [i4]Yoshiki Masuyama, Kohei Yatabe, Yasuhiro Oikawa:
Low-rankness of Complex-valued Spectrogram and Its Application to Phase-aware Audio Processing. CoRR abs/1903.05603 (2019) - [i3]Yoshiki Masuyama, Masahito Togami, Tatsuya Komatsu:
Multichannel Loss Function for Supervised Speech Source Separation by Mask-based Beamforming. CoRR abs/1907.04984 (2019) - [i2]Masahito Togami, Yoshiki Masuyama, Tatsuya Komatsu, Yu Nakagome:
Unsupervised Training for Deep Speech Source Separation with Kullback-Leibler Divergence Based Probabilistic Loss Function. CoRR abs/1911.04228 (2019) - 2018
- [c3]Yoshiki Masuyama, Tsubasa Kusano, Kohei Yatabe, Yasuhiro Oikawa:
Modal Decomposition of Musical Instrument Sound Via Alternating Direction Method of Multipliers. ICASSP 2018: 631-635 - [c2]Yoshiki Masuyama, Kohei Yatabe, Yasuhiro Oikawa:
Model-Based Phase Recovery of Spectrograms via Optimization on Riemannian Manifolds. IWAENC 2018: 126-130 - [c1]Kohei Yatabe, Yoshiki Masuyama, Yasuhiro Oikawa:
Rectified Linear Unit Can Assist Griffin-Lim Phase Recovery. IWAENC 2018: 555-559 - [i1]Tsubasa Kusano, Yoshiki Masuyama, Kohei Yatabe, Yasuhiro Oikawa:
Designing nearly tight window for improving time-frequency masking. CoRR abs/1811.08783 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 22:02 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint