


default search action
Keisuke Kinoshita
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j31]Tetsuya Ueda
, Tomohiro Nakatani
, Rintaro Ikeshita
, Keisuke Kinoshita
, Shoko Araki
, Shoji Makino
:
Blind and Spatially-Regularized Online Joint Optimization of Source Separation, Dereverberation, and Noise Reduction. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1157-1172 (2024) - [i44]Se Jin Park, Julian Salazar, Aren Jansen, Keisuke Kinoshita, Yong Man Ro, R. J. Skerry-Ryan:
Long-Form Speech Generation with Spoken Language Models. CoRR abs/2412.18603 (2024) - 2023
- [j30]Katerina Zmolíková
, Marc Delcroix
, Tsubasa Ochiai
, Keisuke Kinoshita
, Jan Cernocký
, Dong Yu
:
Neural Target Speech Extraction: An overview. IEEE Signal Process. Mag. 40(3): 8-29 (2023) - [j29]Marc Delcroix
, Jorge Bennasar Vázquez, Tsubasa Ochiai, Keisuke Kinoshita
, Yasunori Ohishi
, Shoko Araki
:
SoundBeam: Target Sound Extraction Conditioned on Sound-Class Labels and Enrollment Clues for Increased Performance and Continuous Learning. IEEE ACM Trans. Audio Speech Lang. Process. 31: 121-136 (2023) - [j28]Thilo von Neumann
, Keisuke Kinoshita
, Christoph Böddeker
, Marc Delcroix
, Reinhold Haeb-Umbach
:
Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE ACM Trans. Audio Speech Lang. Process. 31: 576-589 (2023) - [j27]Hiroshi Sawada
, Rintaro Ikeshita
, Keisuke Kinoshita
, Tomohiro Nakatani
:
Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined Blind Source Separation and Dereverberation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3589-3602 (2023) - [c135]Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023: 1-5 - [i43]Katerina Zmolíková, Marc Delcroix, Tsubasa Ochiai, Keisuke Kinoshita
, Jan Cernocký, Dong Yu:
Neural Target Speech Extraction: An Overview. CoRR abs/2301.13341 (2023) - 2022
- [j26]Keisuke Kinoshita
, Takehito Kuge, Yoshie Hara, Kojiro Mekata:
Putamen Atrophy Is a Possible Clinical Evaluation Index for Parkinson's Disease Using Human Brain Magnetic Resonance Imaging. J. Imaging 8(11): 299 (2022) - [j25]Tomohiro Nakatani
, Rintaro Ikeshita
, Keisuke Kinoshita
, Hiroshi Sawada
, Naoyuki Kamo, Shoko Araki:
Switching Independent Vector Analysis and its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithms. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1032-1047 (2022) - [c134]Naoyuki Kamo, Rintaro Ikeshita, Keisuke Kinoshita
, Tomohiro Nakatani:
Importance of Switch Optimization Criterion in Switching WPE Dereverberation. ICASSP 2022: 176-180 - [c133]Hiroshi Sawada, Rintaro Ikeshita, Keisuke Kinoshita
, Tomohiro Nakatani:
Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined BSS in Reverberant Environments. ICASSP 2022: 496-500 - [c132]Thilo von Neumann, Keisuke Kinoshita
, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach
:
SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022: 6022-6026 - [c131]Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita
, Naoyuki Kamo, Takafumi Moriya:
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition. ICASSP 2022: 6287-6291 - [c130]Keisuke Kinoshita
, Marc Delcroix, Tomoharu Iwata:
Tight Integration Of Neural- And Clustering-Based Diarization Through Deep Unfolding Of Infinite Gaussian Mixture Model. ICASSP 2022: 8382-8386 - [c129]Marc Delcroix, Keisuke Kinoshita
, Tsubasa Ochiai, Katerina Zmolíková, Hiroshi Sato, Tomohiro Nakatani:
Listen only to me! How well can target speech extraction handle false alarms? INTERSPEECH 2022: 216-220 - [c128]Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita
, Takafumi Moriya, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Ryo Masumura:
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations. INTERSPEECH 2022: 996-1000 - [c127]Keisuke Kinoshita
, Thilo von Neumann, Marc Delcroix, Christoph Böddeker, Reinhold Haeb-Umbach:
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. INTERSPEECH 2022: 1486-1490 - [i42]Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Naoyuki Kamo, Takafumi Moriya:
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition. CoRR abs/2201.03881 (2022) - [i41]Keisuke Kinoshita, Marc Delcroix, Tomoharu Iwata:
Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model. CoRR abs/2202.06524 (2022) - [i40]Ayako Yamamoto, Toshio Irino, Shoko Araki, Kenichi Arai, Atsunori Ogawa, Keisuke Kinoshita, Tomohiro Nakatani:
Subjective intelligibility of speech sounds enhanced by ideal ratio mask via crowdsourced remote experiments with effective data screening. CoRR abs/2203.16760 (2022) - [i39]Marc Delcroix, Jorge Bennasar Vázquez, Tsubasa Ochiai, Keisuke Kinoshita
, Yasunori Ohishi, Shoko Araki:
SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning. CoRR abs/2204.03895 (2022) - [i38]Marc Delcroix, Keisuke Kinoshita
, Tsubasa Ochiai, Katerina Zmolíková, Hiroshi Sato, Tomohiro Nakatani:
Listen only to me! How well can target speech extraction handle false alarms? CoRR abs/2204.04811 (2022) - [i37]Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita
, Takafumi Moriya, Naoki Makishima, Mana Ihori, Tomohiro Tanaka, Ryo Masumura:
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations. CoRR abs/2206.08174 (2022) - [i36]Keisuke Kinoshita
, Thilo von Neumann, Marc Delcroix, Christoph Böddeker, Reinhold Haeb-Umbach:
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. CoRR abs/2207.13888 (2022) - [i35]Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems. CoRR abs/2211.16112 (2022) - 2021
- [j24]Rintaro Ikeshita
, Keisuke Kinoshita
, Naoyuki Kamo, Tomohiro Nakatani
:
Online Speech Dereverberation Using Mixture of Multichannel Linear Prediction Models. IEEE Signal Process. Lett. 28: 1580-1584 (2021) - [c126]Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
Speeding Up Permutation Invariant Training for Source Separation. ITG Conference on Speech Communication 2021: 1-5 - [c125]Tomohiro Nakatani, Rintaro Ikeshita, Naoyuki Kamo, Keisuke Kinoshita
, Shoko Araki, Hiroshi Sawada:
Switching Convolutional Beamformer. EUSIPCO 2021: 266-270 - [c124]Tetsuya Ueda, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita
, Shoko Araki, Shoji Makino:
Low Latency Online Source Separation and Noise Reduction Based on Joint Optimization with Dereverberation. EUSIPCO 2021: 1000-1004 - [c123]Tetsuya Ueda, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita
, Shoko Araki, Shoji Makino
:
Low Latency Online Blind Source Separation Based on Joint Optimization with Blind Dereverberation. ICASSP 2021: 506-510 - [c122]Julio Wissing
, Benedikt T. Boenninghoff, Dorothea Kolossa
, Tsubasa Ochiai, Marc Delcroix
, Keisuke Kinoshita
, Tomohiro Nakatani, Shoko Araki, Christopher Schymura
:
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain. ICASSP 2021: 4705-4709 - [c121]Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita
, Marc Delcroix
, Shinji Watanabe
, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. ICASSP 2021: 5739-5743 - [c120]Marc Delcroix
, Katerina Zmolíková
, Tsubasa Ochiai, Keisuke Kinoshita
, Tomohiro Nakatani:
Speaker Activity Driven Neural Speech Extraction. ICASSP 2021: 6099-6103 - [c119]Tsubasa Ochiai, Marc Delcroix
, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita
, Shoko Araki:
Neural Network-Based Virtual Microphone Estimator. ICASSP 2021: 6114-6118 - [c118]Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita
, Hiroshi Sawada, Shoko Araki:
Blind and Neural Network-Guided Convolutional Beamformer for Joint Denoising, Dereverberation, and Source Separation. ICASSP 2021: 6129-6133 - [c117]Wangyou Zhang, Christoph Böddeker, Shinji Watanabe
, Tomohiro Nakatani, Marc Delcroix
, Keisuke Kinoshita
, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach
, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021: 6898-6902 - [c116]Keisuke Kinoshita
, Marc Delcroix
, Naohiro Tawara:
Integrating End-to-End Neural and Clustering-Based Diarization: Getting the Best of Both Worlds. ICASSP 2021: 7198-7202 - [c115]Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita
, Tsubasa Ochiai, Marc Delcroix
, Naoyuki Kamo, Yanmin Qian, Reinhold Haeb-Umbach
:
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. ICASSP 2021: 8428-8432 - [c114]Ayako Yamamoto, Toshio Irino, Kenichi Arai
, Shoko Araki, Atsunori Ogawa, Keisuke Kinoshita
, Tomohiro Nakatani:
Comparison of Remote Experiments Using Crowdsourcing and Laboratory Experiments on Speech Intelligibility. Interspeech 2021: 181-185 - [c113]Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix
, Keisuke Kinoshita
, Takafumi Moriya, Naoyuki Kamo:
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition. Interspeech 2021: 1149-1153 - [c112]Christopher Schymura, Benedikt T. Bönninghoff, Tsubasa Ochiai, Marc Delcroix
, Keisuke Kinoshita
, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa:
PILOT: Introducing Transformers for Probabilistic Sound Event Localization. Interspeech 2021: 2117-2121 - [c111]Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita
, Shinji Watanabe
, Marc Delcroix
, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen:
Continuous Speech Separation Using Speaker Inventory for Long Recording. Interspeech 2021: 3036-3040 - [c110]Thilo von Neumann, Keisuke Kinoshita
, Christoph Böddeker, Marc Delcroix
, Reinhold Haeb-Umbach
:
Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021: 3490-3494 - [c109]Marc Delcroix
, Jorge Bennasar Vázquez, Tsubasa Ochiai, Keisuke Kinoshita
, Shoko Araki:
Few-Shot Learning of New Sound Classes for Target Sound Extraction. Interspeech 2021: 3500-3504 - [c108]Keisuke Kinoshita
, Marc Delcroix
, Naohiro Tawara:
Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech. Interspeech 2021: 3565-3569 - [c107]Hiroshi Sato, Tsubasa Ochiai, Keisuke Kinoshita
, Marc Delcroix
, Tomohiro Nakatani, Shoko Araki:
Multimodal Attention Fusion for Target Speaker Extraction. SLT 2021: 778-784 - [c106]Chenda Li, Yi Luo, Cong Han, Jinyu Li
, Takuya Yoshioka, Tianyan Zhou, Marc Delcroix
, Keisuke Kinoshita
, Christoph Böddeker, Yanmin Qian, Shinji Watanabe
, Zhuo Chen:
Dual-Path RNN for Long Recording Speech Separation. SLT 2021: 865-872 - [i34]Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Shoko Araki:
Neural Network-based Virtual Microphone Estimator. CoRR abs/2101.04315 (2021) - [i33]Marc Delcroix, Katerina Zmolíková, Tsubasa Ochiai, Keisuke Kinoshita, Tomohiro Nakatani:
Speaker activity driven neural speech extraction. CoRR abs/2101.05516 (2021) - [i32]Hiroshi Sato, Tsubasa Ochiai, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Shoko Araki:
Multimodal Attention Fusion for Target Speaker Extraction. CoRR abs/2102.01326 (2021) - [i31]Wangyou Zhang, Christoph Böddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. CoRR abs/2102.11525 (2021) - [i30]Julio Wissing, Benedikt T. Boenninghoff, Dorothea Kolossa, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Christopher Schymura:
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain. CoRR abs/2102.11588 (2021) - [i29]Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. CoRR abs/2102.11634 (2021) - [i28]Christopher Schymura, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa:
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization. CoRR abs/2103.00417 (2021) - [i27]Ayako Yamamoto, Toshio Irino, Kenichi Arai, Shoko Araki, Atsunori Ogawa, Keisuke Kinoshita, Tomohiro Nakatani:
Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility. CoRR abs/2104.10001 (2021) - [i26]Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara:
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech. CoRR abs/2105.09040 (2021) - [i25]Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Takafumi Moriya, Naoyuki Kamo:
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition. CoRR abs/2106.00949 (2021) - [i24]Christopher Schymura, Benedikt T. Bönninghoff, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa:
PILOT: Introducing Transformers for Probabilistic Sound Event Localization. CoRR abs/2106.03903 (2021) - [i23]Marc Delcroix, Jorge Bennasar Vázquez, Tsubasa Ochiai, Keisuke Kinoshita, Shoko Araki:
Few-shot learning of new sound classes for target sound extraction. CoRR abs/2106.07144 (2021) - [i22]Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
Speeding Up Permutation Invariant Training for Source Separation. CoRR abs/2107.14445 (2021) - [i21]Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers. CoRR abs/2107.14446 (2021) - [i20]Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Hiroshi Sawada, Shoko Araki:
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation. CoRR abs/2108.01836 (2021) - [i19]Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
SA-SDR: A novel loss function for separation of meeting style data. CoRR abs/2110.15581 (2021) - [i18]Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Hiroshi Sawada, Naoyuki Kamo, Shoko Araki:
Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithm. CoRR abs/2111.10574 (2021) - 2020
- [j23]Katsuhiko Yamamoto
, Toshio Irino, Shoko Araki
, Keisuke Kinoshita
, Tomohiro Nakatani:
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech. Speech Commun. 123: 43-58 (2020) - [j22]Tomohiro Nakatani
, Christoph Böddeker, Keisuke Kinoshita
, Rintaro Ikeshita
, Marc Delcroix
, Reinhold Haeb-Umbach
:
Jointly Optimal Denoising, Dereverberation, and Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2267-2282 (2020) - [c105]Christopher Schymura
, Tsubasa Ochiai, Marc Delcroix
, Keisuke Kinoshita
, Tomohiro Nakatani, Shoko Araki, Dorothea Kolossa
:
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization. EUSIPCO 2020: 231-235 - [c104]Christoph Böddeker, Tomohiro Nakatani, Keisuke Kinoshita
, Reinhold Haeb-Umbach
:
Jointly Optimal Dereverberation and Beamforming. ICASSP 2020: 216-220 - [c103]Keisuke Kinoshita
, Marc Delcroix
, Shoko Araki
, Tomohiro Nakatani:
Tackling Real Noisy Reverberant Meetings with All-Neural Source Separation, Counting, and Diarization System. ICASSP 2020: 381-385 - [c102]Christopher Schymura
, Tsubasa Ochiai, Marc Delcroix
, Keisuke Kinoshita
, Tomohiro Nakatani, Shoko Araki
, Dorothea Kolossa
:
A Dynamic Stream Weight Backprop Kalman Filter for Audiovisual Speaker Tracking. ICASSP 2020: 581-585 - [c101]Marc Delcroix
, Tsubasa Ochiai, Katerina Zmolíková
, Keisuke Kinoshita
, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki
:
Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam. ICASSP 2020: 691-695 - [c100]Tsubasa Ochiai, Marc Delcroix
, Rintaro Ikeshita, Keisuke Kinoshita
, Tomohiro Nakatani, Shoko Araki
:
Beam-TasNet: Time-domain Audio Separation Network Meets Frequency-domain Beamformer. ICASSP 2020: 6384-6388 - [c99]Tomohiro Nakatani, Riki Takahashi, Tsubasa Ochiai, Keisuke Kinoshita
, Rintaro Ikeshita, Marc Delcroix
, Shoko Araki
:
DNN-supported Mask-based Convolutional Beamforming for Simultaneous Denoising, Dereverberation, and Source Separation. ICASSP 2020: 6399-6403 - [c98]Thilo von Neumann, Keisuke Kinoshita
, Lukas Drude, Christoph Böddeker, Marc Delcroix
, Tomohiro Nakatani, Reinhold Haeb-Umbach
:
End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020: 7004-7008 - [c97]Keisuke Kinoshita
, Tsubasa Ochiai, Marc Delcroix
, Tomohiro Nakatani:
Improving Noise Robust Automatic Speech Recognition with Single-Channel Time-Domain Enhancement Network. ICASSP 2020: 7009-7013 - [c96]Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita
, Hiroshi Sawada, Shoko Araki:
Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation. INTERSPEECH 2020: 91-95 - [c95]Kenichi Arai
, Shoko Araki, Atsunori Ogawa, Keisuke Kinoshita
, Tomohiro Nakatani, Toshio Irino:
Predicting Intelligibility of Enhanced Speech Using Posteriors Derived from DNN-Based ASR System. INTERSPEECH 2020: 1156-1160 - [c94]Tsubasa Ochiai, Marc Delcroix
, Yuma Koizumi, Hiroaki Ito, Keisuke Kinoshita
, Shoko Araki:
Listen to What You Want: Neural Network-Based Universal Sound Selector. INTERSPEECH 2020: 1441-1445 - [c93]Keisuke Kinoshita
, Thilo von Neumann, Marc Delcroix
, Tomohiro Nakatani, Reinhold Haeb-Umbach
:
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. INTERSPEECH 2020: 2652-2656 - [c92]Thilo von Neumann, Christoph Böddeker, Lukas Drude, Keisuke Kinoshita
, Marc Delcroix
, Tomohiro Nakatani, Reinhold Haeb-Umbach
:
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. INTERSPEECH 2020: 3097-3101 - [c91]Ali Aroudi, Marc Delcroix
, Tomohiro Nakatani, Keisuke Kinoshita
, Shoko Araki
, Simon Doclo:
Cognitive-Driven Convolutional Beamforming Using EEG-Based Auditory Attention Decoding. MLSP 2020: 1-6 - [i17]Marc Delcroix, Tsubasa Ochiai, Katerina Zmolíková, Keisuke Kinoshita, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki:
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam. CoRR abs/2001.08378 (2020) - [i16]Keisuke Kinoshita, Marc Delcroix, Shoko Araki, Tomohiro Nakatani:
Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system. CoRR abs/2003.03987 (2020) - [i15]Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani:
Improving noise robust automatic speech recognition with single-channel time-domain enhancement network. CoRR abs/2003.03998 (2020) - [i14]Ali Aroudi, Marc Delcroix, Tomohiro Nakatani, Keisuke Kinoshita, Shoko Araki, Simon Doclo:
Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding. CoRR abs/2005.04669 (2020) - [i13]Tomohiro Nakatani, Christoph Böddeker, Keisuke Kinoshita, Rintaro Ikeshita, Marc Delcroix, Reinhold Haeb-Umbach:
Jointly optimal denoising, dereverberation, and source separation. CoRR abs/2005.09843 (2020) - [i12]Thilo von Neumann, Christoph Böddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR. CoRR abs/2006.02786 (2020) - [i11]Tsubasa Ochiai, Marc Delcroix, Yuma Koizumi, Hiroaki Ito, Keisuke Kinoshita, Shoko Araki:
Listen to What You Want: Neural Network-based Universal Sound Selector. CoRR abs/2006.05712 (2020) - [i10]Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-path RNN for hierarchical modeling of long sequential data and its application to speaker stream separation. CoRR abs/2006.13579 (2020) - [i9]Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara:
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds. CoRR abs/2010.13366 (2020) - [i8]Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, Shinji Watanabe, Reinhold Haeb-Umbach:
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation. CoRR abs/2011.15003 (2020) - [i7]Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen:
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording. CoRR abs/2012.09727 (2020)
2010 – 2019
- 2019
- [j21]Katerina Zmolíková
, Marc Delcroix
, Keisuke Kinoshita
, Tsubasa Ochiai, Tomohiro Nakatani
, Lukás Burget
, Jan Cernocký
:
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures. IEEE J. Sel. Top. Signal Process. 13(4): 800-814 (2019) - [j20]Tomohiro Nakatani
, Keisuke Kinoshita
:
A Unified Convolutional Beamformer for Simultaneous Denoising and Dereverberation. IEEE Signal Process. Lett. 26(6): 903-907 (2019) - [c90]Shoko Araki
, Nobutaka Ono, Keisuke Kinoshita
, Marc Delcroix
:
Projection Back onto Filtered Observations for Speech Separation with Distributed Microphone Array. CAMSAP 2019: 291-295 - [c89]Tomohiro Nakatani
, Keisuke Kinoshita
:
Maximum likelihood convolutional beamformer for simultaneous denoising and dereverberation. EUSIPCO 2019: 1-5 - [c88]Thilo von Neumann, Keisuke Kinoshita
, Marc Delcroix
, Shoko Araki
, Tomohiro Nakatani, Reinhold Haeb-Umbach
:
All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis. ICASSP 2019: 91-95 - [c87]Shoko Araki
, Nobutaka Ono
, Keisuke Kinoshita
, Marc Delcroix
:
Estimation of Sampling Frequency Mismatch between Distributed Asynchronous Microphones under Existence of Source Movements with Stationary Time Periods Detection. ICASSP 2019: 785-789 - [c86]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach
, Keisuke Kinoshita
, Tomohiro Nakatani:
Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. ICASSP 2019: 6655-6659 - [c85]Yuki Kubo, Tomohiro Nakatani, Marc Delcroix
, Keisuke Kinoshita
, Shoko Araki
:
Mask-based MVDR Beamformer for Noisy Multisource Environments: Introduction of Time-varying Spatial Covariance Model. ICASSP 2019: 6855-6859 - [c84]Marc Delcroix
, Katerina Zmolíková
, Tsubasa Ochiai, Keisuke Kinoshita
, Shoko Araki
, Tomohiro Nakatani:
Compact Network for Speakerbeam Target Speaker Extraction. ICASSP 2019: 6965-6969 - [c83]Tsubasa Ochiai, Marc Delcroix
, Keisuke Kinoshita
, Atsunori Ogawa, Tomohiro Nakatani:
A Unified Framework for Neural Speech Separation and Extraction. ICASSP 2019: 6975-6979 - [c82]Tomohiro Nakatani, Keisuke Kinoshita
:
Simultaneous Denoising and Dereverberation for Low-Latency Applications Using Frame-by-Frame Online Unified Convolutional Beamformer. INTERSPEECH 2019: 111-115 - [c81]Marc Delcroix
, Shinji Watanabe
, Tsubasa Ochiai, Keisuke Kinoshita
, Shigeki Karita, Atsunori Ogawa, Tomohiro Nakatani:
End-to-End SpeakerBeam for Single Channel Target Speech Recognition. INTERSPEECH 2019: 451-455 - [c80]Tsubasa Ochiai, Marc Delcroix
, Keisuke Kinoshita
, Atsunori Ogawa, Tomohiro Nakatani:
Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues. INTERSPEECH 2019: 2718-2722 - [c79]Kenichi Arai
, Shoko Araki
, Atsunori Ogawa, Keisuke Kinoshita
, Tomohiro Nakatani, Katsuhiko Yamamoto, Toshio Irino:
Predicting Speech Intelligibility of Enhanced Speech Using Phone Accuracy of DNN-Based ASR System. INTERSPEECH 2019: 4275-4279 - [c78]Tomohiro Nakatani, Keisuke Kinoshita
, Rintaro Ikeshita, Hiroshi Sawada, Shoko Araki
:
Simultaneous Denoising, Dereverberation, and Source Separation Using a Unified Convolutional Beamformer. WASPAA 2019: 224-228 - [i6]Thilo von Neumann, Keisuke Kinoshita, Marc Delcroix, Shoko Araki, Tomohiro Nakatani, Reinhold Haeb-Umbach:
All-neural online source separation, counting, and diarization for meeting analysis. CoRR abs/1902.07881 (2019) - [i5]Katsuhiko Yamamoto
, Toshio Irino, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani:
GEDI: Gammachirp Envelope Distortion Index for Predicting Intelligibility of Enhanced Speech. CoRR abs/1904.02096 (2019) - [i4]Tomohiro Nakatani, Keisuke Kinoshita:
Maximum likelihood convolutional beamformer for simultaneous denoising and dereverberation. CoRR abs/1908.02710 (2019) - [i3]Christoph Böddeker, Tomohiro Nakatani, Keisuke Kinoshita, Reinhold Haeb-Umbach:
Jointly optimal dereverberation and beamforming. CoRR abs/1910.13707 (2019) - [i2]Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Böddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
End-to-end training of time domain audio separation and recognition. CoRR abs/1912.08462 (2019) - 2018
- [j19]Marc Delcroix
, Keisuke Kinoshita
, Atsunori Ogawa
, Christian Huemmer
, Tomohiro Nakatani:
Context Adaptive Neural Network Based Acoustic Models for Rapid Adaptation. IEEE ACM Trans. Audio Speech Lang. Process. 26(5): 895-908 (2018) - [c77]Takuya Higuchi, Keisuke Kinoshita
, Nobutaka Ito, Shigeki Karita, Tomohiro Nakatani:
Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming. ICASSP 2018: 531-535 - [c76]Lukas Drude, Takuya Higuchi, Keisuke Kinoshita
, Tomohiro Nakatani, Reinhold Haeb-Umbach
:
Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation. ICASSP 2018: 691-695 - [c75]Keisuke Kinoshita
, Lukas Drude, Marc Delcroix
, Tomohiro Nakatani:
Listening to Each Speaker One by One with Recurrent Selective Hearing Networks. ICASSP 2018: 5064-5068 - [c74]Marc Delcroix
, Katerina Zmolíková
, Keisuke Kinoshita
, Atsunori Ogawa, Tomohiro Nakatani:
Single Channel Target Speaker Extraction and Recognition with Speaker Beam. ICASSP 2018: 5554-5558 - [c73]Shoko Araki
, Nobutaka Ono
, Keisuke Kinoshita
, Marc Delcroix
:
Meeting Recognition with Asynchronous Distributed Microphone Array Using Block-Wise Refinement of Mask-Based MVDR Beamformer. ICASSP 2018: 5694-5698 - [c72]Katerina Zmolíková
, Marc Delcroix
, Keisuke Kinoshita
, Takuya Higuchi, Tomohiro Nakatani, Jan Cernocký
:
Optimization of Speaker-Aware Multichannel Speech Extraction with ASR Criterion. ICASSP 2018: 6702-6706 - [c71]Katsuhiko Yamamoto
, Toshio Irino, Narumi Ohashi, Shoko Araki
, Keisuke Kinoshita
, Tomohiro Nakatani:
Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech. INTERSPEECH 2018: 1863-1867 - [c70]Lukas Drude, Christoph Böddeker, Jahn Heymann, Reinhold Haeb-Umbach
, Keisuke Kinoshita
, Marc Delcroix
, Tomohiro Nakatani:
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation. INTERSPEECH 2018: 3043-3047 - [c69]Yutaro Matsui, Tomohiro Nakatani, Marc Delcroix
, Keisuke Kinoshita
, Nobutaka Ito, Shoko Araki
, Shoji Makino:
Online Integration of DNN-Based and Spatial Clustering-Based Mask Estimation for Robust MVDR Beamforming. IWAENC 2018: 71-75 - [c68]Shoko Araki
, Nobutaka Ono
, Keisuke Kinoshita
, Marc Delcroix
:
Comparison of Reference Microphone Selection Algorithms for Distributed Microphone Array Based Speech Enhancement in Meeting Recognition Scenarios. IWAENC 2018: 316-320 - [c67]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach
, Keisuke Kinoshita
, Tomohiro Nakatani:
Frame-Online DNN-WPE Dereverberation. IWAENC 2018: 466-470 - [i1]Tomohiro Nakatani, Keisuke Kinoshita:
A unified convolutional beamformer for simultaneous denoising and dereverberation. CoRR abs/1812.08400 (2018) - 2017
- [c66]Katerina Zmolíková
, Marc Delcroix
, Keisuke Kinoshita
, Takuya Higuchi, Atsunori Ogawa, Tomohiro Nakatani:
Learning speaker representation for neural network based multichannel speaker extraction. ASRU 2017: 8-15 - [c65]Shoko Araki
, Nobutaka Ono
, Keisuke Kinoshita
, Marc Delcroix
:
Meeting recognition with asynchronous distributed microphone array. ASRU 2017: 32-39 - [c64]Takuya Higuchi, Keisuke Kinoshita
, Marc Delcroix
, Tomohiro Nakatani:
Adversarial training for data-driven speech enhancement without parallel corpus. ASRU 2017: 40-47 - [c63]Shoko Araki
, Nobutaka Ito, Marc Delcroix
, Atsunori Ogawa, Keisuke Kinoshita
, Takuya Higuchi, Takuya Yoshioka, Dung T. Tran, Shigeki Karita, Tomohiro Nakatani:
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming. HSCMA 2017: 16-20 - [c62]Keisuke Kinoshita
, Marc Delcroix
, Atsunori Ogawa, Takuya Higuchi, Tomohiro Nakatani:
Deep mixture density network for statistical model-based feature enhancement. ICASSP 2017: 251-255 - [c61]Tomohiro Nakatani, Nobutaka Ito, Takuya Higuchi, Shoko Araki
, Keisuke Kinoshita
:
Integrating DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming. ICASSP 2017: 286-290 - [c60]Christian Huemmer, Marc Delcroix
, Atsunori Ogawa, Keisuke Kinoshita
, Tomohiro Nakatani, Walter Kellermann:
Online environmental adaptation of CNN-based acoustic models using spatial diffuseness features. ICASSP 2017: 4875-4879 - [c59]Takuya Higuchi, Takuya Yoshioka, Keisuke Kinoshita
, Tomohiro Nakatani:
Unsupervised utterance-wise beamformer estimation with speech recognition-level criterion. ICASSP 2017: 5170-5174 - [c58]Tsubasa Ochiai, Marc Delcroix
, Keisuke Kinoshita
, Atsunori Ogawa, Taichi Asami, Shigeru Katagiri, Tomohiro Nakatani:
Cumulative moving averaged bottleneck speaker vectors for online speaker adaptation of CNN-based acoustic models. ICASSP 2017: 5175-5179 - [c57]Keisuke Kinoshita
, Marc Delcroix
, Haeyong Kwon, Takuma Mori, Tomohiro Nakatani:
Neural Network-Based Spectrum Estimation for Online WPE Dereverberation. INTERSPEECH 2017: 384-388 - [c56]Takuya Higuchi, Keisuke Kinoshita
, Marc Delcroix
, Katerina Zmolíková
, Tomohiro Nakatani:
Deep Clustering-Based Beamforming for Separation with Unknown Number of Sources. INTERSPEECH 2017: 1183-1187 - [c55]Atsunori Ogawa, Keisuke Kinoshita
, Marc Delcroix
, Tomohiro Nakatani:
Improved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search. INTERSPEECH 2017: 1963-1967 - [c54]Katerina Zmolíková
, Marc Delcroix
, Keisuke Kinoshita
, Takuya Higuchi, Atsunori Ogawa, Tomohiro Nakatani:
Speaker-Aware Neural Network Based Beamformer for Speaker Extraction in Speech Mixtures. INTERSPEECH 2017: 2655-2659 - [c53]Katsuhiko Yamamoto
, Toshio Irino, Toshie Matsui, Shoko Araki
, Keisuke Kinoshita
, Tomohiro Nakatani:
Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio. INTERSPEECH 2017: 2949-2953 - [p3]Marc Delcroix, Takuya Yoshioka, Nobutaka Ito, Atsunori Ogawa, Keisuke Kinoshita, Masakiyo Fujimoto, Takuya Higuchi, Shoko Araki, Tomohiro Nakatani:
Multichannel Speech Enhancement Approaches to DNN-Based Far-Field Speech Recognition. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 21-49 - [p2]Keisuke Kinoshita, Marc Delcroix, Sharon Gannot, Emanuël A. P. Habets, Reinhold Haeb-Umbach, Walter Kellermann, Volker Leutnant, Roland Maas, Tomohiro Nakatani, Bhiksha Raj, Armin Sehr, Takuya Yoshioka:
The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 345-354 - 2016
- [j18]Keisuke Kinoshita
, Marc Delcroix
, Sharon Gannot
, Emanuël A. P. Habets, Reinhold Haeb-Umbach
, Walter Kellermann, Volker Leutnant, Roland Maas, Tomohiro Nakatani, Bhiksha Raj, Armin Sehr, Takuya Yoshioka:
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research. EURASIP J. Adv. Signal Process. 2016: 7 (2016) - [c52]Naoki Murata, Hirokazu Kameoka, Keisuke Kinoshita
, Shoko Araki
, Tomohiro Nakatani, Shoichi Koyama
, Hiroshi Saruwatari:
Reverberation-robust underdetermined source separation with non-negative tensor double deconvolution. EUSIPCO 2016: 1648-1652 - [c51]Marc Delcroix
, Keisuke Kinoshita
, Chengzhu Yu, Atsunori Ogawa, Takuya Yoshioka, Tomohiro Nakatani:
Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions. ICASSP 2016: 5270-5274 - [c50]Keigo Watanabe, Keisuke Kinoshita, Isaku Nagai, Maki K. Habib
:
Development of a camera-mounted tethered Quadrotor for inspecting infrastructures. IECON 2016: 6128-6133 - [c49]Marc Delcroix
, Keisuke Kinoshita
, Atsunori Ogawa, Takuya Yoshioka, Dung T. Tran, Tomohiro Nakatani:
Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models. INTERSPEECH 2016: 1573-1577 - [c48]Katsuhiko Yamamoto
, Toshio Irino, Toshie Matsui, Shoko Araki
, Keisuke Kinoshita
, Tomohiro Nakatani:
Speech Intelligibility Prediction Based on the Envelope Power Spectrum Model with the Dynamic Compressive Gammachirp Auditory Filterbank. INTERSPEECH 2016: 2885-2889 - [c47]Atsunori Ogawa, Shogo Seki, Keisuke Kinoshita
, Marc Delcroix
, Takuya Yoshioka, Tomohiro Nakatani, Kazuya Takeda:
Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement. INTERSPEECH 2016: 3733-3737 - 2015
- [j17]Miquel Espi, Masakiyo Fujimoto, Keisuke Kinoshita
, Tomohiro Nakatani:
Exploiting spectro-temporal locality in deep learning based acoustic event detection. EURASIP J. Audio Speech Music. Process. 2015: 26 (2015) - [j16]Marc Delcroix
, Takuya Yoshioka, Atsunori Ogawa, Yotaro Kubo, Masakiyo Fujimoto, Nobutaka Ito, Keisuke Kinoshita
, Miquel Espi, Shoko Araki
, Takaaki Hori, Tomohiro Nakatani
:
Strategies for distant speech recognitionin reverberant environments. EURASIP J. Adv. Signal Process. 2015: 60 (2015) - [c46]Keigo Watanabe, Yusuke Ouchi, Keisuke Kinoshita, Isaku Nagai:
Control of the position and attitude of a tethered quadrotor considering the influence of a tether. ASCC 2015: 1-6 - [c45]Takuya Yoshioka, Nobutaka Ito, Marc Delcroix
, Atsunori Ogawa, Keisuke Kinoshita
, Masakiyo Fujimoto, Chengzhu Yu, Wojciech J. Fabian, Miquel Espi, Takuya Higuchi, Shoko Araki
, Tomohiro Nakatani
:
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices. ASRU 2015: 436-443 - [c44]Keisuke Kinoshita
, Tomohiro Nakatani:
Modeling inter-node acoustic dependencies with Restricted Boltzmann Machine for distributed microphone array based BSS. ICASSP 2015: 464-468 - [c43]Marc Delcroix
, Keisuke Kinoshita
, Takaaki Hori, Tomohiro Nakatani:
Context adaptive deep neural networks for fast acoustic model adaptation. ICASSP 2015: 4535-4539 - [c42]Keisuke Kinoshita, Marc Delcroix, Atsunori Ogawa, Tomohiro Nakatani:
Text-informed speech enhancement with deep neural networks. INTERSPEECH 2015: 1760-1764 - [c41]Miquel Espi, Masakiyo Fujimoto, Keisuke Kinoshita, Tomohiro Nakatani:
Feature extraction strategies in deep learning based acoustic event detection. INTERSPEECH 2015: 2922-2926 - 2014
- [j15]Mehrez Souden, Keisuke Kinoshita
, Marc Delcroix
, Tomohiro Nakatani:
Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays. IEEE ACM Trans. Audio Speech Lang. Process. 22(2): 354-367 (2014) - [c40]Marc Delcroix
, Takuya Yoshioka, Atsunori Ogawa, Yotaro Kubo, Masakiyo Fujimoto, Nobutaka Ito, Keisuke Kinoshita
, Miquel Espi, Shoko Araki
, Takaaki Hori, Tomohiro Nakatani:
Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition. GlobalSIP 2014: 522-526 - [c39]Atsunori Ogawa, Keisuke Kinoshita
, Takaaki Hori, Tomohiro Nakatani, Atsushi Nakamura:
Fast segment search for corpus-based speech enhancement based on speech recognition technology. ICASSP 2014: 1557-1561 - 2013
- [j14]Marc Delcroix
, Keisuke Kinoshita
, Tomohiro Nakatani, Shoko Araki
, Atsunori Ogawa, Takaaki Hori, Shinji Watanabe
, Masakiyo Fujimoto, Takuya Yoshioka, Takanobu Oba, Yotaro Kubo, Mehrez Souden, Seong-Jun Hahm, Atsushi Nakamura:
Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds. Comput. Speech Lang. 27(3): 851-873 (2013) - [j13]Mehrez Souden, Shoko Araki
, Keisuke Kinoshita
, Tomohiro Nakatani, Hiroshi Sawada:
A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction. IEEE Trans. Speech Audio Process. 21(9): 1913-1928 (2013) - [c38]Mehrez Souden, Keisuke Kinoshita
, Tomohiro Nakatani:
An integration of source location cues for speech clustering in distributed microphone arrays. ICASSP 2013: 111-115 - [c37]Roland Maas, Walter Kellermann, Armin Sehr, Takuya Yoshioka, Marc Delcroix
, Keisuke Kinoshita
, Tomohiro Nakatani:
Formulation of the REMOS concept from an uncertainty decoding perspective. DSP 2013: 1-6 - [c36]Keisuke Kinoshita, Mehrez Souden, Tomohiro Nakatani:
Blind source separation using spatially distributed microphones based on microphone-location dependent source activities. INTERSPEECH 2013: 822-826 - [c35]Yasufumi Uezu, Keisuke Kinoshita, Mehrez Souden, Tomohiro Nakatani:
On the robustness of distributed EM based BSS in asynchronous distributed microphone array scenarios. INTERSPEECH 2013: 3298-3302 - [c34]Armin Sehr, Takuya Yoshioka, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Roland Maas, Walter Kellermann:
Conditional emission densities for combining speech enhancement and recognition systems. INTERSPEECH 2013: 3502-3506 - [c33]Keisuke Kinoshita
, Tomohiro Nakatani:
Microphone-location dependent mask estimation for BSS using spatially distributed asynchronous microphones. ISPACS 2013: 326-331 - [c32]Keisuke Kinoshita
, Marc Delcroix
, Takuya Yoshioka, Tomohiro Nakatani, Armin Sehr, Walter Kellermann, Roland Maas:
The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech. WASPAA 2013: 1-4 - 2012
- [j12]Mehrez Souden, Marc Delcroix
, Keisuke Kinoshita
, Takuya Yoshioka, Tomohiro Nakatani:
Noise Power Spectral Density Tracking: A Maximum Likelihood Perspective. IEEE Signal Process. Lett. 19(8): 495-498 (2012) - [j11]Takuya Yoshioka, Armin Sehr, Marc Delcroix
, Keisuke Kinoshita
, Roland Maas, Tomohiro Nakatani
, Walter Kellermann:
Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition. IEEE Signal Process. Mag. 29(6): 114-126 (2012) - [j10]Takaaki Hori, Shoko Araki
, Takuya Yoshioka, Masakiyo Fujimoto, Shinji Watanabe
, Takanobu Oba, Atsunori Ogawa, Kazuhiro Otsuka, Dan Mikami
, Keisuke Kinoshita
, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato
:
Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera. IEEE Trans. Speech Audio Process. 20(2): 499-513 (2012) - [c31]Takuya Yoshioka, Armin Sehr, Marc Delcroix, Keisuke Kinoshita, Roland Maas, Tomohiro Nakatani, Walter Kellermann:
Survey on approaches to speech recognition in reverberant environments. APSIPA 2012: 1-4 - [c30]Mehrez Souden, Shoko Araki
, Keisuke Kinoshita
, Tomohiro Nakatani, Hiroshi Sawada:
A multichannel MMSE-based framework for joint blind source separation and noise reduction. ICASSP 2012: 109-112 - [c29]Keisuke Kinoshita, Marc Delcroix, Mehrez Souden, Tomohiro Nakatani:
Example-based speech enhancement with joint utilization of spatial, spectral & temporal cues of speech and noise. INTERSPEECH 2012: 1926-1929 - [c28]Mehrez Souden, Keisuke Kinoshita
, Marc Delcroix
, Tomohiro Nakatani:
Distributed microphone array processing for speech source separation with classifier fusion. MLSP 2012: 1-6 - 2011
- [c27]Keisuke Kinoshita, Mehrez Souden, Marc Delcroix, Tomohiro Nakatani:
Single Channel Dereverberation Using Example-Based Speech Enhancement with Uncertainty Decoding Technique. INTERSPEECH 2011: 197-200 - [c26]Mehrez Souden, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani:
A Multichannel Feature-Based Processing for Robust Speech Recognition. INTERSPEECH 2011: 689-692 - 2010
- [j9]Tomohiro Nakatani
, Takuya Yoshioka, Keisuke Kinoshita
, Masato Miyoshi, Biing-Hwang Juang:
Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction. IEEE Trans. Speech Audio Process. 18(7): 1717-1731 (2010) - [c25]Keisuke Kinoshita
, Tomohiro Nakatani, Masato Miyoshi:
Blind upmix of stereo music signals using multi-step linear prediction based reverberation extraction. ICASSP 2010: 49-52 - [c24]Takaaki Hori, Shoko Araki
, Takuya Yoshioka, Masakiyo Fujimoto, Shinji Watanabe
, Takanobu Oba, Atsunori Ogawa, Kazuhiro Otsuka, Dan Mikami, Keisuke Kinoshita
, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato
:
Real-time meeting recognition and understanding using distant microphones and omni-directional camera. SLT 2010: 424-429 - [p1]Masato Miyoshi, Marc Delcroix, Keisuke Kinoshita, Takuya Yoshioka, Tomohiro Nakatani, Takafumi Hikichi:
Inverse Filtering for Speech Dereverberation Without the Use of Room Acoustics Information. Speech Dereverberation 2010: 271-310
2000 – 2009
- 2009
- [j8]Keisuke Kinoshita
, Marc Delcroix
, Tomohiro Nakatani, Masato Miyoshi:
Suppression of Late Reverberation Effect on Speech Signal Using Long-Term Multiple-step Linear Prediction. IEEE Trans. Speech Audio Process. 17(4): 534-545 (2009) - [c23]Tomohiro Nakatani, Takuya Yoshioka, Keisuke Kinoshita
, Masato Miyoshi, Biing-Hwang Juang:
Real-time speech enhancement in noisy reverberant multi-talker environments based on a location-independent room acoustics model. ICASSP 2009: 137-140 - 2008
- [j7]Masato Miyoshi, Marc Delcroix
, Keisuke Kinoshita
:
Calculating Inverse Filters for Speech Dereverberation. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 91-A(6): 1303-1309 (2008) - [j6]Tomohiro Nakatani
, Biing-Hwang Juang, Takuya Yoshioka, Keisuke Kinoshita
, Marc Delcroix
, Masato Miyoshi:
Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model. IEEE Trans. Speech Audio Process. 16(8): 1512-1527 (2008) - [c22]Masato Miyoshi, Keisuke Kinoshita
, Takuya Yoshioka, Tomohiro Nakatani:
Principles and applications of dereverberation for noisy and reverberant audio signals. ACSCC 2008: 793-796 - [c21]Tomohiro Nakatani, Takuya Yoshioka, Keisuke Kinoshita
, Masato Miyoshi, Biing-Hwang Juang:
Blind speech dereverberation with multi-channel linear prediction based on short time fourier transform representation. ICASSP 2008: 85-88 - 2007
- [j5]Tomohiro Nakatani, Keisuke Kinoshita
, Masato Miyoshi:
Harmonicity-Based Blind Dereverberation for Single-Channel Speech Signals. IEEE Trans. Speech Audio Process. 15(1): 80-95 (2007) - [c20]Tomohiro Nakatani, Biing-Hwang Juang, Takafumi Hikichi, Takuya Yoshioka, Keisuke Kinoshita
, Marc Delcroix
, Masato Miyoshi:
Study on Speech Dereverberation with Autocorrelation Codebook. ICASSP (1) 2007: 193-196 - [c19]Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Masato Miyoshi:
Multi-step linear prediction based speech dereverberation in noisy reverberant environment. INTERSPEECH 2007: 854-857 - [c18]Tomohiro Nakatani, Takafumi Hikichi, Keisuke Kinoshita
, Takuya Yoshioka, Marc Delcroix
, Masato Miyoshi, Biing-Hwang Juang:
Robust blind dereverberation of speech signals based on characteristics of short-time speech segments. ISCAS 2007: 2986-2989 - 2006
- [j4]Tomohiro Nakatani, Masato Miyoshi, Keisuke Kinoshita
:
Blind dereverberation of monaural speech signals based on harmonic structure. Syst. Comput. Jpn. 37(6): 1-12 (2006) - [c17]Keisuke Kinoshita, Tomohiro Nakatani, Masato Miyoshi:
Spectral Subtraction Steered by Multi-Step Forward Linear Prediction For Single Channel Speech Dereverberation. ICASSP (1) 2006: 817-820 - [c16]Tomohiro Nakatani, Biing-Hwang Juang, Keisuke Kinoshita, Masato Miyoshi:
Speech Dereverberation Based on Probabilistic Models of Source and Room Acoustics. ICASSP (1) 2006: 821-824 - 2005
- [j3]Keisuke Kinoshita
, Tomohiro Nakatani, Masato Miyoshi:
Harmonicity Based Dereverberation for Improving Automatic Speech Recognition Performance and Speech Intelligibility. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88-A(7): 1724-1731 (2005) - [j2]Akiko Kusumoto, Takayuki Arai, Keisuke Kinoshita
, Nao Hodoshima
, Nancy Vaughan:
Modulation enhancement of speech by a pre-processing algorithm for improving intelligibility in reverberant environments. Speech Commun. 45(2): 101-113 (2005) - [c15]Keisuke Kinoshita
, Tomohiro Nakatani, Masato Miyoshi:
Fast Estimation of a Precise Dereverberation Filter based on Speech Harmonicity. ICASSP (1) 2005: 1073-1076 - [c14]Keisuke Kinoshita, Tomohiro Nakatani, Masato Miyoshi:
Efficient blind dereverberation framework for automatic speech recognition. INTERSPEECH 2005: 3145-3148 - [c13]Sayoko Takano, Keisuke Kinoshita, Kiyoshi Honda:
Measurement of cricothyroid articulation using high-resolution MRI and 3d pattern matching. MAVEBA 2005: 141-144 - 2004
- [c12]Sabri Gurbuz, Keisuke Kinoshita, Marcia Riley, Sumio Yano:
Biologically valid jaw movements for talking humanoid robots. Humanoids 2004: 781-793 - [c11]Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi, Parham Zolfaghari:
Harmonicity based blind dereverberation with time warping. SAPA@INTERSPEECH 2004: 53 - [c10]Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi, Parham Zolfaghari:
Harmonicity based monaural speech dereverberation with time warping and F0 adaptive window. INTERSPEECH 2004: 873-876 - [c9]Keisuke Kinoshita, Tomohiro Nakatani, Masato Miyoshi:
Improving automatic speech recognition performance and speech inteligibility with harmonicity based dereverberation. INTERSPEECH 2004: 2649-2652 - 2003
- [c8]Nao Hodoshima, Takayuki Arai, Tsuyoshi Inoue, Keisuke Kinoshita, Akiko Kusumoto:
Improving speech intelligibility by steady-state suppression as pre-processing in small to medium sized halls. INTERSPEECH 2003: 1365-1368 - [c7]Tomohiro Nakatani, Masato Miyoshi, Keisuke Kinoshita:
One Microphone Blind Dereverberation Based on Quasi-periodicity of Speech Signals. NIPS 2003: 1417-1424 - 2002
- [c6]Keisuke Kinoshita, Dawn M. Behne, Takayuki Arai:
Duration and F0 as perceptual cues to Japanese vowel quantity. INTERSPEECH 2002: 757-760 - 2000
- [j1]Keisuke Kinoshita, Michael Lindenbaum:
Robotic Control with Partial Visual Information. Int. J. Comput. Vis. 37(1): 65-78 (2000) - [c5]Keisuke Kinoshita, Michael Lindenbaum:
Camera Model Selection Based on Geometric AIC. CVPR 2000: 2514-2519 - [c4]Tomoko Kitamura, Keisuke Kinoshita, Takayuki Arai, Akiko Kusumoto, Yuji Murahara:
Designing modulation filters for improving speech intelligibility in reverberant environments. INTERSPEECH 2000: 586-589
1990 – 1999
- 1998
- [c3]Keisuke Kinoshita, Michael Lindenbaum:
Robotic Control with Partial Visual Information. ICCV 1998: 883-888 - 1992
- [c2]Keisuke Kinoshita, Koichiro Deguchi:
3-D shape recognition by active vision-without camera velocity information. ICPR (1) 1992: 177-180 - 1990
- [c1]Keisuke Kinoshita, Koichiro Deguchi:
3-D Shape Reconstruction from Camera Motion with Inexact Motion Parameters. MVA 1990: 279-282
Coauthor Index
aka: Reinhold Haeb-Umbach

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-03-04 22:22 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint