Kiyohiro Shikano
2010 – 2019
- 2014
- [j73]Daichi Kitamura, Hiroshi Saruwatari, Kosuke Yagi, Kiyohiro Shikano, Yu Takahashi, Kazunobu Kondo:
Music Signal Separation Based on Supervised Nonnegative Matrix Factorization with Orthogonality and Maximum-Divergence Penalties. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 97-A(5): 1113-1118 (2014) - [j72]Ryoichi Miyazaki, Hiroshi Saruwatari, Satoshi Nakamura, Kiyohiro Shikano, Kazunobu Kondo, Jonathan Blanchette, Martin Bouchard:
Musical-noise-free blind speech extraction integrating microphone array and iterative spectral subtraction. Signal Process. 102: 226-239 (2014) - [j71]Hironori Doi, Tomoki Toda, Keigo Nakamura, Hiroshi Saruwatari, Kiyohiro Shikano:
Alaryngeal Speech Enhancement Based on One-to-Many Eigenvoice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 22(1): 172-183 (2014) - 2013
- [j70]Rafael Torres, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano:
Comparison of Methods for Topic Classification of Spoken Inquiries. Inf. Media Technol. 8(2): 438-448 (2013) - [j69]Rafael Torres, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano:
Comparison of Methods for Topic Classification of Spoken Inquiries. J. Inf. Process. 21(2): 157-167 (2013) - [c269]Fine Dwinita Aprilyanti, Hiroshi Saruwatari, Kiyohiro Shikano, Satoshi Nakamura, Tomoya Takatani:
Semi-blind algorithm for joint noise suppression and dereverberation based on higher-order statistics and acoustic model likelihood. APSIPA 2013: 1-6 - [c268]Ryoichi Miyazaki, Hiroshi Saruwatari, Satoshi Nakamura, Kiyohiro Shikano, Kazunobu Kondo, Jonathan Blanchette, Martin Bouchard:
Toward musical-noise-free blind speech extraction: Concept and its applications. APSIPA 2013: 1-10 - [c267]Daichi Kitamura, Hiroshi Saruwatari, Yusuke Iwao, Kiyohiro Shikano, Kazunobu Kondo, Yu Takahashi:
Superresolution-based stereo signal separation via supervised nonnegative matrix factorization. DSP 2013: 1-6 - [c266]Daichi Kitamura, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo, Yu Takahashi:
Music signal separation by supervised nonnegative matrix factorization with basis deformation. DSP 2013: 1-6 - [c265]Hiroshi Saruwatari, Suzumi Kanehara, Ryoichi Miyazaki, Kiyohiro Shikano, Kazunobu Kondo:
Musical noise analysis for Bayesian minimum mean-square error speech amplitude estimators based on higher-order statistics. INTERSPEECH 2013: 441-445 - [c264]Daichi Kitamura, Hiroshi Saruwatari, Kosuke Yagi, Kiyohiro Shikano, Yu Takahashi, Kazunobu Kondo:
Robust music signal separation based on supervised nonnegative matrix factorization with prevention of basis sharing. ISSPIT 2013: 392-397 - 2012
- [j68]Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano:
Theoretical Analysis of Amounts of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 95-A(2): 586-590 (2012) - [j67]Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 95-A(2): 591-595 (2012) - [j66]Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. Speech Commun. 54(1): 134-146 (2012) - [j65]Ryoichi Miyazaki, Hiroshi Saruwatari, Takayuki Inoue, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo:
Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction. IEEE Trans. Speech Audio Process. 20(7): 2080-2094 (2012) - [j64]Tomoki Toda, Mikihiro Nakagiri, Kiyohiro Shikano:
Statistical Voice Conversion Techniques for Body-Conducted Unvoiced Speech Enhancement. IEEE Trans. Speech Audio Process. 20(9): 2505-2517 (2012) - [c263]Fine Dwinita Aprilyanti, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Optimization scheme of joint noise suppression and dereverberation based on higher-order statistics. APSIPA 2012: 1-6 - [c262]Suzumi Kanehara, Hiroshi Saruwatari, Ryoichi Miyazaki, Kiyohiro Shikano, Kazunobu Kondo:
Comparative study on various noise reduction methods with decision-directed a priori SNR estimator via higher-order statistics. APSIPA 2012: 1-6 - [c261]Kazuma Nishimura, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Response generation based on statistical machine translation for speech-oriented guidance system. APSIPA 2012: 1-4 - [c260]Yuji Onuma, Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano:
Real-time semi-blind speech extraction with speaker direction tracking on Kinect. APSIPA 2012: 1-6 - [c259]Hiroshi Saruwatari, Ryo Wakisaka, Kiyohiro Shikano, Frédéric Mustière, Louis Thibault, Hossein Najaf-Zadeh, Martin Bouchard:
Sound-localization-preserved binaural MMSE STSA estimator with explicit and implicit binaural cues. EUSIPCO 2012: 310-314 - [c258]Noriyoshi Kamado, Masayuki Hirata, Hiroshi Saruwatari, Kiyohiro Shikano:
Object-based stereo up-mixer for wave field synthesis based on spatial information clustering. EUSIPCO 2012: 594-598 - [c257]Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Speech kurtosis estimation from observed noisy signal based on generalized Gaussian distribution prior and additivity of cumulants. ICASSP 2012: 4049-4052 - [c256]Kenzo Yamamoto, Tomoki Toda, Hironori Doi, Hiroshi Saruwatari, Kiyohiro Shikano:
Statistical approach to voice quality control in esophageal speech enhancement. ICASSP 2012: 4497-4500 - [c255]Ryoichi Miyazaki, Hiroshi Saruwatari, Takayuki Inoue, Kiyohiro Shikano, Kazunobu Kondo:
Musical-noise-free speech enhancement: Theory and evaluation. ICASSP 2012: 4565-4568 - [c254]Haruka Majima, Rafael Torres, Yoko Fujita, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano:
Spoken Inquiry Discrimination Using Bag-of-Words for Speech-Oriented Guidance System. INTERSPEECH 2012: 2097-2100 - [c253]Keigo Kubo, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Evaluation of Many-to-Many Alignment Algorithm by Automatic Pronunciation Annotation Using Web Text Mining. INTERSPEECH 2012: 2318-2321 - [c252]Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo:
Musical-noise-free blind speech extraction using ICA-based noise estimation and iterative spectral subtraction. ISSPA 2012: 286-291 - [c251]Miyuki Itoi, Ryoichi Miyazaki, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind speech extraction for Non-Audible Murmur speech with speaker's movement noise. ISSPIT 2012: 320-325 - [c250]Suzumi Kanehara, Hiroshi Saruwatari, Ryoichi Miyazaki, Kiyohiro Shikano, Kazunobu Kondo:
Theoretical Analysis of Musical Noise Generation in Noise Reduction Methods with Decision-Directed a Priori SNR Estimator. IWAENC 2012 - [c249]Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo:
Musical-Noise-Free Blind Speech Extraction Using ICA-Based Noise Estimation with Channel Selection. IWAENC 2012 - [c248]Sunao Hara, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Development of a Toolkit Handling Multiple Speech-Oriented Guidance Agents for Mobile Applications. IWSDS 2012: 79-85 - [c247]Rafael Torres, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano:
Topic Classification of Spoken Inquiries Using Transductive Support Vector Machine. IWSDS 2012: 261-267 - [c246]Haruka Majima, Rafael Torres, Hiromichi Kawanami, Sunao Hara, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano:
Evaluation of Invalid Input Discrimination Using Bag-of-Words for Speech-Oriented Guidance System. IWSDS 2012: 389-397 - 2011
- [j63]Noriyoshi Kamado, Haruhide Hokari, Shoji Shimada, Hiroshi Saruwatari, Kiyohiro Shikano:
Sound Field Reproduction by Wavefront Synthesis Using Directly Aligned Multi Point Control. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 94-A(3): 907-920 (2011) - [j62]Hiroshi Saruwatari, Yohei Ishikawa, Yu Takahashi, Takayuki Inoue, Kiyohiro Shikano, Kazunobu Kondo:
Musical Noise Controllable Algorithm of Channelwise Spectral Subtraction and Adaptive Beamforming Based on Higher Order Statistics. IEEE Trans. Speech Audio Process. 19(6): 1457-1466 (2011) - [j61]Takayuki Inoue, Hiroshi Saruwatari, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo:
Theoretical Analysis of Musical Noise in Generalized Spectral Subtraction Based on Higher Order Statistics. IEEE Trans. Speech Audio Process. 19(6): 1770-1779 (2011) - [c245]Hiroyuki Nawata, Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano:
Automatic musical thumbnailing based on audio object localization and its evaluation. ICASSP 2011: 41-44 - [c244]Noriyoshi Kamado, Hiroshi Saruwatari, Kiyohiro Shikano:
Robust sound field reproduction integrating multi-point sound field control and wave field synthesis. ICASSP 2011: 441-444 - [c243]Takayuki Inoue, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo:
Theoretical analysis of musical noise in Wiener filtering family via higher-order statistics. ICASSP 2011: 5076-5079 - [c242]Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques. ICASSP 2011: 5136-5139 - [c241]Denis Babani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Acoustic model training for non-audible murmur recognition using transformed normal speech data. ICASSP 2011: 5224-5227 - [c240]Ryoichi Miyazaki, Hiroshi Saruwatari, Kiyohiro Shikano:
Theoretical Analysis of Musical Noise and Speech Distortion in Structure-Generalized Parametric Blind Spatial Subtraction Array. INTERSPEECH 2011: 341-344 - [c239]Ryo Wakisaka, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Blind Speech Prior Estimation for Generalized Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. INTERSPEECH 2011: 361-364 - [c238]Nobuhiko Hattori, Tomoki Toda, Hisashi Kawai, Hiroshi Saruwatari, Kiyohiro Shikano:
Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation. INTERSPEECH 2011: 2769-2772 - [c237]Hiroshi Saruwatari, Nobuhisa Hirata, Toshiyuki Hatta, Ryo Wakisaka, Kiyohiro Shikano, Tomoya Takatani:
Semi-blind speech extraction for robot using visual information and noise statistics. ISSPIT 2011: 264-269 - 2010
- [j60]Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo:
Musical-Noise Analysis in Methods of Integrating Microphone Array and Spectral Subtraction Based on Higher-Order Statistics. EURASIP J. Adv. Signal Process. 2010 (2010) - [j59]Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Adaptive Training for Voice Conversion Based on Eigenvoices. IEICE Trans. Inf. Syst. 93-D(6): 1589-1598 (2010) - [j58]Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion. IEICE Trans. Inf. Syst. 93-D(7): 1909-1917 (2010) - [j57]Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models. IEICE Trans. Inf. Syst. 93-D(9): 2472-2482 (2010) - [j56]Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Improvements of the One-to-Many Eigenvoice Conversion System. IEICE Trans. Inf. Syst. 93-D(9): 2491-2499 (2010) - [j55]Tatsuya Hirahara, Makoto Otani, Shota Shimizu, Tomoki Toda, Keigo Nakamura, Yoshitaka Nakajima, Kiyohiro Shikano:
Silent-speech enhancement using body-conducted vocal-tract resonance signals. Speech Commun. 52(4): 301-313 (2010) - [j54]Panikos Heracleous, V.-A. Tran, Takayuki Nagai, Kiyohiro Shikano:
Analysis and Recognition of NAM Speech Using HMM Distances and Visual Information. IEEE Trans. Speech Audio Process. 18(6): 1528-1538 (2010) - [c236]Yohei Ishikawa, Hiroshi Saruwatari, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo:
Musical noise controllable algorithm of channelwise spectral subtraction and beamforming based on higher-order statistics criterion. CIP 2010: 81-86 - [c235]Takayuki Inoue, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo:
Theoretical analysis of musical noise in generalized spectral subtraction: Why should not use power/amplitude subtraction? EUSIPCO 2010: 994-998 - [c234]Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Blind signal extraction based joint suppression of diffuse background noise and late reverberation. EUSIPCO 2010: 1534-1538 - [c233]Hiroshi Saruwatari, Ryoi Okamoto, Yu Takahashi, Kiyohiro Shikano:
Blind Speech Extraction Combining Generalized MMSE STSA Estimator and ICA-Based Noise and Speech Probability Density Function Estimations. LVA/ICA 2010: 49-56 - [c232]Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Complex Newton algorithm for blind signal extraction of speech in diffuse noise. ICASSP 2010: 213-216 - [c231]Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Statistical approach to enhancing esophageal speech based on Gaussian mixture models. ICASSP 2010: 4250-4253 - [c230]Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Speech enhancement in presence of diffuse background noise: Why using blind signal extraction? ICASSP 2010: 4770-4773 - [c229]Ryoi Okamoto, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano:
MMSE STSA estimator with nonstationary noise estimation based on ICA for high-quality speech enhancement. ICASSP 2010: 4778-4781 - [c228]Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Non-parallel training for many-to-many eigenvoice conversion. ICASSP 2010: 4822-4825 - [c227]Rafael Torres, Shota Takeuchi, Hiromichi Kawanami, Tomoko Matsui, Hiroshi Saruwatari, Kiyohiro Shikano:
Comparison of methods for topic classification in a speech-oriented guidance system. INTERSPEECH 2010: 1261-1264 - [c226]Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion. INTERSPEECH 2010: 1628-1631 - [c225]Kumi Ohta, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano:
Adaptive voice-quality control based on one-to-many eigenvoice conversion. INTERSPEECH 2010: 2158-2161 - [c224]Hiroshi Sawada, Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Improvement of speech recognition performance for spoken-oriented robot dialog system using end-fire array. IROS 2010: 970-975 - [c223]Chie Hayashida, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano:
Linear transformation approaches to many-to-one voice conversion. SSW 2010: 74-79
2000 – 2009
- 2009
- [j53]Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:
Enhancement of speech signals separated from their convolutive mixture by FDICA algorithm. Digit. Signal Process. 19(1): 127-133 (2009) - [j52]Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Techniques in rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics. Speech Commun. 51(1): 42-57 (2009) - [j51]Yu Takahashi, Tomoya Takatani, Keiichi Osako, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment. IEEE Trans. Speech Audio Process. 17(4): 650-664 (2009) - [c222]Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano:
Enhanced Wiener post-processing based on partial projection back of the blind signal separation noise estimate. EUSIPCO 2009: 1442-1446 - [c221]Takashi Hiekata, Takashi Morita, Youhei Ikeda, Hiroshi Hashimoto, Ruoyu Zhang, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano:
Multiple ICA-based real-time blind source extraction applied to handy size microphone. ICASSP 2009: 121-124 - [c220]Yu Takahashi, Yoshihisa Uemura, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo:
Musical noise analysis based on higher order statistics for microphone array and nonlinear signal processing. ICASSP 2009: 229-232 - [c219]Shigeki Miyabe, Biing-Hwang Juang, Hiroshi Saruwatari, Kiyohiro Shikano:
Kernel-based nonlinear independent component analysis for underdetermined blind source separation. ICASSP 2009: 1641-1644 - [c218]Tomoki Toda, Keigo Nakamura, Hidehiko Sekimoto, Kiyohiro Shikano:
Voice conversion for various types of body transmitted speech. ICASSP 2009: 3601-3604 - [c217]Yu Takahashi, Hiroshi Saruwatari, Yuki Fujihara, Kentaro Tachibana, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka:
Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system. ICASSP 2009: 3681-3684 - [c216]Hiroshi Saruwatari, Hiromichi Kawanami, Shota Takeuchi, Yu Takahashi, Tobias Cincarek, Kiyohiro Shikano:
Hands-free speech recognition challenge for real-world speech dialogue systems. ICASSP 2009: 3729-3732 - [c215]Daisuke Miyamoto, Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Acoustic compensation methods for body transmitted speech conversion. ICASSP 2009: 3901-3904 - [c214]Yoshihisa Uemura, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano, Kazunobu Kondo:
Musical noise generation analysis for noise reduction methods based on spectral subtraction and MMSE STSA estimation. ICASSP 2009: 4433-4436 - [c213]Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano:
Target Speech Enhancement in Presence of Jammer and Diffuse Background Noise. ICA 2009: 565-572 - [c212]Tomoki Toda, Keigo Nakamura, Takayuki Nagai, Tomomi Kaino, Yoshitaka Nakajima, Kiyohiro Shikano:
Technologies for processing body-conducted speech detected with non-audible murmur microphone. INTERSPEECH 2009: 632-635 - [c211]Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Electrolaryngeal speech enhancement based on statistical voice conversion. INTERSPEECH 2009: 1431-1434 - [c210]Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Many-to-many eigenvoice conversion with reference voice. INTERSPEECH 2009: 1623-1626 - [c209]Jani Even, Hiroshi Sawada, Hiroshi Saruwatari, Kiyohiro Shikano, Tomoya Takatani:
Semi-blind suppression of internal noise for hands-free robot spoken dialog system. IROS 2009: 658-663 - [c208]Shigeki Miyabe, Keisuke Masatoki, Hiroshi Saruwatari, Kiyohiro Shikano, Toshiyuki Nomura:
Temporal quantization of spatial information using directional clustering for multichannel audio coding. WASPAA 2009: 261-264 - 2008
- [j50]Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training. IEICE Trans. Inf. Syst. 91-D(3): 499-507 (2008) - [j49]Tobias Cincarek, Hiromichi Kawanami, Ryuichi Nisimura, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System. IEICE Trans. Inf. Syst. 91-D(3): 576-587 (2008) - [j48]Goshu Nagino, Makoto Shozakai, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method. IEICE Trans. Inf. Syst. 91-D(3): 607-614 (2008) - [j47]Yuki Yai, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura:
Rapid Compensation of Temperature Fluctuation Effect for Multichannel Sound Field Reproduction System. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 91-A(6): 1329-1336 (2008) - [j46]Keiichi Osako, Yoshimitsu Mori, Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano:
Fast Convergence Blind Source Separation Using Frequency Subband Interpolation by Null Beamforming. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 91-A(6): 1357-1361 (2008) - [c207]Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano:
Extension of score function difference for frequency domain blind source separation. EUSIPCO 2008: 1-5 - [c206]Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano:
Frequency domain semi-blind signal separation: application to the rejection of internal noises. ICASSP 2008: 157-160 - [c205]Yuuki Haraguchi, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Toshiyuki Nomura:
Source-oriented localization control of stereo audio signals based on blind source separation. ICASSP 2008: 177-180 - [c204]Yuuta Yuyama, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano:
Hybrid structure of inverse filtering and DOA-parameterized wavefront synthesis. ICASSP 2008: 401-404 - [c203]Randy Gomez, Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano:
Distant talking robust speech recognition using late reflection components of room impulse response. ICASSP 2008: 4581-4584 - [c202]Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Question and answer database optimization using speech recognition results. INTERSPEECH 2008: 451-454 - [c201]Hiroshi Saruwatari, Yu Takahashi, Hiroyuki Sakai, Shota Takeuchi, Tobias Cincarek, Hiromichi Kawanami, Kiyohiro Shikano:
Development and evaluation of hands-free spoken dialogue system for railway station guidance. INTERSPEECH 2008: 455-458 - [c200]Takashi Muramatsu, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory. INTERSPEECH 2008: 1076-1079 - [c199]Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
An improved one-to-many eigenvoice conversion system. INTERSPEECH 2008: 1080-1083 - [c198]Randy Gomez, Jani Even, Kiyohiro Shikano:
Rapid unsupervised speaker adaptation robust in reverberant environment conditions. INTERSPEECH 2008: 1309-1312 - [c197]Hideki Okamoto, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Speaker verification with non-audible murmur segments by combining global alignment kernel and penalized logistic regression machine. INTERSPEECH 2008: 1369-1372 - [c196]Daisuke Tani, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano:
Maximum a posteriori adaptation for many-to-one eigenvoice conversion. INTERSPEECH 2008: 1461-1463 - [c195]Keigo Nakamura, Tomoki Toda, Yoshitaka Nakajima, Hiroshi Saruwatari, Kiyohiro Shikano:
Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments. INTERSPEECH 2008: 2209-2212 - [c194]Yu Takahashi, Hiroshi Saruwatari, Kiyohiro Shikano:
Real-time implementation of blind spatial subtraction array for hands-free robot spoken dialogue system. IROS 2008: 1687-1692 - [c193]Jani Even, Hiroshi Saruwatari, Kiyohiro Shikano:
An improved permutation solver for blind signal separation based front-ends in robot audition. IROS 2008: 2172-2177 - [c192]Jumpei Miyake, Shota Takeuchi, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Language model for the web search task in a spoken dialogue system for children. WOCCI 2008: 10 - 2007
- [j45]Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano:
Unvoiced Speech Recognition Using Tissue-Conductive Acoustic Sensor. EURASIP J. Adv. Signal Process. 2007 (2007) - [j44]Shigeki Miyabe, Yoichi Hinamoto, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura:
Interface for Barge-in Free Spoken Dialogue System Based on Sound Field Reproduction and Microphone Array. EURASIP J. Adv. Signal Process. 2007 (2007) - [j43]Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics. IEICE Trans. Inf. Syst. 90-D(2): 554-561 (2007) - [c191]Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Development and portability of ASR and Q&A modules for real-environment speech-oriented guidance systems. ASRU 2007: 520-525 - [c190]Shigeki Miyabe, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura:
Barge-in- and noise-free spoken dialogue interface based on sound field control and semi-blind source separation. EUSIPCO 2007: 232-236 - [c189]Kentaro Tachibana, Hiroshi Saruwatari, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka:
Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA. ICASSP (1) 2007: 45-48 - [c188]Yu Takahashi, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano:
Permutation-Robust Structure for ICA-Based Blind Source Extraction. ICASSP (1) 2007: 149-152 - [c187]Tobias Cincarek, Ryuichi Nisimura, Akinobu Lee, Kiyohiro Shikano:
Insights Gained from Development and Long-Term Operation of a Real-Environment Speech-Oriented Guidance System. ICASSP (4) 2007: 157-160 - [c186]Yoshimitsu Mori, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita:
High-Presence Hearing-Aid System using DSP-Based Real-Time Blind Source Separation Module. ICASSP (4) 2007: 609-612 - [c185]Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano:
One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices. ICASSP (4) 2007: 1249-1252 - [c184]Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection. INTERSPEECH 2007: 262-265 - [c183]Goshu Nagino, Makoto Shozakai, Kiyohiro Shikano:
How to judge reusability of existing speech corpora for target task by utilizing statistical multidimensional scaling. INTERSPEECH 2007: 1302-1305 - [c182]Tobias Cincarek, Izumi Shindo, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Development of preschool children subsystem for ASR and Q&A in a real-environment speech-oriented guidance task. INTERSPEECH 2007: 1469-1472 - [c181]Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model. INTERSPEECH 2007: 1981-1984 - [c180]Hideki Okamoto, Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Study on speaker verification with non-audible murmur segments. INTERSPEECH 2007: 2017-2020 - [c179]Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees. INTERSPEECH 2007: 2517-2520 - [c178]Yoshimitsu Mori, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita:
Noise-robust hands-free speech recognition using SIMO-model-based blind source separation. ISSPA 2007: 1-4 - [c177]Yu Takahashi, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano:
Robust spatial subtraction array with independent component analysis for speech enhancement. ISSPA 2007: 1-4 - [c176]Hiroaki Kokubo, Nobuo Hataoka, Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Real-Time Continuous Speech Recognition System on SH-4A Microprocessor. MMSP 2007: 35-38 - [c175]Hiroyuki Sakai, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano, Akinobu Lee:
Voice activity detection applied to hands-free spoken dialogue robot based on decoding using acoustic and language model. ROBOCOMM 2007: 16 - [c174]Kumi Ohta, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Regression approaches to voice quality control based on one-to-many eigenvoice conversion. SSW 2007: 101-106 - [c173]Daisuke Tani, Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
An evaluation of many-to-one voice conversion algorithms with pre-stored speaker data sets. SSW 2007: 107-112 - [p1]Hiroshi Saruwatari, Tomoya Takatani, Kiyohiro Shikano:
SIMO-Model-Based Blind Source Separation - Principle and its Applications. Blind Speech Separation 2007: 149-168 - 2006
- [j42]Yoshimitsu Mori, Hiroshi Saruwatari, Tomoya Takatani, Satoshi Ukai, Kiyohiro Shikano, Takashi Hiekata, Youhei Ikeda, Hiroshi Hashimoto, Takashi Morita:
Blind Separation of Acoustic Signals Combining SIMO-Model-Based Independent Component Analysis and Binary Masking. EURASIP J. Adv. Signal Process. 2006 (2006) - [j41]Yoshitaka Nakajima, Hideki Kashioka, Nick Campbell, Kiyohiro Shikano:
Non-Audible Murmur (NAM) Recognition. IEICE Trans. Inf. Syst. 89-D(1): 1-4 (2006) - [j40]Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura:
Interface for Barge-in Free Spoken Dialogue System Using Nullspace Based Sound Field Control and Beamforming. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 89-A(3): 716-726 (2006) - [j39]Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Utterance-Based Selective Training for the Automatic Creation of Task-Dependent Acoustic Models. IEICE Trans. Inf. Syst. 89-D(3): 962-969 (2006) - [j38]Randy Gomez, Akinobu Lee, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models. IEICE Trans. Inf. Syst. 89-D(3): 998-1005 (2006) - [j37]Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano:
An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis. Speech Commun. 48(1): 45-56 (2006) - [j36]Hiroshi Saruwatari, Toshiya Kawamura, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano:
Blind source separation based on a fast-convergence algorithm combining ICA and beamforming. IEEE Trans. Speech Audio Process. 14(2): 666-678 (2006) - [c172]Yoshimitsu Mori, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita:
Two-stage blind separation of moving sound sources with pocket-size real-time DSP module. EUSIPCO 2006: 1-5 - [c171]Yoshimitsu Mori, Hiroshi Saruwatari, Tomoya Takatani, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita:
ICA and Binary-Mask-Based Blind Source Separation with Small Directional Microphones. ICA 2006: 649-657 - [c170]Shigeki Miyabe, Tomoya Takatani, Yoshimitsu Mori, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura:
Double-Talk Free Spoken Dialogue Interface Combining Sound Field Control With Semi-Blind Source Separation. ICASSP (1) 2006: 809-812 - [c169]Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Improving Rapid Unsupervised Speaker Adaptation Based On HMM Sufficient Statistics. ICASSP (1) 2006: 1001-1004 - [c168]Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training. INTERSPEECH 2006 - [c167]Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Speaker verification with non-audible murmur segments. INTERSPEECH 2006 - [c166]Mikihiro Nakagiri, Tomoki Toda, Hideki Kashioka, Kiyohiro Shikano:
Improving body transmitted unvoiced speech with statistical voice conversion. INTERSPEECH 2006 - [c165]Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech. INTERSPEECH 2006 - [c164]Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. INTERSPEECH 2006 - [c163]Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano:
Eigenvoice conversion based on Gaussian mixture model. INTERSPEECH 2006 - [c162]Tomoyuki Kato, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria. LREC 2006: 789-792 - [c161]Hiromichi Kawanami, Takahiro Kitamura, Kiyohiro Shikano:
Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech. LREC 2006: 2586-2589 - [c160]Hiroaki Kokubo, Nobuo Hataoka, Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Embedded Julius: Continuous Speech Recognition Software for Microprocessor. MMSP 2006: 378-381 - 2005
- [j35]Kiyohiro Shikano:
Foreword. IEICE Trans. Inf. Syst. 88-D(3): 365 (2005) - [j34]Kazuki Adachi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Designing Target Cost Function Based on Prosody of Speech Database. IEICE Trans. Inf. Syst. 88-D(3): 519-524 (2005) - [j33]Satoshi Ukai, Tomoya Takatani, Hiroshi Saruwatari, Kiyohiro Shikano, Ryo Mukai, Hiroshi Sawada:
Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88-A(3): 642-650 (2005) - [j32]Tatsunori Asai, Hiroshi Saruwatari, Kiyohiro Shikano:
Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88-A(6): 1613-1618 (2005) - [j31]Tomoya Takatani, Satoshi Ukai, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
A Self-Generator Method for Initial Filters of SIMO-ICA Applied to Blind Separation of Binaural Sound Mixtures. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88-A(7): 1673-1682 (2005) - [j30]Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind Separation of Speech by Fixed-Point ICA with Source Adaptive Negentropy Approximation. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88-A(7): 1683-1692 (2005) - [j29]Yosuke Tatekura, Shigefumi Urata, Hiroshi Saruwatari, Kiyohiro Shikano:
On-Line Relaxation Algorithm Applicable to Acoustic Fluctuation for Inverse Filter in Multichannel Sound Reproduction System. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88-A(7): 1747-1756 (2005) - [j28]Hiroshi Saruwatari, Hiroaki Yamajo, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano:
Blind Separation and Deconvolution for Convolutive Mixture of Speech Combining SIMO-Model-Based ICA and Multichannel Inverse Filtering. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88-A(9): 2387-2400 (2005) - [j27]Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:
Estimation of Shape Parameter of GGD Function by Negentropy Matching. Neural Process. Lett. 22(3): 377-389 (2005) - [c159]Panikos Heracleous, Yoshitaka Nakajima, Hiroshi Saruwatari, Kiyohiro Shikano:
A tissue-conductive acoustic sensor applied in speech recognition for privacy. sOc-EUSAI 2005: 93-97 - [c158]Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano, Yosuke Tatekura:
Barge-in free spoken dialogue interface using nullspace-based sound field control and beamforming. EUSIPCO 2005: 1-4 - [c157]Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind separation of more than two sources based on high-convergence algorithm combining ICA and beamforming. EUSIPCO 2005: 1-4 - [c156]Hiroshi Saruwatari, Satoshi Ukai, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano:
Two-stage blind source separation combining SIMO-model-based ICA and adaptive beamforming. EUSIPCO 2005: 1-4 - [c155]Tomoya Takatani, Satoshi Ukai, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind separation of binaural sound mixtures using SIMO-ICA with self-generator for initial filter. EUSIPCO 2005: 1-4 - [c154]Hiroshi Saruwatari, Katsuyuki Sawai, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata, Daisuke Saitoh:
Speech Enhancement Based on Blind Source Separation in Car Environments. ICDE Workshops 2005: 1205 - [c153]Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments. INTERSPEECH 2005: 293-296 - [c152]Yoshitaka Nakajima, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell:
Remodeling of the sensor for non-audible murmur (NAM). INTERSPEECH 2005: 389-392 - [c151]Ryuichi Nisimura, Akinobu Lee, Masashi Yamada, Kiyohiro Shikano:
Operating a public spoken guidance system in real environment. INTERSPEECH 2005: 845-848 - [c150]Tomoki Toda, Kiyohiro Shikano:
NAM-to-speech conversion with Gaussian mixture models. INTERSPEECH 2005: 1957-1960 - [c149]Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano:
Investigating the role of the Lombard reflex in non-audible murmur (NAM) recognition. INTERSPEECH 2005: 2649-2652 - [c148]Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano:
Applications of NAM microphones in speech recognition for privacy in human-machine communication. INTERSPEECH 2005: 3041-3044 - [c147]Tomoya Takatani, Satoshi Ukai, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind sound scene decomposition for robot audition using SIMO-model-based ICA. IROS 2005: 2247-2252 - [c146]Hiroshi Saruwatari, Yoshimitsu Mori, Tomoya Takatani, Satoshi Ukai, Kiyohiro Shikano, Takashi Hiekata, Takashi Morita:
Two-stage blind source separation based on ICA and binary masking for real-time robot audition system. IROS 2005: 2303-2308 - [c145]Yasuaki Ohashi, Tsuyoki Nishikawa, Hiroshi Saruwatari, Akinobu Lee, Kiyohiro Shikano:
Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition. IROS 2005: 2328-2332 - 2004
- [j26]Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:
Robots that can hear, understand and talk. Adv. Robotics 18(5): 533-564 (2004) - [j25]Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:
Negentropy based voice-activity detection for noise estimation in very low SNR condition. IEICE Electron. Express 1(16): 495-500 (2004) - [j24]Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano:
Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers Based on the 3-D N-Best Search Method. J. VLSI Signal Process. 36(2-3): 105-116 (2004) - [c144]Panikos Heracleous, Yoshitaka Nakajima, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Audible (normal) speech and inaudible murmur recognition using NAM microphone. EUSIPCO 2004: 329-332 - [c143]Hiroaki Yamajo, Hiroshi Saruwatari, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano:
Evaluation of blind separation and deconvolution for binaural-sound mixtures using SIMO-model-based ICA. EUSIPCO 2004: 1709-1712 - [c142]Yosuke Tatekura, Shigefumi Urata, Hiroshi Saruwatari, Kiyohiro Shikano:
On-line adaptive algorithm to acoustic fluctuation for inverse filter relaxation in sound reproduction system. EUSIPCO 2004: 1765-1768 - [c141]Satoshi Ukai, Hiroshi Saruwatari, Tomoya Takatani, Kiyohiro Shikano, Ryo Mukai, Hiroshi Sawada:
Evaluation of Multistage SIMO-Model-Based Blind Source Separation Combining Frequency-Domain ICA and Time-Domain ICA. ICA 2004: 626-633 - [c140]Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:
Single Channel Speech Enhancement: MAP Estimation Using GGD Prior Under Blind Setup. ICA 2004: 873-880 - [c139]Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano, Atsunobu Kaminuma:
Stable and Low-Distortion Algorithm Based on Overdetermined Blind Separation for Convolutive Mixtures of Speech. ICA 2004: 881-888 - [c138]Tomoya Takatani, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind separation of binaural sound mixtures using SIMO-model-based independent component analysis. ICASSP (4) 2004: 113-116 - [c137]Tsuyoki Nishikawa, Hiroshi Abe, Hiroshi Saruwatari, Kiyohiro Shikano:
Overdetermined blind separation for convolutive mixtures of speech based on multistage ICA using subarray processing. ICASSP (1) 2004: 225-228 - [c136]Ryuichi Nisimura, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Public speech-oriented guidance system with adult and child discrimination capability. ICASSP (1) 2004: 433-436 - [c135]Akinobu Lee, Kiyohiro Shikano, Tatsuya Kawahara:
Real-time word confidence scoring using local posterior probabilities on tree trellis search. ICASSP (1) 2004: 793-796 - [c134]Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:
MAP estimation of speech spectral component under GGD a priori. SAPA@INTERSPEECH 2004: 115 - [c133]Akinobu Lee, Keisuke Nakamura, Ryuichi Nisimura, Hiroshi Saruwatari, Kiyohiro Shikano:
Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs. INTERSPEECH 2004: 173-176 - [c132]Shinichi Yoshizawa, Kiyohiro Shikano:
Rapid EM training based on model-integration. INTERSPEECH 2004: 649-652 - [c131]Panikos Heracleous, Yoshitaka Nakajima, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Non-audible murmur (NAM) speech recognition using a stethoscopic NAM microphone. INTERSPEECH 2004: 1469-1472 - [c130]Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Robust speech recognition with spectral subtraction in low SNR. INTERSPEECH 2004: 2077-2080 - [c129]Tatsunori Asai, Shigeki Miyabe, Hiroshi Saruwatari, Kiyohiro Shikano:
Interface for barge-in free spoken dialogue system using adaptive sound field control. INTERSPEECH 2004: 2665-2668 - [c128]Tatsuya Kawahara, Akinobu Lee, Kazuya Takeda, Katsunobu Itou, Kiyohiro Shikano:
Recent progress of open-source LVCSR engine Julius and Japanese model repository. INTERSPEECH 2004: 3069-3072 - [c127]Kazuki Adachi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification. LREC 2004 - 2003
- [j23]Hiroshi Saruwatari, Satoshi Kurita, Kazuya Takeda, Fumitada Itakura, Tsuyoki Nishikawa, Kiyohiro Shikano:
Blind Source Separation Combining Independent Component Analysis and Beamforming. EURASIP J. Adv. Signal Process. 2003(11): 1135-1146 (2003) - [j22]Hiroshi Saruwatari, Toshiya Kawamura, Tsuyoki Nishikawa, Kiyohiro Shikano:
Fast-Convergence Algorithm for Blind Source Separation Based on Array Signal Processing. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 86-A(3): 634-639 (2003) - [j21]Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind Source Separation of Acoustic Signals Based on Multistage ICA Combining Frequency-Domain ICA and Time-Domain ICA. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 86-A(4): 846-858 (2003) - [j20]Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Stable Learning Algorithm for Blind Separation of Temporally Correlated Acoustic Signals Combining Multistage ICA and Linear Prediction. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 86-A(8): 2028-2036 (2003) - [j19]Takanobu Nishiura, Ryousuke Nishioka, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Multiple beamforming with source localization based on CSP analysis. Syst. Comput. Jpn. 34(5): 69-80 (2003) - [c126]Yoichi Hinamoto, Kouichi Mino, Hiroshi Saruwatari, Kiyohiro Shikano:
Interface for barge-in free spoken dialogue system based on sound field control and microphone array. ICASSP (5) 2003: 505-508 - [c125]Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano:
Segment selection considering local degradation of naturalness in concatenative speech synthesis. ICASSP (1) 2003: 696-699 - [c124]Yoshitaka Nakajima, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell:
Non-audible murmur recognition input interface using stethoscopic microphone attached to the skin. ICASSP (5) 2003: 708-711 - [c123]Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano:
A semi-blind source separation method for hands-free speech recognition of multiple talkers. INTERSPEECH 2003: 509-512 - [c122]Hiroaki Yamajo, Hiroshi Saruwatari, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano:
Blind separation and deconvolution for convolutive mixture of speech using SIMO-model-based ICA and multichannel inverse filtering. INTERSPEECH 2003: 537-540 - [c121]Shingo Yamade, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Unsupervised speaker adaptation based on HMM sufficient statistics in various noisy environments. INTERSPEECH 2003: 1493-1496 - [c120]Takanobu Nishiura, Satoshi Nakamura, Kazuhiro Miki, Kiyohiro Shikano:
Environmental sound source identification based on hidden Markov model for robust speech recognition. INTERSPEECH 2003: 2157-2160 - [c119]Tatsuya Shiraishi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Simple designing methods of corpus-based visual speech synthesis. INTERSPEECH 2003: 2241-2244 - [c118]Hiromichi Kawanami, Yohei Iwami, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
GMM-based voice conversion applied to emotional speech synthesis. INTERSPEECH 2003: 2401-2404 - [c117]Yoshitaka Nakajima, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell:
Non-audible murmur recognition. INTERSPEECH 2003: 2601-2604 - [c116]Shinichi Yoshizawa, Kiyohiro Shikano:
Model-integration rapid training based on maximum likelihood for speech recognition. INTERSPEECH 2003: 2621-2624 - [c115]Tomoya Takatani, Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
High-fidelity blind separation for convolutive mixture of acoustic signals using SIMO-model-based independent component analysis. ISSPA (2) 2003: 77-80 - [c114]Hiroshi Saruwatari, Hiroaki Yamajo, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano:
Blind separation and deconvolution of MIMO system driven by colored inputs using SIMO-model-based ICA with information-geometric learning. NNSP 2003: 379-388 - [c113]Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Stable learning algorithm for low-distortion blind separation of real speech mixture combining multistage ICA and linear prediction. NOLISP 2003: 8 - 2002
- [j18]Yosuke Tatekura, Hiroshi Saruwatari, Kiyohiro Shikano:
Sound Reproduction System Including Adaptive Compensation of Temperature Fluctuation Effect for Broad-Band Sound Control. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 85-A(8): 1851-1860 (2002) - [j17]Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Distant-talking speech recognition based on a 3-D Viterbi search using a microphone array. IEEE Trans. Speech Audio Process. 10(2): 48-56 (2002) - [c112]Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Comparison of time-domain ICA, frequency-domain ICA and multistage ICA for blind source separation. EUSIPCO 2002: 1-4 - [c111]Hiroshi Saruwatari, Toshiya Kawamura, Katsuyuki Sawai, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata:
Evaluation of fast-convergence algorithm for ICA-based blind source separation of real convolutive mixture. EUSIPCO 2002: 1-4 - [c110]Yosuke Tatekura, Hiroshi Saruwatari, Kiyohiro Shikano:
Adaptive compensation of temperature fluctuation effect in sound reproduction system. EUSIPCO 2002: 1-4 - [c109]Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano:
Unit selection algorithm for Japanese speech synthesis based on both phoneme unit and diphone unit. ICASSP 2002: 465-468 - [c108]Takanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano:
Talker localization in a real acoustic environment based on DOA estimation and statistical sound source identification. ICASSP 2002: 893-896 - [c107]Tsuyoki Nishikawa, Hiroshi Saruwatari, Kiyohiro Shikano:
Blind source separation based on Multi-Stage ICA combining frequency-domain ICA and time-domain ICA. ICASSP 2002: 917-920 - [c106]Yosuke Tatekura, Hiroshi Saruwatari, Kiyohiro Shikano:
Sound reproduction system with adaptive compensation of temperature fluctuation effect. DSP 2002: 989-992 - [c105]Mikiko Mashimo, Tomoki Toda, Hiromichi Kawanami, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell:
Evaluation of cross-language voice conversion using bilingual and non-bilingual databases. INTERSPEECH 2002: 293-296 - [c104]Shingo Yamade, Kanako Matsunami, Akira Baba, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics. INTERSPEECH 2002: 1045-1048 - [c103]Hiroshi Saruwatari, Katsuyuki Sawai, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata:
Speech enhancement in car environment using blind source separation. INTERSPEECH 2002: 1781-1784 - [c102]Takanobu Nishiura, Satoshi Nakamura, Yuka Okada, Takeshi Yamada, Kiyohiro Shikano:
Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition. INTERSPEECH 2002: 1789-1792 - [c101]Toshio Hirai, Seiichi Tenpaku, Kiyohiro Shikano:
Using start/end timings of spectral transitions between phonemes in concatenative speech synthesis. INTERSPEECH 2002: 2357-2360 - [c100]Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano:
Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer. INTERSPEECH 2002: 2425-2428 - [c99]Akinobu Lee, Yuichiro Mera, Hiroshi Saruwatari, Kiyohiro Shikano:
Selective multi-path acoustic model based on database likelihoods. INTERSPEECH 2002: 2661-2664 - [c98]Ryuichi Nisimura, Takashi Uchida, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano, Yoshio Matsumoto:
ASKA: receptionist robot with speech dialogue system. IROS 2002: 1314-1319 - [c97]Hiromichi Kawanami, Tsuyoshi Masuda, Tomoki Toda, Kiyohiro Shikano:
Designing speech database with prosodic variety for expressive TTS system. LREC 2002 - [c96]Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Masato Mimura, Atsushi Yamada, Akinori Ito, Katsunobu Itou, Kiyohiro Shikano:
Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models. LREC 2002 - 2001
- [j16]Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano:
HMM-separation-based speech recognition for a distant moving speaker. IEEE Trans. Speech Audio Process. 9(2): 127-140 (2001) - [c95]Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Gaussian mixture selection using context-independent HMM. ICASSP 2001: 69-72 - [c94]Takanobu Nishiura, S. Nakamura, Kiyohiro Shikano:
Speech enhancement by multiple beamforming with reflection signal equalization. ICASSP 2001: 189-192 - [c93]Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano:
A microphone array-based 3-D N-best search algorithm for the simultaneous recognition of multiple sound sources in real environments. ICASSP 2001: 193-196 - [c92]Shinichi Yoshizawa, Akira Baba, Kanako Matsunami, Yuichiro Mera, Miichi Yamada, Kiyohiro Shikano:
Unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers. ICASSP 2001: 341-344 - [c91]Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum. ICASSP 2001: 841-844 - [c90]Ken'ichi Kumatani, Satoshi Nakamura, Kiyohiro Shikano:
An Adaptive Integration Based On Product HMM For Audio-Visual Speech Recognition. ICME 2001 - [c89]Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
High quality voice conversion based on Gaussian mixture model with dynamic frequency warping. INTERSPEECH 2001: 349-352 - [c88]Mikiko Mashimo, Tomoki Toda, Kiyohiro Shikano, Nick Campbell:
Evaluation of cross-language voice conversion based on GMM and STRAIGHT. INTERSPEECH 2001: 361-364 - [c87]Miichi Yamada, Akira Baba, Shinichi Yoshizawa, Yuichiro Mera, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection. INTERSPEECH 2001: 869-872 - [c86]Shinichi Yoshizawa, Akira Baba, Kanako Matsunami, Yuichiro Mera, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano:
Evaluation on unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers. INTERSPEECH 2001: 1219-1222 - [c85]Akira Baba, Shinichi Yoshizawa, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano:
Elderly acoustic model for large vocabulary continuous speech recognition. INTERSPEECH 2001: 1657-1660 - [c84]Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Julius - an open source real-time large vocabulary recognition engine. INTERSPEECH 2001: 1691-1694 - [c83]Ryuichi Nisimura, Kumiko Komatsu, Yuka Kuroda, Kentaro Nagatomo, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Automatic n-gram language model creation from web resources. INTERSPEECH 2001: 2127-2130 - [c82]Hiroshi Saruwatari, Toshiya Kawamura, Kiyohiro Shikano:
Blind source separation for speech based on fast-convergence algorithm with ICA and beamforming. INTERSPEECH 2001: 2603-2606 - [c81]Takanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano:
Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array. INTERSPEECH 2001: 2611-2614 - 2000
- [j15]Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano:
Model adaptation by HMM decomposition and composition in noisy reverberant environments. Syst. Comput. Jpn. 31(5): 77-85 (2000) - [c80]Takanobu Nishiura, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Localization of multiple sound sources based on a CSP analysis with a microphone array. ICASSP 2000: 1053-1056 - [c79]Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Kiyohiro Shikano:
A new phonetic tied-mixture model for efficient decoding. ICASSP 2000: 1269-1272 - [c78]Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano:
Speech recognition for a distant moving speaker based on HMM composition and separation. ICASSP 2000: 1403-1406 - [c77]Kiyotsugu Kakihara, Satoshi Nakamura, Kiyohiro Shikano:
Speech-to-Face Movement Synthesis based on HMMs. IEEE International Conference on Multimedia and Expo (I) 2000: 427- - [c76]Satoshi Nakamura, Hidetoshi Ito, Kiyohiro Shikano:
Stream weight optimization of speech and lip image sequence for audio-visual speech recognition. INTERSPEECH 2000: 20-24 - [c75]Hiroshi Saruwatari, Satoshi Kurita, Kazuya Takeda, Fumitada Itakura, Kiyohiro Shikano:
Blind source separation based on subband ICA and beamforming. INTERSPEECH 2000: 94-97 - [c74]Tomoki Toda, Jinlin Lu, Hiroshi Saruwatari, Kiyohiro Shikano:
STRAIGHT-based voice conversion algorithm based on Gaussian mixture model. INTERSPEECH 2000: 279-282 - [c73]Toshio Hirai, Seiichi Tenpaku, Kiyohiro Shikano:
Manipulating speech pitch periods according to optimal insertion/deletion position in residual signal for intonation control in speech synthesis. INTERSPEECH 2000: 330-333 - [c72]Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano:
Free software toolkit for Japanese large vocabulary continuous speech recognition. INTERSPEECH 2000: 476-479 - [c71]Parham Zolfaghari, Yoshinori Atake, Kiyohiro Shikano, Hideki Kawahara:
Investigation of analysis and synthesis parameters of STRAIGHT by subjective evaluation. INTERSPEECH 2000: 498-501 - [c70]Yoshinori Atake, Toshio Irino, Hideki Kawahara, Jinlin Lu, Satoshi Nakamura, Kiyohiro Shikano:
Robust fundamental frequency estimation using instantaneous frequencies of harmonic components. INTERSPEECH 2000: 907-910 - [c69]Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Ito, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee:
IPA Japanese Dictation Free Software Project. LREC 2000
1990 – 1999
- 1999
- [c68]Panikos Heracleous, Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Simultaneous recognition of multiple sound sources based on 3-D N-best search using microphone array. EUROSPEECH 1999: 69-72 - 1998
- [j14]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Lip movement synthesis from speech based on Hidden Markov Models. Speech Commun. 26(1-2): 105-115 (1998) - [c67]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Subjective Evaluation for HMM-Based Speech-To-Lip Movement Synthesis. AVSP 1998: 227-232 - [c66]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Lip Movement Synthesis from Speech Based on Hidden Markov Models. FG 1998: 154-159 - [c65]Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Hands-free speech recognition based on 3-D Viterbi search using a microphone array. ICASSP 1998: 245-248 - [c64]Makoto Shozakai, Satoshi Nakamura, Kiyohiro Shikano:
Robust speech recognition in car environments. ICASSP 1998: 269-272 - [c63]Hideki Banno, Jinlin Lu, Satoshi Nakamura, Kiyohiro Shikano, Hideki Kawahara:
Efficient representation of short-time phase based on group delay. ICASSP 1998: 861-864 - [c62]Alexandre Girardi, Kiyohiro Shikano, Satoshi Nakamura:
Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing. ICSLP 1998 - [c61]Katsunobu Itou, Mikio Yamamoto, Kazuya Takeda, Toshiyuki Takezawa, Tatsuo Matsuoka, Tetsunori Kobayashi, Kiyohiro Shikano, Shuichi Itahashi:
The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpus. ICSLP 1998 - [c60]Tatsuya Kawahara, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano:
Sharable software repository for Japanese large vocabulary continuous speech recognition. ICSLP 1998 - [c59]Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano, Masatoshi Morishima, Toshihiro Isobe:
Evaluation of model adaptation by HMM decomposition on telephone speech recognition. ICSLP 1998 - [c58]Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
An effect of adaptive beamforming on hands-free speech recognition based on 3-D Viterbi search. ICSLP 1998 - [c57]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Speech-to-lip movement synthesis based on the EM algorithm using audio-visual HMMs. ICSLP 1998 - [c56]Norimichi Yodo, Kiyohiro Shikano, Satoshi Nakamura:
Compression algorithm of trigram language models based on maximum likelihood estimation. ICSLP 1998 - [c55]Satoshi Nakamura, Eli Yamamoto, Kiyohiro Shikano:
Speech-to-lip movement synthesis maximizing audio-visual joint probability based on EM algorithm. MMSP 1998: 53-58 - 1997
- [c54]Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano:
Speech to lip movement synthesis by HMM. AVSP 1997: 137-140 - [c53]Tetsuya Takiguchi, Satoshi Nakamura, Qiang Hou, Kiyohiro Shikano:
Model adaptation based on HMM decomposition for reverberant speech recognition. ICASSP 1997: 827-830 - [c52]Alexandre Girardi, Harald Singer, Kiyohiro Shikano, Satoshi Nakamura:
Maximum likelihood successive state splitting algorithm for tied-mixture HMNET. EUROSPEECH 1997: 119-122 - [c51]Makoto Shozakai, Satoshi Nakamura, Kiyohiro Shikano:
A non-iterative model-adaptive e-CMN/PMC approach for speech recognition in car environments. EUROSPEECH 1997: 287-290 - [c50]Masaaki Inoue, Satoshi Nakamura, Takeshi Yamada, Kiyohiro Shikano:
Microphone array design measures for hands-free speech recognition. EUROSPEECH 1997: 331-334 - [c49]Satoshi Nakamura, Ron Nagai, Kiyohiro Shikano:
Improved bimodal speech recognition using tied-mixture HMMs and 5000 word audio-visual synchronous database. EUROSPEECH 1997: 1623-1626 - [c48]Satoshi Nakamura, Kiyohiro Shikano:
Room acoustics and reverberation: impact on hands-free recognition. EUROSPEECH 1997: 2419-2422 - 1996
- [c47]Satoshi Nakamura, Tetsuya Takiguchi, Kiyohiro Shikano:
Noise and room acoustics distorted speech recognition by HMM composition. ICASSP 1996: 69-72 - [c46]Tadashi Yonezaki, Kiyohiro Shikano:
Entropy coded vector quantization with hidden Markov models. ICSLP 1996: 310-313 - [c45]Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano:
Robust speech recognition with speaker localization by a microphone array. ICSLP 1996: 1317-1320 - 1995
- [j13]Osamu Yoshioka, Yasuhiro Minami, Kiyohiro Shikano:
A Speech Dialogue System with Multimodal Interface for Telephone Directory Assistance. IEICE Trans. Inf. Syst. 78-D(6): 616-621 (1995) - [j12]Satoshi Takahashi, Yasuhiro Minami, Kiyohiro Shikano:
An HMM State Duration Control Algorithm Applied to Large-Vocabulary Spontaneous Speech Recognition. IEICE Trans. Inf. Syst. 78-D(6): 648-653 (1995) - 1994
- [j11]Kiyohiro Shikano, Tomokazu Yamada, Takeshi Kawabata, Shoichi Matsunaga, Sadaoki Furui, Toshiyuki Hanazawa:
Dictation Machine Based on Japanese Character Source Modeling. Int. J. Pattern Recognit. Artif. Intell. 8(1): 181-196 (1994) - [j10]Yasuhiro Minami, Kiyohiro Shikano, Satoshi Takahashi, Tomokazu Yamada, Osamu Yoshioka, Sadaoki Furui:
Large-vocabulary continuous speech recognition algorithm applied to a multi-modal telephone directory assistance system. Speech Commun. 15(3-4): 301-310 (1994) - [c44]Yasuhiro Minami, Kiyohiro Shikano, Satoshi Takahashi, Tomokazu Yamada:
Search algorithm that merges candidates in meaning level for very large vocabulary spontaneous speech recognition. ICASSP (2) 1994: 141-144 - [c43]Satoshi Takahashi, Yasuhiro Minami, Kiyohiro Shikano:
An HMM duration control algorithm with a low computational cost. ICSLP 1994: 267-270 - [c42]Osamu Yoshioka, Yasuhiro Minami, Kiyohiro Shikano:
A multi-modal dialogue system for telephone directory assistance. ICSLP 1994: 887-890 - [c41]Yasuhiro Minami, Kiyohiro Shikano, Osamu Yoshioka, Satoshi Takahashi, Tomokazu Yamada, Sadaoki Furui:
A Large-Vocabulary Continuous Speech Recognition Algorithm and its Application to a Multi-Modal Telephone Directory Assistance System. HLT 1994 - 1993
- [c40]Satoshi Takahashi, Tatsuo Matsuoka, Yasuhiro Minami, Kiyohiro Shikano:
Phoneme HMMs constrained by frame correlations. ICASSP (2) 1993: 219-222 - [c39]Franck Martin, Kiyohiro Shikano, Yasuhiro Minami:
Recognition of noisy speech by composition of hidden Markov models. EUROSPEECH 1993: 1031-1034 - [c38]Yasuhiro Minami, Kiyohiro Shikano, Tomokazu Yamada, Tatsuo Matsuoka:
Very-large-vocabulary continuous speech recognition algorithm for telephone directory assistance. EUROSPEECH 1993: 2129-2132 - [c37]Shoichi Matsunaga, Tomokazu Yamada, Kiyohiro Shikano:
Dictation system using inductively auto-generated syntax. EUROSPEECH 1993: 2135-2138 - 1992
- [c36]Tomokazu Yamada, Shoichi Matsunaga, Kiyohiro Shikano:
Japanese dictation system using character source modeling. ICASSP 1992: 37-40 - [c35]Shoichi Matsunaga, Tomokazu Yamada, Kiyohiro Shikano:
Task adaptation in stochastic language models for continuous speech recognition. ICASSP 1992: 165-168 - [c34]Satoshi Takahashi, Tatsuo Matsuoka, Kiyohiro Shikano:
Phonemic HMM constrained by statistical VQ-code transition. ICASSP 1992: 553-556 - [c33]Akito Nagai, Kenji Kita, Toshiyuki Hanazawa, Tadashi Suzuki, Tomohiro Iwasaki, Tsuyoshi Kawabata, Kunio Nakajima, Kiyohiro Shikano, Tsuyoshi Morimoto, Shigeki Sagayama, Akira Kurematsu:
Hardware implementation of realtime 1000-word HMM-LR continuous speech recognition. ICSLP 1992: 237-240 - [c32]Tatsuo Matsuoka, Kiyohiro Shikano:
Speaker adaptation by modifying mixture coefficients of speaker-independent mixture Gaussian HMMs. ICSLP 1992: 373-376 - [c31]Shoichi Matsunaga, Toshiaki Tsuboi, Tomokazu Yamada, Kiyohiro Shikano:
Continuous speech recognition for medical diagnoses using a character trigram model. ICSLP 1992: 727-730 - [c30]Yasuhiro Minami, Tatsuo Matsuoka, Kiyohiro Shikano:
Phoneme HMM evaluation algorithm without phoneme labeling. ICSLP 1992: 1535-1538 - [c29]Sadaoki Furui, Kiyohiro Shikano, Shoichi Matsunaga, Tatsuo Matsuoka, Satoshi Takahashi, Tomokazu Yamada:
Recent Topics in Speech Recognition Research at NTT Laboratories. HLT 1992 - 1991
- [j9]Akira Kurematsu, Hitoshi Iida, Tsuyoshi Morimoto, Kiyohiro Shikano:
Language processing in connection with speech translation at ATR interpreting telephony research laboratories. Speech Commun. 10(1): 1-9 (1991) - [c28]Tomokazu Yamada, Toshiyuki Hanazawa, Takeshi Kawabata, Shoichi Matsunaga, Kiyohiro Shikano:
Phonetic typewriter based on phoneme source modeling. ICASSP 1991: 169-172 - [c27]Tatsuo Matsuoka, Kiyohiro Shikano:
Robust HMM phoneme modeling for different speaking styles. ICASSP 1991: 265-268 - 1990
- [j8]Hidefumi Sawai, Masanori Miyatake, Alex Waibel, Kiyohiro Shikano:
Spotting Phonemes and Syllables for Continuous Speech Recognition Using Time-Delay Neural Networks. Syst. Comput. Jpn. 21(9): 71-79 (1990) - [j7]Kaichiro Hatazaki, Yasuhiro Komori, Takeshi Kawabata, Kiyohiro Shikano:
Phoneme segmentation expert system using spectrogram reading knowledge. Syst. Comput. Jpn. 21(12): 90-100 (1990) - [j6]Yasuhiro Komori, Takeshi Kawabata, Kaichiro Hatazaki, Kiyohiro Shikano:
Phoneme recognition expert system using spectrogram reading knowledge and neural networks. Syst. Comput. Jpn. 21(12): 101-111 (1990) - [j5]Akira Kurematsu, Kazuya Takeda, Yoshinori Sagisaka, Shigeru Katagiri, Hisao Kuwabara, Kiyohiro Shikano:
ATR Japanese speech database as a tool of speech recognition and synthesis. Speech Commun. 9(4): 357-363 (1990) - [c26]Masami Nakamura, Katsuteru Maruyama, Takeshi Kawabata, Kiyohiro Shikano:
Neural Network Approach To Word Category Prediction For English Texts. COLING 1990: 213-218 - [c25]Toshiyuki Hanazawa, Kenji Kita, Satoshi Nakamura, Takeshi Kawabata, Kiyohiro Shikano:
ATR HMM-LR continuous speech recognition system. ICASSP 1990: 53-56 - [c24]Hiroaki Hattori, Satoshi Nakamura, Kiyohiro Shikano:
Supplementation of HMM for articulatory variation in speaker adaptation. ICASSP 1990: 153-156 - [c23]Satoshi Nakamura, Kiyohiro Shikano:
A comparative study of spectral mapping for speaker adaptation. ICASSP 1990: 157-160 - [c22]Masanobu Abe, Kiyohiro Shikano, Hisao Kuwabara:
Cross-language voice conversion. ICASSP 1990: 345-348 - [c21]Masanori Miyatake, Hidefumi Sawai, Yasuhiro Minami, Kiyohiro Shikano:
Integrated training for spotting Japanese phonemes using large phonemic time-delay neural networks. ICASSP 1990: 449-452 - [c20]Hiroaki Hattori, Satoshi Nakamura, Kiyohiro Shikano, Shigeki Sagayama:
Speaker weighted training of HMM using multiple reference speakers. ICSLP 1990: 149-152 - [c19]Takeshi Kawabata, Toshiyuki Hanazawa, Katsunobu Itou, Kiyohiro Shikano:
Japanese phonetic typewriter using HMM phone units and syllable trigrams. ICSLP 1990: 717-720 - [c18]Tsuyoshi Morimoto, Kiyohiro Shikano, Hitoshi Iida, Akira Kurematsu:
Integration of speech recognition and language processing in spoken language translation system (SL-TRANS). ICSLP 1990: 921-924 - [c17]Yasuhiro Minami, Toshiyuki Hanazawa, Hitoshi Iwamida, Erik McDermott, Kiyohiro Shikano, Shigeru Katagiri, Masaona Kagawa:
On the robustness of HMM and ANN speech recognition algorithms. ICSLP 1990: 1345-1348
1980 – 1989
- 1989
- [j4]Alexander Waibel, Toshiyuki Hanazawa, Geoffrey E. Hinton, Kiyohiro Shikano, Kevin J. Lang:
Phoneme recognition using time-delay neural networks. IEEE Trans. Acoust. Speech Signal Process. 37(3): 328-339 (1989) - [j3]Alex Waibel, Hidefumi Sawai, Kiyohiro Shikano:
Modularity and scaling in large phonemic neural networks. IEEE Trans. Acoust. Speech Signal Process. 37(12): 1888-1898 (1989) - [c16]Hidefumi Sawai, Alex Waibel, Masanori Miyatake, Kiyohiro Shikano:
Spotting Japanese CV-syllables and phonemes using time-delay neural networks. ICASSP 1989: 25-28 - [c15]Satoshi Nakamura, Kiyohiro Shikano:
Speaker adaptation applied to HMM and neural networks. ICASSP 1989: 89-92 - [c14]Alex Waibel, Hidefumi Sawai, Kiyohiro Shikano:
Consonant recognition by modular construction of large phonemic time-delay neural networks. ICASSP 1989: 112-115 - [c13]Kaichiro Hatazaki, Yasuhiro Komori, Takeshi Kawabata, Kiyohiro Shikano:
Phoneme segmentation using spectrogram reading knowledge. ICASSP 1989: 393-396 - [c12]Takeshi Kawabata, Kiyohiro Shikano:
Island-driven continuous speech recognizer using phone-based HMM word spotting. ICASSP 1989: 461-464 - [c11]Masami Nakamura, Kiyohiro Shikano:
A study of English word category prediction based on neural networks. ICASSP 1989: 731-734 - [c10]Yasuhiro Komori, Kaichiro Hatazaki, Takaharu Tanaka, Takeshi Kawabata, Kiyohiro Shikano:
Phoneme recognition expert system using spectrogram reading knowledge and neural networks. EUROSPEECH 1989: 2549-2552 - [c9]Patrick Haffner, Alex Waibel, Hidefumi Sawai, Kiyohiro Shikano:
Fast back-propagation learning methods for large phonemic neural networks. EUROSPEECH 1989: 2553-2556 - 1988
- [c8]Alex Waibel, Toshiyuki Hanazawa, Geoffrey E. Hinton, Kiyohiro Shikano, Kevin J. Lang:
Phoneme recognition: neural networks vs. hidden Markov models. ICASSP 1988: 107-110 - [c7]Masanobu Abe, Satoshi Nakamura, Kiyohiro Shikano, Hisao Kuwabara:
Voice conversion through vector quantization. ICASSP 1988: 655-658 - 1987
- [c6]Kiyohiro Shikano:
Improvement of word recognition results by trigram model. ICASSP 1987: 1261-1264 - 1986
- [j2]Shoichi Matsunaga, Kiyohiro Shikano:
Speech recognition based on top-down and bottom-up phoneme recognition. Syst. Comput. Jpn. 17(7): 95-106 (1986) - [c5]Kiyohiro Shikano, Kai-Fu Lee, Raj Reddy:
Speaker adaptation through vector quantization. ICASSP 1986: 2643-2646 - 1985
- [j1]Noboru Sugamura, Kiyohiro Shikano, Masaki Kohda:
Speaker-independent isolated word recognition based on multiple templates using split method. Syst. Comput. Jpn. 16(5): 10-20 (1985) - [c4]Kiyoaki Aikawa, Masahide Sugiyama, Kiyohiro Shikano:
Spoken word recognition based on top-down phoneme segmentation. ICASSP 1985: 33-36 - 1983
- [c3]Noboru Sugamura, Kiyohiro Shikano, Sadaoki Furui:
Isolated word recognition using phoneme-like templates. ICASSP 1983: 723-726 - 1981
- [c2]Kiyohiro Shikano:
Acoustic processing in the conversational speech recognition system. ICASSP 1981: 1164-1167
1970 – 1979
- 1976
- [c1]Masaki Kohda, Ryohei Nakatsu, Kiyohiro Shikano:
Speech recognition in the question-answering system operated by conversational speech. ICASSP 1976: 442-445
last updated on 2024-09-20 00:43 CEST by the dblp team
all metadata released as open data under CC0 1.0 license