default search action
Chiori Hori
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j31]Koichiro Yoshino, Yun-Nung Chen, Paul A. Crook, Satwik Kottur, Jinchao Li, Behnam Hedayatnia, Seungwhan Moon, Zhengcong Fei, Zekang Li, Jinchao Zhang, Yang Feng, Jie Zhou, Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Dilek Hakkani-Tur, Babak Damavandi, Alborz Geramifard, Chiori Hori, Ankit Shah, Chen Zhang, Haizhou Li, João Sedoc, Luis F. D'Haro, Rafael E. Banchs, Alexander Rudnicky:
Overview of the Tenth Dialog System Technology Challenge: DSTC10. IEEE ACM Trans. Audio Speech Lang. Process. 32: 765-778 (2024) - [c121]Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. ICASSP 2024: 1016-1020 - [c120]Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. ICASSP 2024: 1156-1160 - [c119]Chiori Hori, Pu Wang, Mahbub Rahman, Cristian J. Vaca-Rubio, Sameer Khurana, Anoop Cherian, Jonathan Le Roux:
WI-FI based Indoor Monitoring Enhanced by Multimodal Fusion. ICASSP 2024: 13296-13300 - [c118]Lingfeng Sun, Devesh K. Jha, Chiori Hori, Siddarth Jain, Radu Corcodel, Xinghao Zhu, Masayoshi Tomizuka, Diego Romeres:
Interactive Planning Using Large Language Models for Partially Observable Robotic Tasks. ICRA 2024: 14054-14061 - [i20]Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization. CoRR abs/2402.17907 (2024) - 2023
- [c117]Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction. ASRU 2023: 1-8 - [c116]Chiori Hori, Puyuan Peng, David Harwath, Xinyu Liu, Kei Ota, Siddarth Jain, Radu Corcodel, Devesh K. Jha, Diego Romeres, Jonathan Le Roux:
Style-transfer based Speech and Audio-visual Scene understanding for Robot Action Sequence Acquisition from Videos. INTERSPEECH 2023: 4663-4667 - [i19]Chiori Hori, Puyuan Peng, David Harwath, Xinyu Liu, Kei Ota, Siddarth Jain, Radu Corcodel, Devesh K. Jha, Diego Romeres, Jonathan Le Roux:
Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos. CoRR abs/2306.15644 (2023) - [i18]Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Generation or Replication: Auscultating Audio Latent Diffusion Models. CoRR abs/2310.10604 (2023) - [i17]Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux:
Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction. CoRR abs/2310.19644 (2023) - [i16]Lingfeng Sun, Devesh K. Jha, Chiori Hori, Siddarth Jain, Radu Corcodel, Xinghao Zhu, Masayoshi Tomizuka, Diego Romeres:
Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks. CoRR abs/2312.06876 (2023) - 2022
- [c115]Anoop Cherian, Chiori Hori, Tim K. Marks, Jonathan Le Roux:
(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering. AAAI 2022: 444-453 - [c114]Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori:
Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning. ICASSP 2022: 7732-7736 - [c113]Chiori Hori, Takaaki Hori, Jonathan Le Roux:
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers. INTERSPEECH 2022: 4511-4515 - [i15]Anoop Cherian, Chiori Hori, Tim K. Marks, Jonathan Le Roux:
(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering. CoRR abs/2202.09277 (2022) - 2021
- [j30]Seokhwan Kim, Hannes Schulz, R. Chulaka Gunasekara, Chiori Hori, Abhinav Rastogi, Luis Fernando D'Haro:
Editorial: Special Issue on the Eighth Dialog System Technology Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2434-2436 (2021) - [j29]Seokhwan Kim, Michel Galley, R. Chulaka Gunasekara, Sungjin Lee, Adam Atkinson, Baolin Peng, Hannes Schulz, Jianfeng Gao, Jinchao Li, Mahmoud Adada, Minlie Huang, Luis A. Lastras, Jonathan K. Kummerfeld, Walter S. Lasecki, Chiori Hori, Anoop Cherian, Tim K. Marks, Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta:
Overview of the Eighth Dialog System Technology Challenge: DSTC8. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2529-2540 (2021) - [c112]Shijie Geng, Peng Gao, Moitreya Chatterjee, Chiori Hori, Jonathan Le Roux, Yongfeng Zhang, Hongsheng Li, Anoop Cherian:
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers. AAAI 2021: 1415-1423 - [c111]Chiori Hori, Takaaki Hori, Jonathan Le Roux:
Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers. Interspeech 2021: 586-590 - [c110]Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux:
Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers. Interspeech 2021: 2097-2101 - [i14]Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux:
Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers. CoRR abs/2104.09426 (2021) - [i13]Chiori Hori, Takaaki Hori, Jonathan Le Roux:
Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers. CoRR abs/2108.02147 (2021) - [i12]Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori:
Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning. CoRR abs/2110.06894 (2021) - 2020
- [j28]Luis Fernando D'Haro, Koichiro Yoshino, Chiori Hori, Tim K. Marks, Lazaros Polymenakos, Jonathan K. Kummerfeld, Michel Galley, Xiang Gao:
Overview of the seventh Dialog System Technology Challenge: DSTC7. Comput. Speech Lang. 62: 101068 (2020) - [c109]Hiroki Nishikawa, Takumi Yamamoto, Bret A. Harsham, Ye Wang, Kota Uehara, Chiori Hori, Aiko Iwasaki, Kiyoto Kawauchi, Masakatsu Nishigaki:
Analysis of Malicious Email Detection using Cialdini's Principles. AsiaJCIS 2020: 137-142 - [c108]Lei Shi, Shijie Geng, Kai Shuang, Chiori Hori, Songxiang Liu, Peng Gao, Sen Su:
Multi-Layer Content Interaction Through Quaternion Product for Visual Question Answering. ICASSP 2020: 4412-4416 - [c107]Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux:
Transformer-Based Long-Context End-to-End Speech Recognition. INTERSPEECH 2020: 5011-5015 - [c106]Anoop Cherian, Jue Wang, Chiori Hori, Tim K. Marks:
Spatio-Temporal Ranked-Attention Networks for Video Captioning. WACV 2020: 1606-1615 - [i11]Lei Shi, Shijie Geng, Kai Shuang, Chiori Hori, Songxiang Liu, Peng Gao, Sen Su:
Multi-Layer Content Interaction Through Quaternion Product For Visual Question Answering. CoRR abs/2001.05840 (2020) - [i10]Anoop Cherian, Jue Wang, Chiori Hori, Tim K. Marks:
Spatio-Temporal Ranked-Attention Networks for Video Captioning. CoRR abs/2001.06127 (2020) - [i9]Shijie Geng, Peng Gao, Chiori Hori, Jonathan Le Roux, Anoop Cherian:
Spatio-Temporal Scene Graphs for Video Dialog. CoRR abs/2007.03848 (2020) - [i8]Peng Gao, Chiori Hori, Shijie Geng, Takaaki Hori, Jonathan Le Roux:
Multi-Pass Transformer for Machine Translation. CoRR abs/2009.11382 (2020)
2010 – 2019
- 2019
- [j27]Guy Barash, Mauricio Castillo-Effen, Niyati Chhaya, Peter Clark, Huáscar Espinoza, Eitan Farchi, Christopher W. Geib, Odd Erik Gundersen, Seán Ó hÉigeartaigh, José Hernández-Orallo, Chiori Hori, Xiaowei Huang, Kokil Jaidka, Pavan Kapanipathi, Sarah Keren, Seokhwan Kim, Marc Lanctot, Danny Lange, Julian J. McAuley, David R. Martinez, Marwan Mattar, Mausam, Martin Michalowski, Reuth Mirsky, Roozbeh Mottaghi, Joseph C. Osborn, Julien Pérolat, Martin Schmid, Arash Shaban-Nejad, Onn Shehory, Biplav Srivastava, William W. Streilein, Kartik Talamadupula, Julian Togelius, Koichiro Yoshino, Quanshi Zhang, Imed Zitouni:
Reports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence. AI Mag. 40(3): 67-78 (2019) - [j26]Takaaki Hori, Wen Wang, Yusuke Koji, Chiori Hori, Bret Harsham, John R. Hershey:
Adversarial training and decoding strategies for end-to-end neural conversation models. Comput. Speech Lang. 54: 122-139 (2019) - [j25]Chiori Hori, Julien Perez, Ryuichiro Higashinaka, Takaaki Hori, Y-Lan Boureau, Michimasa Inaba, Yuiko Tsunomori, Tetsuro Takahashi, Koichiro Yoshino, Seokhwan Kim:
Overview of the sixth dialog system technology challenge: DSTC6. Comput. Speech Lang. 55: 1-25 (2019) - [j24]Luis Fernando D'Haro, Rafael E. Banchs, Chiori Hori, Haizhou Li:
Automatic evaluation of end-to-end dialog systems with adequacy-fluency metrics. Comput. Speech Lang. 55: 200-215 (2019) - [c105]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Anoop Cherian, Irfan Essa, Dhruv Batra, Tim K. Marks, Chiori Hori, Peter Anderson, Stefan Lee, Devi Parikh:
Audio Visual Scene-Aware Dialog. CVPR 2019: 7558-7567 - [c104]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features. ICASSP 2019: 2352-2356 - [c103]Chiori Hori, Anoop Cherian, Tim K. Marks, Takaaki Hori:
Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog. INTERSPEECH 2019: 1886-1890 - [i7]Koichiro Yoshino, Chiori Hori, Julien Perez, Luis Fernando D'Haro, Lazaros Polymenakos, R. Chulaka Gunasekara, Walter S. Lasecki, Jonathan K. Kummerfeld, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan, Xiang Gao, Huda AlAmri, Tim K. Marks, Devi Parikh, Dhruv Batra:
Dialog System Technology Challenge 7. CoRR abs/1901.03461 (2019) - [i6]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Stefan Lee, Peter Anderson, Irfan Essa, Devi Parikh, Dhruv Batra, Anoop Cherian, Tim K. Marks, Chiori Hori:
Audio-Visual Scene-Aware Dialog. CoRR abs/1901.09107 (2019) - [i5]Seokhwan Kim, Michel Galley, R. Chulaka Gunasekara, Sungjin Lee, Adam Atkinson, Baolin Peng, Hannes Schulz, Jianfeng Gao, Jinchao Li, Mahmoud Adada, Minlie Huang, Luis A. Lastras, Jonathan K. Kummerfeld, Walter S. Lasecki, Chiori Hori, Anoop Cherian, Tim K. Marks, Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta:
The Eighth Dialog System Technology Challenge. CoRR abs/1911.06394 (2019) - 2018
- [c102]Chiori Hori, Takaaki Hori, Gordon Wichern, Jue Wang, Teng-Yok Lee, Anoop Cherian, Tim K. Marks:
Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description. CVPR Workshops 2018: 2528-2531 - [i4]Huda AlAmri, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Jue Wang, Irfan Essa, Dhruv Batra, Devi Parikh, Anoop Cherian, Tim K. Marks, Chiori Hori:
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7. CoRR abs/1806.00525 (2018) - [i3]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features. CoRR abs/1806.08409 (2018) - 2017
- [c101]Chiori Hori, Takaaki Hori, Tim K. Marks, John R. Hershey:
Early and late integration of audio features for automatic video description. ASRU 2017: 430-436 - [c100]Chiori Hori, Takaaki Hori, Teng-Yok Lee, Ziming Zhang, Bret Harsham, John R. Hershey, Tim K. Marks, Kazuhiro Sumi:
Attention-Based Multimodal Fusion for Video Description. ICCV 2017: 4203-4212 - [i2]Chiori Hori, Takaaki Hori, Teng-Yok Lee, Kazuhiro Sumi, John R. Hershey, Tim K. Marks:
Attention-Based Multimodal Fusion for Video Description. CoRR abs/1701.03126 (2017) - [i1]Chiori Hori, Takaaki Hori:
End-to-end Conversation Modeling Track in DSTC6. CoRR abs/1706.07440 (2017) - 2016
- [j23]Tsubasa Ochiai, Shigeki Matsuda, Hideyuki Watanabe, Xugang Lu, Chiori Hori, Hisashi Kawai, Shigeru Katagiri:
Speaker Adaptive Training Localizing Speaker Modules in DNN for Hybrid DNN-HMM Speech Recognizers. IEICE Trans. Inf. Syst. 99-D(10): 2431-2443 (2016) - [j22]Peng Shen, Xugang Lu, Xinhui Hu, Naoyuki Kanda, Masahiro Saiko, Chiori Hori, Hisashi Kawai:
Combination of multiple acoustic models with unsupervised adaptation for lecture speech transcription. Speech Commun. 82: 1-13 (2016) - [j21]Jinfu Ni, Yoshinori Shiga, Chiori Hori:
Superpositional HMM-Based Intonation Synthesis Using a Functional F0 Model. J. Signal Process. Syst. 82(2): 273-286 (2016) - [c99]Takaaki Hori, Chiori Hori, Shinji Watanabe, John R. Hershey:
Minimum word error training of long short-term memory recurrent neural network language models for speech recognition. ICASSP 2016: 5990-5994 - [c98]Chiori Hori, Shinji Watanabe, Takaaki Hori, Bret A. Harsham, John R. Hershey, Yusuke Koji, Youichi Fujii, Yuki Furumoto:
Driver confusion status detection using recurrent neural networks. ICME 2016: 1-6 - [c97]Chiori Hori, Takaaki Hori, Shinji Watanabe, John R. Hershey:
Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs. INTERSPEECH 2016: 3236-3240 - [c96]Takaaki Hori, Hai Wang, Chiori Hori, Shinji Watanabe, Bret Harsham, Jonathan Le Roux, John R. Hershey, Yusuke Koji, Yi Jing, Zhaocheng Zhu, Takeyuki Aikawa:
Dialog state tracking with attention-based sequence-to-sequence learning. SLT 2016: 552-558 - 2015
- [j20]Komei Sugiura, Yoshinori Shiga, Hisashi Kawai, Teruhisa Misu, Chiori Hori:
A cloud robotics approach towards dialogue-oriented robot speech. Adv. Robotics 29(7): 449-456 (2015) - [j19]Youzheng Wu, Chiori Hori, Hideki Kashioka, Hisashi Kawai:
Leveraging social Q&A collections for improving complex question answering. Comput. Speech Lang. 29(1): 1-19 (2015) - [c95]Hay Mar Soe Naing, Aye Mya Hlaing, Win Pa Pa, Xinhui Hu, Ye Kyaw Thu, Chiori Hori, Hisashi Kawai:
A Myanmar large vocabulary continuous speech recognition system. APSIPA 2015: 320-327 - [c94]Tsubasa Ochiai, Shigeki Matsuda, Hideyuki Watanabe, Xugang Lu, Chiori Hori, Shigeru Katagiri:
Speaker adaptive training for deep neural networks embedding linear transformation networks. ICASSP 2015: 4605-4609 - [c93]Jinfu Ni, Yoshinori Shiga, Chiori Hori:
Extraction of pitch register from expressive speech in Japanese. ICASSP 2015: 4764-4768 - [c92]Xugang Lu, Peng Shen, Yu Tsao, Chiori Hori, Hisashi Kawai:
Sparse representation with temporal max-smoothing for acoustic event detection. INTERSPEECH 2015: 1176-1180 - [c91]Ye Kyaw Thu, Win Pa Pa, Jinfu Ni, Yoshinori Shiga, Andrew M. Finch, Chiori Hori, Hisashi Kawai, Eiichiro Sumita:
HMM based myanmar text to speech system. INTERSPEECH 2015: 2237-2241 - [c90]Ye Kyaw Thu, Win Pa Pa, Andrew M. Finch, Jinfu Ni, Eiichiro Sumita, Chiori Hori:
The Application of Phrase Based Statistical Machine Translation Techniques to Myanmar Grapheme to Phoneme Conversion. PACLING 2015: 238-250 - 2014
- [j18]Yu Tsao, Xugang Lu, Paul R. Dixon, Ting-Yao Hu, Shigeki Matsuda, Chiori Hori:
Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation. Comput. Speech Lang. 28(3): 709-726 (2014) - [j17]Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka, Chin-Hui Lee:
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 22(2): 403-416 (2014) - [c89]Xinhui Hu, Masahiro Saiko, Chiori Hori:
Incorporating tone features to convolutional neural network to improve Mandarin/Thai speech recognition. APSIPA 2014: 1-5 - [c88]Jinfu Ni, Yoshinori Shiga, Chiori Hori:
Tuning intonation with pitch accent decomposition for HMM-based expressive speech synthesis. APSIPA 2014: 1-10 - [c87]Youzheng Wu, Taro Watanabe, Chiori Hori:
Recurrent Neural Network-based Tuple Sequence Model for Machine Translation. COLING 2014: 1908-1917 - [c86]Chien-Lin Huang, Chiori Hori:
Semantic context inference for spoken document retrieval using term association matrices. ICASSP 2014: 4116-4120 - [c85]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Sparse representation based on a bag of spectral exemplars for acoustic event detection. ICASSP 2014: 6255-6259 - [c84]Tsubasa Ochiai, Shigeki Matsuda, Xugang Lu, Chiori Hori, Shigeru Katagiri:
Speaker Adaptive Training using Deep Neural Networks. ICASSP 2014: 6349-6353 - [c83]Youzheng Wu, Xinhui Hu, Chiori Hori:
Translating TED speeches by recurrent neural network based translation model. ICASSP 2014: 7098-7102 - [c82]Komei Sugiura, Yoshinori Shiga, Hisashi Kawai, Teruhisa Misu, Chiori Hori:
Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach. ICRA 2014: 2237-2242 - [c81]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Ensemble modeling of denoising autoencoder for speech spectrum restoration. INTERSPEECH 2014: 885-889 - [c80]Xinhui Hu, Xugang Lu, Chiori Hori:
Mandarin speech recognition using convolution neural network with augmented tone features. ISCSLP 2014: 15-18 - [c79]Jinfu Ni, Yoshinori Shiga, Chiori Hori:
Superpositional HMM-based intonation synthesis using a functional F0 model. ISCSLP 2014: 270-274 - [c78]Xugang Lu, Yu Tsao, Peng Shen, Chiori Hori:
Spectral patch based sparse coding for acoustic event detection. ISCSLP 2014: 317-320 - [c77]Peng Shen, Yugang Lu, Xinhui Hu, Naoyuki Kanda, Masahiro Saiko, Chiori Hori:
The NCT ASR system for IWSLT 2014. IWSLT (Evaluation Campaign) 2014 - [c76]Masahiro Saiko, Hitoshi Yamamoto, Ryosuke Isotani, Chiori Hori:
Efficient multi-lingual unsupervised acoustic model training under mismatch conditions. SLT 2014: 24-29 - 2013
- [j16]Sakriani Sakti, Michael Paul, Andrew M. Finch, Shinsuke Sakai, Thang Tat Vu, Noriyuki Kimura, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li:
A-STAR: Toward translating Asian spoken languages. Comput. Speech Lang. 27(2): 509-527 (2013) - [j15]Xinhui Hu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Collecting Colloquial and Spontaneous-like Sentences from Web Resources for Constructing Chinese Language Models of Speech Recognition. Inf. Media Technol. 8(2): 449-456 (2013) - [j14]Xinhui Hu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Collecting Colloquial and Spontaneous-like Sentences from Web Resources for Constructing Chinese Language Models of Speech Recognition. J. Inf. Process. 21(2): 168-175 (2013) - [j13]Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Controlling Tradeoff Between Approximation Accuracy and Complexity of a Smooth Function in a Reproducing Kernel Hilbert Space for Noise Reduction. IEEE Trans. Signal Process. 61(3): 601-610 (2013) - [c75]Chien-Lin Huang, Chiori Hori:
Classification of children with voice impairments using deep neural networks. APSIPA 2013: 1-5 - [c74]Chien-Lin Huang, Shigeki Matsuda, Chiori Hori:
Feature normalization using MVAW processing for spoken language recognition. APSIPA 2013: 1-4 - [c73]Chien-Lin Huang, Chiori Hori, Hideki Kashioka, Bin Ma:
Speaker clustering using vector representation with long-term feature for lecture speech recognition. ICASSP 2013: 3532-3536 - [c72]Chien-Lin Huang, Chiori Hori, Hideki Kashioka, Bin Ma:
Joint analysis of vocal tract length and temporal information for robust speech recognition. ICASSP 2013: 7432-7436 - [c71]Chien-Lin Huang, Chiori Hori, Hideki Kashioka:
Semantic inference based on neural probabilistic language modeling for speech indexing. ICASSP 2013: 8480-8484 - [c70]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Speech enhancement based on deep denoising autoencoder. INTERSPEECH 2013: 436-440 - [c69]Jinfu Ni, Yoshinori Shiga, Chiori Hori, Yutaka Kidawara:
A targets-based superpositional model of fundamental frequency contours applied to HMM-based speech synthesis. INTERSPEECH 2013: 1052-1056 - [c68]Peter Bell, Hitoshi Yamamoto, Pawel Swietojanski, Youzheng Wu, Fergus McInnes, Chiori Hori, Steve Renals:
A lecture transcription system combining neural network acoustic and language models. INTERSPEECH 2013: 3087-3091 - [c67]Xugang Lu, Shigeki Matsuda, Chiori Hori:
Speech spectrum restoration based on conditional restricted boltzmann machine. INTERSPEECH 2013: 3259-3263 - [c66]Masahiro Saiko, Shigeki Matsuda, Ken Hanazawa, Ryosuke Isotani, Chiori Hori:
Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP. INTERSPEECH 2013: 3322-3326 - [c65]Chien-Lin Huang, Paul R. Dixon, Shigeki Matsuda, Youzheng Wu, Xugang Lu, Masahiro Saiko, Chiori Hori:
The NICT ASR system for IWSLT 2013. IWSLT (Evaluation Campaign) 2013 - [c64]Etsuo Mizukami, Teruhisa Misu, Chiori Hori:
WFST-Based Spoken Dialogue System on Smartphones - Its Development and Implementation for Field Use. MDM (2) 2013: 217-224 - [c63]Shigeki Matsuda, Xinhui Hu, Yoshinori Shiga, Hideki Kashioka, Chiori Hori, Keiji Yasuda, Hideo Okuma, Masao Uchiyama, Eiichiro Sumita, Hisashi Kawai, Satoshi Nakamura:
Multilingual Speech-to-Speech Translation System: VoiceTra. MDM (2) 2013: 229-233 - 2012
- [j12]Hansjörg Hofmann, Sakriani Sakti, Chiori Hori, Hideki Kashioka, Satoshi Nakamura, Wolfgang Minker:
Sequence-Based Pronunciation Variation Modeling for Spontaneous ASR Using a Noisy Channel Approach. IEICE Trans. Inf. Syst. 95-D(8): 2084-2093 (2012) - [j11]Sakriani Sakti, Michael Paul, Andrew M. Finch, Xinhui Hu, Jinfu Ni, Noriyuki Kimura, Shigeki Matsuda, Chiori Hori, Yutaka Ashikari, Hisashi Kawai, Hideki Kashioka, Eiichiro Sumita, Satoshi Nakamura:
Distributed speech translation technologies for multiparty multilingual communication. ACM Trans. Speech Lang. Process. 9(2): 4:1-4:27 (2012) - [c62]Youzheng Wu, Xugang Lu, Hitoshi Yamamoto, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Factored Language Model based on Recurrent Neural Network. COLING 2012: 2835-2850 - [c61]Paul R. Dixon, Chiori Hori, Hideki Kashioka:
A comparison of dynamic WFST decoding approaches. ICASSP 2012: 4209-4212 - [c60]Yu Tsao, Chien-Lin Huang, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
A linear projection approach to environment modeling for robust speech recognition. ICASSP 2012: 4329-4332 - [c59]Hitoshi Yamamoto, Paul R. Dixon, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Tied-State Mixture Language Model for WFST-based Speech Recognition. INTERSPEECH 2012: 174-177 - [c58]Youzheng Wu, Kazuhiko Abe, Paul R. Dixon, Chiori Hori, Hideki Kashioka:
Leveraging Social Annotation for Topic Language Model Adaptation. INTERSPEECH 2012: 190-193 - [c57]Paul R. Dixon, Chiori Hori, Hideki Kashioka:
A Specialized WFST Approach for Class Models and Dynamic Vocabulary. INTERSPEECH 2012: 1075-1078 - [c56]Xugang Lu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Speech restoration based on deep learning autoencoder with layer-wised pretraining. INTERSPEECH 2012: 1504-1507 - [c55]Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose, Chiori Hori, Hideki Kashioka, Paul R. Dixon:
Improving WFST-based G2P Conversion with Alignment Constraints and RNNLM N-best Rescoring. INTERSPEECH 2012: 2526-2529 - [c54]Chien-Lin Huang, Chiori Hori, Hideki Kashioka, Bin Ma:
Ensemble Classifiers Using Unsupervised Data Selection for Speaker Recognition. INTERSPEECH 2012: 2666-2669 - [c53]Xinhui Hu, Youzheng Wu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Collecting sentences from web resources for constructing spontaneous Chinese language model. ISCSLP 2012: 197-200 - [c52]Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Controlling the tradeoff property in a regularization framework for noise reduction. ISCSLP 2012: 201-205 - [c51]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Acoustic space partition based on broad phonetic class for ensemble acoustic modeling. ISCSLP 2012: 311-314 - [c50]Hitoshi Yamamoto, Youzheng Wu, Chien-Lin Huang, Xugang Lu, Paul R. Dixon, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
The NICT ASR system for IWSLT2012. IWSLT 2012: 34-37 - [c49]Youzheng Wu, Hitoshi Yamamoto, Xugang Lu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Factored recurrent neural network language model in TED lecture transcription. IWSLT 2012: 222-228 - 2011
- [j10]Teruhisa Misu, Komei Sugiura, Tatsuya Kawahara, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Modeling spoken decision support dialogue and optimization of its dialogue strategy. ACM Trans. Speech Lang. Process. 7(3): 10:1-10:18 (2011) - [c48]Teruhisa Misu, Komei Sugiura, Tatsuya Kawahara, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface. IWSDS 2011: 29-52 - [c47]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue Acts Annotation to Construct Dialogue Systems for Consulting. IWSDS 2011: 231-254 - [c46]Youzheng Wu, Chiori Hori, Hisashi Kawai, Hideki Kashioka:
Improving Related Entity Finding via Incorporating Homepages and Recognizing Fine-grained Entities. IJCNLP 2011: 174-182 - [c45]Youzheng Wu, Chiori Hori, Hisashi Kawai, Hideki Kashioka:
Answering Complex Questions via Exploiting Social Q&A Collection. IJCNLP 2011: 956-964 - [c44]Teruhisa Misu, Kiyonori Ohtake, Chiori Hori, Hisashi Kawai, Satoshi Nakamura:
User Study of Spoken Decision Support System. INTERSPEECH 2011: 797-800 - [c43]Yu Tsao, Paul R. Dixon, Chiori Hori, Hisashi Kawai:
Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition. INTERSPEECH 2011: 2585-2588 - [c42]Sakriani Sakti, Andrew M. Finch, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Conditional Random Fields for Modeling Korean Pronunciation Variation. IWSDS 2011: 49-55 - [c41]Kazuhiko Abe, Youzheng Wu, Chien-Lin Huang, Paul R. Dixon, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
The NICT ASR system for IWSLT2011. IWSLT 2011: 28-33 - [c40]Paul R. Dixon, Andrew M. Finch, Chiori Hori, Hideki Kashioka:
Investigation on the effects of ASR tuning on speech translation performance. IWSLT 2011: 167-174 - 2010
- [c39]Teruhisa Misu, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Web text classification for response generation in spoken decision support dialogue systems. IUCS 2010: 131-134 - [c38]Naoto Kimura, Chiori Hori, Teruhisa Misu, Kiyonori Ohtake, Hisashi Kawai, Satoshi Nakamura:
Expansion of WFST-Based Dialog Management for Handling Multiple ASR Hypotheses. IWSDS 2010: 61-72 - [c37]Teruhisa Misu, Chiori Hori, Kiyonori Ohtake, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Construction and Experiment of a Spoken Consulting Dialogue System. IWSDS 2010: 169-175 - [c36]Teruhisa Misu, Chiori Hori, Kiyonori Ohtake, Etsuo Mizukami, Akihiro Kobayashi, Kentaro Kayama, Tetsuya Fujii, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Sightseeing Guidance Systems Based on WFST-Based Dialogue Manager. IWSDS 2010: 194-195 - [c35]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems. LREC 2010 - [c34]Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy. SIGDIAL Conference 2010: 221-224 - [c33]Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Hisashi Kawai, Satoshi Nakamura:
Dialogue strategy optimization to assist user's decision for spoken consulting dialogue systems. SLT 2010: 354-359 - [c32]Youzheng Wu, Chiori Hori, Hisashi Kawai:
NiCT at TREC 2010: Related Entity Finding. TREC 2010
2000 – 2009
- 2009
- [j9]Chiori Hori, Bing Zhao, Stephan Vogel, Alex Waibel, Hideki Kashioka, Satoshi Nakamura:
Consolidation-Based Speech Translation and Evaluation Approach. IEICE Trans. Inf. Syst. 92-D(3): 477-488 (2009) - [c31]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Annotating Dialogue Acts to Construct Dialogue Systems for Consulting. ALR7@IJCNLP 2009: 32-39 - [c30]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Weighted finite state transducer based statistical dialog management. ASRU 2009: 490-495 - [c29]Sakriani Sakti, Noriyuki Kimura, Michael Paul, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li:
The Asian network-based speech-to-speech translation system. ASRU 2009: 507-512 - [c28]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Statistical dialog management applied to WFST-based dialog systems. ICASSP 2009: 4793-4796 - [c27]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Recent advances in WFST-based dialog system. INTERSPEECH 2009: 268-271 - [c26]Teruhisa Misu, Kiyonori Ohtake, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems. INTERSPEECH 2009: 1843-1846 - [c25]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Evaluation for WFST-based dialog management. IUCS 2009: 255-260 - [c24]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue act annotation for consulting dialogue corpus. IUCS 2009: 372-378 - [c23]Chiori Hori, Sakriani Sakti, Michael Paul, Noriyuki Kimura, Yutaka Ashikari, Ryosuke Isotani, Eiichiro Sumita, Satoshi Nakamura:
Network-based speech-to-speech translation. IWSLT 2009: 168 - 2008
- [c22]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
Dialog management using weighted finite-state transducers. INTERSPEECH 2008: 211-214 - [c21]Tatsuya Kawahara, Masayoshi Toyokura, Teruhisa Misu, Chiori Hori:
Detection of feeling through back-channels in spoken dialogue. INTERSPEECH 2008: 1696 - [c20]Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, Hideki Kashioka, Satoshi Nakamura:
A Statistical Approach to Expandable Spoken Dialog Systems using WFSTs. ISUC 2008: 24-27 - [c19]Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue Act Annotation for Statistically Managed Spoken Dialogue Systems. ISUC 2008: 416-422 - 2007
- [j8]Takaaki Hori, Chiori Hori, Yasuhiro Minami, Atsushi Nakamura:
Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition. IEEE Trans. Speech Audio Process. 15(4): 1352-1365 (2007) - [c18]Chiori Hori, Bing Zhao, Stephan Vogel, Alex Waibel:
Consolidation based speech translation. ASRU 2007: 380-385 - 2005
- [c17]Chiori Hori, Alex Waibel:
Spontaneous speech consolidation for spoken language applications. INTERSPEECH 2005: 617-620 - [c16]Matthias Eck, Chiori Hori:
Overview of the IWSLT 2005 evaluation campaign. IWSLT 2005: 1-22 - [c15]Sanjika Hewavitharana, Bing Zhao, Almut Silja Hildebrand, Matthias Eck, Chiori Hori, Stephan Vogel, Alex Waibel:
The CMU statistical machine translation system for IWSLT 2005. IWSLT 2005: 53-60 - [c14]Jesús Giménez, Enrique Amigó, Chiori Hori:
Machine translation evaluation inside QARLA. IWSLT 2005: 189-196 - 2004
- [j7]Chiori Hori, Sadaoki Furui:
Speech Summarization: An Approach through Word Extraction and a Method for Evaluation. IEICE Trans. Inf. Syst. 87-D(1): 15-25 (2004) - [j6]Sadaoki Furui, Tomonori Kikuchi, Yosuke Shinnaka, Chiori Hori:
Speech-to-text and speech-to-speech summarization of spontaneous speech. IEEE Trans. Speech Audio Process. 12(4): 401-408 (2004) - [c13]Takaaki Hori, Chiori Hori, Yasuhiro Minami:
Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition. INTERSPEECH 2004: 289-292 - 2003
- [j5]Chiori Hori, Sadaoki Furui, Robert G. Malkin, Hua Yu, Alex Waibel:
A Statistical Approach to Automatic Speech Summarization. EURASIP J. Adv. Signal Process. 2003(2): 128-139 (2003) - [j4]Chiori Hori, Sadaoki Furui:
A new approach to automatic speech summarization. IEEE Trans. Multim. 5(3): 368-378 (2003) - [c12]Chiori Hori, Takaaki Hori, Hajime Tsukada, Hideki Isozaki, Yutaka Sasaki, Eisaku Maeda:
Spoken Interactive ODQA System: SPIQA. ACL (Companion) 2003: 153-156 - [c11]Tomonori Kikuchi, Sadaoki Furui, Chiori Hori:
Automatic speech summarization based on sentence extraction and compaction. ICASSP (1) 2003: 384-387 - [c10]Chiori Hori, Takaaki Hori, Hideki Isozaki, Eisaku Maeda, Shigeru Katagiri, Sadaoki Furui:
Deriving disambiguous queries in a spoken interactive ODQA system. ICASSP (1) 2003: 624-627 - [c9]Takaaki Hori, Chiori Hori, Yasuhiro Minami:
Speech summarization using weighted finite-state transducers. INTERSPEECH 2003: 2817-2820 - [c8]Chiori Hori, Takaaki Hori, Sadaoki Furui:
Evaluation method for automatic speech summarization. INTERSPEECH 2003: 2825-2828 - 2002
- [j3]Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda:
Erratum: Language modeling by stochastic dependency grammer for Japanese speech recognition. Syst. Comput. Jpn. 33(3): 74 (2002) - [j2]Chiori Hori, Masaharu Katoh, Akinori Ito, Masaki Kohda:
Construction and evaluation of language models based on stochastic context-free grammar for speech recognition Chiori Hori, Masaharu Katoh, Akinori Ito, Masaki Koh. Syst. Comput. Jpn. 33(13): 48-59 (2002) - [c7]Chiori Hori, Sadaoki Furui, Robert G. Malkin, Hua Yu, Alex Waibel:
Automatic speech summarization applied to English broadcast news speech. ICASSP 2002: 9-12 - 2001
- [j1]Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda:
Language modeling by stochastic dependency grammar for Japanese speech recognition. Syst. Comput. Jpn. 32(12): 10-15 (2001) - [c6]Sadaoki Furui, Koji Iwano, Chiori Hori, Takahiro Shinozaki, Yohei Saito, Satoshi Tamura:
Ubiquitous speech processing. ICASSP 2001: 13-16 - [c5]Takahiro Shinozaki, Chiori Hori, Sadaoki Furui:
Towards automatic transcription of spontaneous presentations. INTERSPEECH 2001: 491-494 - [c4]Chiori Hori, Sadaoki Furui:
Advances in automatic speech summarization. INTERSPEECH 2001: 1771-1774 - 2000
- [c3]Chiori Hori, Sadaoki Furui:
Automatic speech summarization based on word significance and linguistic likelihood. ICASSP 2000: 1579-1582 - [c2]Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda:
Language modeling by stochastic dependency grammar for Japanese speech recognition. INTERSPEECH 2000: 246-249 - [c1]Chiori Hori, Sadaoki Furui:
Improvements in automatic speech summarization and evaluation methods. INTERSPEECH 2000: 326-329
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-04 21:05 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint