Kazuhiro Nakadai
2020 – today
- 2024
- [j58]Kazuhiro Nakadai, Emilia I. Barakova, Ki-Uk Kyung:
Special issue on robot and human interactive communication. Adv. Robotics 38(19-20): 1349-1350 (2024) - [j57]Yui Sudo, Masayuki Takigahira, Hideo Tsuru, Kazuhiro Nakadai, Hirofumi Nakajima:
Online adaptation of fourier series-based acoustic transfer function model and its application to sound source localization and separation. Adv. Robotics 38(19-20): 1351-1363 (2024) - [j56]Jiang Wang, Yuanzheng He, Daobilige Su, Katsutoshi Itoyama, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong:
SLAM-Based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization. IEEE Trans. Robotics 40: 4024-4044 (2024) - [c237]Zirui Lin, Katsutoshi Itoyama, Kazuhiro Nakadai, Hideharu Amano:
FPGA-based Low Power Acceleration of HARK Sound Source Localization. COOL CHIPS 2024: 1-6 - [c236]Haruto Yokota, Mert Bozkurtlar, Benjamin Yen, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
A Video Vision Transformer for Sound Source Localization. EUSIPCO 2024: 106-110 - [c235]Ragib Amin Nihal, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai:
UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios. ICPR (14) 2024: 145-162 - [c234]Takahiro Osaki, Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Improving Noise Robustness of Automatic Speech Recognition Based on a Parallel Adapter Model with Near-Identity Initialization. IEA/AIE 2024: 454-466 - [c233]Shuhei Asaka, Katsutoshi Itoyama, Kazuhiro Nakadai:
Improving Impressions of Response Delay in AI-based Spoken Dialogue Systems. RO-MAN 2024: 1416-1421 - [c232]Mert Bozkurtlar, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai:
Real Time Sound Source Localization Using von-Mises ResNet. SII 2024: 466-471 - [i11]Ragib Amin Nihal, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai:
From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution. CoRR abs/2401.14661 (2024) - [i10]Jiang Wang, Yuanzheng He, Daobilige Su, Katsutoshi Itoyama, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong:
SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization. CoRR abs/2405.19813 (2024) - [i9]Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai:
Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance? CoRR abs/2407.15310 (2024) - [i8]Ragib Amin Nihal, Benjamin Yen, Katsutoshi Itoyama, Kazuhiro Nakadai:
UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios. CoRR abs/2408.04922 (2024) - 2023
- [c231]Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai:
Is the Ideal Ratio Mask Really the Best? - Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers. APSIPA ASC 2023: 1843-1850 - [c230]Ziquan Qin, Kaijie Wei, Hideharu Amano, Kazuhiro Nakadai:
Low power implementation of Geometric High-order Decorrelation-based Source Separation on an FPGA board. COOL CHIPS 2023: 1-6 - [c229]Yui Sudo, Kazuya Hata, Kazuhiro Nakadai:
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation. INTERSPEECH 2023: 491-495 - [c228]Haris Gulzar, Monikka Roslianna Busto, Takeharu Eda, Katsutoshi Itoyama, Kazuhiro Nakadai:
miniStreamer: Enhancing Small Conformer with Chunked-Context Masking for Streaming ASR Applications on the Edge. INTERSPEECH 2023: 3277-3281 - [c227]Takahiro Aizawa, Yoshiaki Bando, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai, Masaki Onishi:
Unsupervised Domain Adaptation of Universal Source Separation Based on Neural Full-Rank Spatial Covariance Analysis. MLSP 2023: 1-6 - [c226]Tan Sihan, Khan Nabeela Khanum, Katsutoshi Itoyama, Kazuhiro Nakadai:
Improving Sign Language Understanding Introducing Label Smoothing. RO-MAN 2023: 113-118 - [c225]Yui Sudo, Masayuki Takigahira, Hideo Tsuru, Kazuhiro Nakadai, Hirofumi Nakajima:
Online Adaptation of Fourier Series Based Acoustic Transfer Function Model to Improve Sound Source Localization and Separation. RO-MAN 2023: 2058-2063 - [c224]Masahiko Fujita, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
An Ensemble Method for Multiple Speech Enhancement Using Deep Learning. SII 2023: 1-6 - [c223]Haris Gulzar, Muhammad Shakeel, Katsutoshi Itoyama, Kazuhiro Nakadai, Kenji Nishida, Hideharu Amano, Takeharu Eda:
FPGA based Power-Efficient Edge Server to Accelerate Speech Interface for Socially Assistive Robotics. SII 2023: 1-6 - [c222]Yuanzheng He, Jiang Wang, Daobilige Su, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong:
Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization. SII 2023: 1-8 - [c221]Hidehiko Kishinami, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Reconstruction of Depth Scenes Based on Echolocation. SII 2023: 1-6 - [c220]Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Metric-Based Multimodal Meta-Learning for Human Movement Identification Via Footstep Recognition. SII 2023: 1-8 - [c219]Chishio Sugiyama, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Assessment of Simultaneous Calibration for Positions, Orientations, and Time Offsets in Multiple Microphone Arrays Systems. SII 2023: 1-6 - [c218]Kei Suzuki, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Audio-Visual Class Association Based on Two-stage Self-supervised Contrastive Learning towards Robust Scene Analysis. SII 2023: 1-6 - [c217]Reiji Suzuki, Shinji Sumitani, Zachary Harlow, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno:
Extracting Bird Vocalizations from a Complex Natural Soundscape in Forests Using Robot Audition Techniques. SII 2023: 1-6 - [i7]Yui Sudo, Kazuya Hata, Kazuhiro Nakadai:
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation. CoRR abs/2305.17846 (2023) - [i6]Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai:
Is the Ideal Ratio Mask Really the Best? - Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers. CoRR abs/2309.12065 (2023) - 2022
- [j55]Shiho Matsubayashi, Kazuhiro Nakadai, Reiji Suzuki, Tatsuya Ura, Makoto Hasebe, Hiroshi G. Okuno:
Auditory Survey of Endangered Eurasian Bittern Using Microphone Arrays and Robot Audition. Frontiers Robotics AI 9: 854572 (2022) - [c216]Zhongyang Hou, Kaijie Wei, Hideharu Amano, Kazuhiro Nakadai:
An FPGA off-loading of HARK sound source localization. CANDARW 2022: 236-240 - [c215]Ryu Takeda, Yui Sudo, Kazuhiro Nakadai, Kazunori Komatani:
Empirical Sampling from Latent Utterance-wise Evidence Model for Missing Data ASR based on Neural Encoder-Decoder Model. INTERSPEECH 2022: 3789-3793 - [c214]Yoshiaki Bando, Takahiro Aizawa, Katsutoshi Itoyama, Kazuhiro Nakadai:
Weakly-Supervised Neural Full-Rank Spatial Covariance Analysis for a Front-End System of Distant Speech Recognition. INTERSPEECH 2022: 3824-3828 - [c213]Yui Sudo, Muhammad Shakeel, Kazuhiro Nakadai, Jiatong Shi, Shinji Watanabe:
Streaming Automatic Speech Recognition with Re-blocking Processing Based on Integrated Voice Activity Detection. INTERSPEECH 2022: 4641-4645 - [c212]Yasuhiro Kagimoto, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Spotforming by NMF Using Multiple Microphone Arrays. IROS 2022: 9253-9258 - [c211]Taiki Yamada, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Outdoor evaluation of sound source localization for drone groups using microphone arrays. IROS 2022: 9296-9301 - [i5]Yuanzheng He, Jiang Wang, Daobilige Su, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong:
Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization. CoRR abs/2210.05600 (2022) - 2021
- [j54]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multichannel environmental sound segmentation. Appl. Intell. 51(11): 8245-8259 (2021) - [j53]Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Detecting earthquakes: a novel deep learning-based approach for effective disaster response. Appl. Intell. 51(11): 8305-8315 (2021) - [c210]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani:
Spatial Normalization to Reduce Positional Complexity in Direction-aided Supervised Binaural Sound Source Separation. APSIPA ASC 2021: 248-253 - [c209]Katsutoshi Itoyama, Yoshiya Morimoto, Shungo Masaki, Ryosuke Kojima, Kenji Nishida, Kazuhiro Nakadai:
Assessment of von Mises-Bernoulli Deep Neural Network in Sound Source Localization. Interspeech 2021: 2152-2156 - [c208]Kazuhiro Nakadai, Masayuki Takigahira, Yusuke Kawai, Hirofumi Nakajima:
Fully-Online Always-Adaptation of Transfer Functions and Its Application to Sound Source Localization and Separation. IROS 2021: 2100-2105 - [c207]Zhi Zhong, Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Assessment of a Beamforming Implementation Developed for Surface Sound Source Separation. SII 2021: 369-374 - [c206]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multi-channel Environmental Sound Segmentation utilizing Sound Source Localization and Separation U-Net. SII 2021: 382-387 - [c205]Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
EMC: Earthquake Magnitudes Classification on Seismic Signals via Convolutional Recurrent Networks. SII 2021: 388-393 - [c204]Taiki Yamada, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Sound Source Tracking Using Integrated Direction Likelihood for Drones with Microphone Arrays. SII 2021: 394-399 - [c203]Reiji Suzuki, Hao Zhao, Shinji Sumitani, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno:
Visualizing Directional Soundscapes of Bird Vocalizations Using Robot Audition Techniques. SII 2021: 487-492 - [c202]Shiho Matsubayashi, Fumiyuki Saito, Reiji Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno:
Observing Nocturnal Birds Using Localization Techniques. SII 2021: 493-498 - [c201]Kazuhiro Nakadai, Yosuke Fukumoto, Ryu Takeda:
Investigation of Node Pruning Criteria for Neural Networks Model Compression with Non-Linear Function and Non-Uniform Network Topology. SLT 2021: 117-124 - [i4]Muhammad Shakeel, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Metric-based multimodal meta-learning for human movement identification via footstep recognition. CoRR abs/2111.07979 (2021) - 2020
- [j52]Kazuhiro Nakadai, Hiroshi G. Okuno:
Robot Audition and Computational Auditory Scene Analysis. Adv. Intell. Syst. 2(9): 2000050 (2020) - [j51]Toshinori Kagawa, Fumie Ono, Lin Shan, Ryu Miura, Kazuhiro Nakadai, Kotaro Hoshiba, Makoto Kumon, Hiroshi G. Okuno, Shin Kato, Fumihide Kojima:
Multi-hop wireless command and telemetry communication system for remote operation of robots with extending operation area beyond line-of-sight using 920 MHz/169 MHz. Adv. Robotics 34(11): 756-766 (2020) - [j50]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Sound event aware environmental sound segmentation with Mask U-Net. Adv. Robotics 34(20): 1280-1290 (2020) - [j49]Ryosuke Hasumoto, Kazuhiro Nakadai, Michita Imai:
Reactive Chameleon: A Method to Mimic Conversation Partner's Body Sway for a Robot. Int. J. Soc. Robotics 12(1): 239-258 (2020) - [j48]Heike Brock, Iva Farag, Kazuhiro Nakadai:
Recognition of Non-Manual Content in Continuous Japanese Sign Language. Sensors 20(19): 5621 (2020) - [j47]Heike Brock, Felix Law, Kazuhiro Nakadai, Yuji Nagashima:
Learning Three-dimensional Skeleton Data from Sign Language Video. ACM Trans. Intell. Syst. Technol. 11(3): 30:1-30:24 (2020) - [c200]Toru Yamashita, Futoshi Asano, Kazuhiro Nakadai:
Age Classification of Evacuees at Times of Disaster Using a Vibration Sensor. APSIPA 2020: 184-188 - [c199]Naoki Yamamoto, Kenji Nishida, Katsutoshi Itoyama, Kazuhiro Nakadai:
Detection of Ball Spin Direction using Hitting Sound in Tennis. icSPORTS 2020: 30-37 - [c198]Katsuhiro Dan, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Calibration of a Microphone Array Based on a Probabilistic Model of Microphone Positions. IEA/AIE 2020: 614-625 - [c197]Katsutoshi Itoyama, Kazuhiro Nakadai:
Synchronization of Microphones Based on Rank Minimization of Warped Spectrum for Asynchronous Distributed Recording. IROS 2020: 4842-4847 - [c196]Shinji Sumitani, Reiji Suzuki, Takemi Morimatsu, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno:
Soundscape Analysis of Bird Songs in Forests Using Microphone Arrays. SII 2020: 634-639 - [c195]Kazuhiro Nakadai, Shungo Masaki, Ryosuke Kojima, Osamu Sugiyama, Katsutoshi Itoyama, Kenji Nishida:
Sound Source Localization Based on von-Mises-Bernoulli Deep Neural Network. SII 2020: 658-663 - [c194]Yoshiaki Asahara, Kohich Matsuda, Hirofumi Nakajima, Kazuhiro Nakadai:
A Fourier series based Data compression model for Acoustic transfer function. SII 2020: 664-668 - [c193]Taiki Yamada, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Sound Source Tracking by Drones with Microphone Arrays. SII 2020: 796-801 - [c192]Takashi Konno, Kenji Nishida, Katsutoshi Itoyama, Kazuhiro Nakadai:
Audio-Visual 3D Reconstruction Framework for Dynamic Scenes. SII 2020: 802-807 - [c191]Zhi Zhong, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Design and Assessment of a Scan-and-sum Beamformer for Surface Sound Source Separation. SII 2020: 808-813 - [c190]Mizuho Wakabayashi, Kai Washizaki, Kotaro Hoshiba, Kazuhiro Nakadai, Hiroshi G. Okuno, Makoto Kumon:
Design and Implementation of Real-Time Visualization of Sound Source Positions by Drone Audition. SII 2020: 814-819 - [c189]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Multi-channel Environmental sound segmentation. SII 2020: 820-825
2010 – 2019
- 2019
- [j46]Kazuhiro Nakadai, Emilia I. Barakova, Michita Imai, Tetsunari Inamura:
Special issue on robot and human interactive communication. Adv. Robotics 33(7-8): 307-308 (2019) - [j45]Daniel Gabriel, Ryosuke Kojima, Kotaro Hoshiba, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
2D sound source position estimation using microphone arrays and its application to a VR-based bird song analysis system. Adv. Robotics 33(7-8): 403-414 (2019) - [j44]Kazuhiro Nakadai, Emilia I. Barakova, Michita Imai, Tetsunari Inamura:
Special issue on robot and human interactive communication. Adv. Robotics 33(15-16): 699 (2019) - [c188]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Improvement of DOA Estimation by using Quaternion Output in Sound Event Localization and Detection. DCASE 2019: 244-247 - [c187]Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata:
CNN-based Multichannel End-to-End Speech Recognition for Everyday Home Environments*. EUSIPCO 2019: 1-5 - [c186]Zhaofeng Zhang, Kazuhiro Nakadai, Hirofumi Nakajima, Naoaki Sumida:
Acoustic Simulation in Dynamic Environments for Robot Audition. EUSIPCO 2019: 1-5 - [c185]Shinji Sumitani, Reiji Suzuki, Naoaki Chiba, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi Gitchang Okuno:
An Integrated Framework for Field Recording, Localization, Classification and Annotation of Birdsongs Using Robot Audition Techniques - Harkbird 2.0. ICASSP 2019: 8246-8250 - [c184]Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai, Tetsuya Ogata:
Weakly-Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation. IJCNN 2019: 1-8 - [c183]Yui Sudo, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Environmental sound segmentation utilizing Mask U-Net. IROS 2019: 5340-5345 - [c182]Daniel Gabriel, Ryosuke Kojima, Kotaro Hoshiba, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Design and assessment of multiple-sound source localization using microphone arrays. SII 2019: 199-204 - [c181]Makoto Kumon, Kai Washizaki, Kazuhiro Nakadai:
Close Sound Source Localization incorporating Semi-Supervised Variational Bayesian NMF. SII 2019: 313-318 - [p1]Kenzo Nonami, Kotaro Hoshiba, Kazuhiro Nakadai, Makoto Kumon, Hiroshi G. Okuno, Yasutada Tanabe, Koichi Yonezawa, Hiroshi Tokutake, Satoshi Suzuki, Kohei Yamaguchi, Shigeru Sunada, Takeshi Takaki, Toshiyuki Nakata, Ryusuke Noda, Hao Liu, Satoshi Tadokoro:
Recent R&D Technologies and Future Prospective of Flying Robot in Tough Robotics Challenge. Disaster Robotics 2019: 77-142 - 2018
- [j43]Kotaro Hoshiba, Kazuhiro Nakadai, Makoto Kumon, Hiroshi G. Okuno:
Assessment of MUSIC-Based Noise-Robust Sound Source Localization with Active Frequency Range Filtering. J. Robotics Mechatronics 30(3): 426-435 (2018) - [j42]Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Tatsuya Kawahara, Hiroshi G. Okuno:
Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 215-230 (2018) - [c180]Shinji Sumitani, Reiji Suzuki, Shiho Matsubayashi, Takaya Arita, Kazuhiro Nakadai, Hiroshi G. Okuno:
Extracting the Relationship between the Spatial Distribution and Types of Bird Vocalizations Using Robot Audition System HARK. IROS 2018: 2485-2490 - [c179]Ryosuke Kojima, Osamu Sugiyama, Kotaro Hoshiba, Reiji Suzuki, Kazuhiro Nakadai:
HARK-Bird-Box: A Portable Real-time Bird Song Scene Analysis System. IROS 2018: 2497-2502 - [c178]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani:
Multi-timescale Feature-extraction Architecture of Deep Neural Networks for Acoustic Model Training from Raw Speech Signal. IROS 2018: 2503-2510 - [c177]Heike Brock, Shigeaki Nishina, Kazuhiro Nakadai:
To animate or anime-te?: Investigating sign avatar comprehensibility. IVA 2018: 331-332 - [c176]Heike Brock, Kazuhiro Nakadai:
Deep JSLC: A Multimodal Corpus Collection for Data-driven Generation of Japanese Sign Language Expressions. LREC 2018 - [c175]Agathe Balayn, Heike Brock, Kazuhiro Nakadai:
Data-driven development of Virtual Sign Language Communication Agents. RO-MAN 2018: 370-377 - [c174]Ryosuke Taniguchi, Kotaro Hoshiba, Katsutoshi Itoyama, Kenji Nishida, Kazuhiro Nakadai:
Signal Restoration based on Bi-directional LSTM with Spectral Filtering for Robot Audition. RO-MAN 2018: 955-960 - [i3]Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai, Tetsuya Ogata:
Weakly Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation. CoRR abs/1807.01126 (2018) - [i2]Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata:
CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments. CoRR abs/1811.02735 (2018) - 2017
- [j41]Lana Sinapayen, Keisuke Nakamura, Kazuhiro Nakadai, Hiroki Takahashi, Tetsuo Kinoshita:
Swarm of micro-quadrocopters for consensus-based sound source localization. Adv. Robotics 31(12): 624-633 (2017) - [j40]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani:
Acoustic model training based on node-wise weight boundary model for fast and small-footprint deep neural networks. Comput. Speech Lang. 46: 461-480 (2017) - [j39]Hiroshi G. Okuno, Kazuhiro Nakadai:
Editorial: Robot Audition Technologies. J. Robotics Mechatronics 29(1): 15 (2017) - [j38]Kazuhiro Nakadai, Hiroshi G. Okuno, Takeshi Mizumoto:
Development, Deployment and Applications of Robot Audition Open Source Software HARK. J. Robotics Mechatronics 29(1): 16-25 (2017) - [j37]Nelson Yalta, Kazuhiro Nakadai, Tetsuya Ogata:
Sound Source Localization Using Deep Learning Models. J. Robotics Mechatronics 29(1): 37-48 (2017) - [j36]Kazuhiro Nakadai, Tomoaki Koiwa:
Psychologically-Inspired Audio-Visual Speech Recognition Using Coarse Speech Recognition and Missing Feature Theory. J. Robotics Mechatronics 29(1): 105-113 (2017) - [j35]Kazuhiro Nakadai, Taiki Tezuka, Takami Yoshida:
Ego-Noise Suppression for Robots Based on Semi-Blind Infinite Non-Negative Matrix Factorization. J. Robotics Mechatronics 29(1): 114-124 (2017) - [j34]Kotaro Hoshiba, Osamu Sugiyama, Akihide Nagamine, Ryosuke Kojima, Makoto Kumon, Kazuhiro Nakadai:
Design and Assessment of Sound Source Localization System with a UAV-Embedded Microphone Array. J. Robotics Mechatronics 29(1): 154-167 (2017) - [j33]Takuma Ohata, Keisuke Nakamura, Akihide Nagamine, Takeshi Mizumoto, Takayuki Ishizaki, Ryosuke Kojima, Osamu Sugiyama, Kazuhiro Nakadai:
Outdoor Sound Source Detection Using a Quadcopter with Microphone Array. J. Robotics Mechatronics 29(1): 177-187 (2017) - [j32]Osamu Sugiyama, Satoshi Uemura, Akihide Nagamine, Ryosuke Kojima, Keisuke Nakamura, Kazuhiro Nakadai:
Outdoor Acoustic Event Identification with DNN Using a Quadrotor-Embedded Microphone Array. J. Robotics Mechatronics 29(1): 188-197 (2017) - [j31]Reiji Suzuki, Shiho Matsubayashi, Richard W. Hedley, Kazuhiro Nakadai, Hiroshi G. Okuno:
HARKBird: Exploring Acoustic Interactions in Bird Communities Using a Microphone Array. J. Robotics Mechatronics 29(1): 213-223 (2017) - [j30]Shiho Matsubayashi, Reiji Suzuki, Fumiyuki Saito, Tatsuyoshi Murate, Tomohisa Masuda, Koichi Yamamoto, Ryosuke Kojima, Kazuhiro Nakadai, Hiroshi G. Okuno:
Acoustic Monitoring of the Great Reed Warbler Using Multiple Microphone Arrays and Robot Audition. J. Robotics Mechatronics 29(1): 224-235 (2017) - [j29]Ryosuke Kojima, Osamu Sugiyama, Kotaro Hoshiba, Kazuhiro Nakadai, Reiji Suzuki, Charles E. Taylor:
Bird Song Scene Analysis Using a Spatial-Cue-Based Probabilistic Model. J. Robotics Mechatronics 29(1): 236-246 (2017) - [j28]Kotaro Hoshiba, Kai Washizaki, Mizuho Wakabayashi, Takahiro Ishiki, Makoto Kumon, Yoshiaki Bando, Daniel Gabriel, Kazuhiro Nakadai, Hiroshi G. Okuno:
Design of UAV-Embedded Microphone Array System for Sound Source Localization in Outdoor Environments. Sensors 17(11): 2535 (2017) - [c173]Ryosuke Kojima, Osamu Sugiyama, Kotaro Hoshiba, Reiji Suzuki, Kazuhiro Nakadai:
A Spatial-Cue-Based Probabilistic Model for Bird Song Scene Analysis. DSAA 2017: 395-404 - [c172]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani:
Node Pruning Based on Entropy of Weights and Node Activity for Small-Footprint Acoustic Model Based on Deep Neural Networks. INTERSPEECH 2017: 1636-1640 - [c171]Kazuhiro Nakadai, Makoto Kumon, Hiroshi G. Okuno, Kotaro Hoshiba, Mizuho Wakabayashi, Kai Washizaki, Takahiro Ishiki, Daniel Gabriel, Yoshiaki Bando, Takayuki Morito, Ryosuke Kojima, Osamu Sugiyama:
Development of microphone-array-embedded UAV for search and rescue task. IROS 2017: 5985-5990 - 2016
- [j27]Ryosuke Kojima, Osamu Sugiyama, Kazuhiro Nakadai:
Multimodal Scene Understanding Framework and Its Application to Cooking Recognition. Appl. Artif. Intell. 30(3): 181-200 (2016) - [c170]Cosmin Munteanu, Pourang Irani, Sharon L. Oviatt, Matthew P. Aylett, Gerald Penn, Shimei Pan, Nikhil Sharma, Frank Rudzicz, Randy Gomez, Keisuke Nakamura, Kazuhiro Nakadai:
Designing Speech and Multimodal Interactions for Mobile, Wearable, and Pervasive Applications. CHI Extended Abstracts 2016: 3612-3619 - [c169]Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Hiroshi G. Okuno:
Variational Bayesian multi-channel robust NMF for human-voice enhancement with a deformable and partially-occluded microphone array. EUSIPCO 2016: 1018-1022 - [c168]Takayuki Morito, Osamu Sugiyama, Satoshi Uemura, Ryosuke Kojima, Kazuhiro Nakadai:
Reduction of Computational Cost Using Two-Stage Deep Neural Network for Training for Denoising and Sound Source Identification. IEA/AIE 2016: 562-573 - [c167]Reiji Suzuki, Shiho Matsubayashi, Kazuhiro Nakadai, Hiroshi G. Okuno:
Localizing Bird Songs Using an Open Source Robot Audition System with a Microphone Array. INTERSPEECH 2016: 2626-2630 - [c166]Ryosuke Kojima, Osamu Sugiyama, Reiji Suzuki, Kazuhiro Nakadai, Charles E. Taylor:
Semi-automatic bird song analysis by spatial-cue-based integration of sound source detection, localization, separation, and identification. IROS 2016: 1287-1292 - [c165]Takayuki Morito, Osamu Sugiyama, Ryosuke Kojima, Kazuhiro Nakadai:
Partially Shared Deep Neural Network in sound source separation and identification using a UAV-embedded microphone array. IROS 2016: 1299-1304 - [c164]Kouhei Sekiguchi, Yoshiaki Bando, Keisuke Nakamura, Kazuhiro Nakadai, Katsutoshi Itoyama, Kazuyoshi Yoshii:
Online simultaneous localization and mapping of multiple sound sources and asynchronous microphone arrays. IROS 2016: 1973-1979 - [c163]Daobilige Su, Keisuke Nakamura, Kazuhiro Nakadai, Jaime Valls Miró:
Robust sound source mapping using three-layered selective audio rays for mobile robots. IROS 2016: 2771-2777 - [c162]Nurul Lubis, Randy Gomez, Sakriani Sakti, Keisuke Nakamura, Koichiro Yoshino, Satoshi Nakamura, Kazuhiro Nakadai:
Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition. LREC 2016 - [c161]Randy Gomez, Yurii Vasylkiv, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai:
Leveraging phantom signals for improved voice-based human-robot interaction. RO-MAN 2016: 30-35 - [i1]Jean-Marc Valin, Shun'ichi Yamamoto, Jean Rouat, François Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno:
Robust Recognition of Simultaneous Speech By a Mobile Robot. CoRR abs/1602.06442 (2016) - 2015
- [j26]Ui-Hyun Kim, Kazuhiro Nakadai, Hiroshi G. Okuno:
Improved sound source localization in horizontal plane for binaural robot audition. Appl. Intell. 42(1): 63-74 (2015) - [j25]Kuniaki Noda, Yuki Yamaguchi, Kazuhiro Nakadai, Hiroshi G. Okuno, Tetsuya Ogata:
Audio-visual speech recognition using deep learning. Appl. Intell. 42(4): 722-737 (2015) - [j24]Yoshiaki Bando, Takuma Otsuka, Takeshi Mizumoto, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Hiroshi G. Okuno:
Posture estimation of hose-shaped robot by using active microphone array. Adv. Robotics 29(1): 35-49 (2015) - [j23]Kenta Yonekura, Chyon Hae Kim, Kazuhiro Nakadai, Hiroshi Tsujino, Kazuhito Yokoi:
Prevention of accomplishing synchronous multi-modal human-robot cooperation by using visual rhythms. Adv. Robotics 29(14): 901-912 (2015) - [j22]João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Fabien Gouyon, Luís Paulo Reis:
Beat Tracking for Interactive Dancing Robots. Int. J. Humanoid Robotics 12(4): 1550023:1-1550023:24 (2015) - [c160]Ryu Takeda, Kazunori Komatani, Kazuhiro Nakadai:
Acoustic model training based on node-wise weight boundary model increasing speed of discrete neural networks. ASRU 2015: 52-58 - [c159]Kuniaki Noda, Naoya Hashimoto, Kazuhiro Nakadai, Tetsuya Ogata:
Sound source separation for robot audition using deep learning. Humanoids 2015: 389-394 - [c158]Osamu Sugiyama, Ryosuke Kojima, Kazuhiro Nakadai:
Interactive interface to optimize sound source localization based on microphone array with coarse-to-fine tuning for humanoids. Humanoids 2015: 825-830 - [c157]Randy Gomez, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai:
Compensating changes in speaker position for improved voice-based human-robot communication. Humanoids 2015: 977-982 - [c156]Hiroshi G. Okuno, Kazuhiro Nakadai:
Robot audition: Its rise and perspectives. ICASSP 2015: 5610-5614 - [c155]Randy Gomez, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai:
Temporal smearing compensation in reverberant environment for speech-based human-robot interaction. ICRA 2015: 3347-3353 - [c154]Keisuke Nakamura, Surya Ambrose, Kazuhiro Nakadai:
On-the-spot calibration of microphone array Transfer Functions for robot audition. ICRA 2015: 3354-3359 - [c153]Osamu Sugiyama, Ryosuke Kojima, Kazuhiro Nakadai:
Interactive Interface to Optimize Sound Source Localization with HARK. IEA/AIE 2015: 262-271 - [c152]Ryosuke Kojima, Osamu Sugiyama, Kazuhiro Nakadai:
Scene Understanding Based on Sound and Text Information for a Cooking Support Robot. IEA/AIE 2015: 665-674 - [c151]Randy Gomez, Levko Ivanchuk, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai:
Dereverberation for active human-robot communication robust to speaker's face orientation. INTERSPEECH 2015: 180-184 - [c150]Ryosuke Kojima, Osamu Sugiyama, Kazuhiro Nakadai:
Audio-visual scene understanding utilizing text information for a cooking support robot. IROS 2015: 4210-4215 - [c149]Randy Gomez, Levko Ivanchuk, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai:
Utilizing visual cues in robot audition for sound source discrimination in speech-based human-robot communication. IROS 2015: 4216-4222 - [c148]Keisuke Nakamura, Kazuhiro Nakadai:
Robot audition based Acoustic Event Identification using a Bayesian model considering spectral and temporal uncertainties. IROS 2015: 4840-4845 - [c147]Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Hiroshi G. Okuno:
Microphone-accelerometer based 3D posture estimation for a hose-shaped rescue robot. IROS 2015: 5580-5586 - [c146]Kazuhiro Nakadai, Takeshi Mizumoto, Keisuke Nakamura:
Robot-Audition-based Human-Machine Interface for a Car. IROS 2015: 6129-6136 - [c145]Keisuke Nakamura, Lana Sinapayen, Kazuhiro Nakadai:
Interactive sound source localization using robot audition for tablet devices. IROS 2015: 6137-6142 - [c144]Masaaki Takahashi, Masa Ogata, Michita Imai, Keisuke Nakamura, Kazuhiro Nakadai:
A case study of an automatic volume control interface for a telepresence system. RO-MAN 2015: 517-522 - [c143]Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Hiroshi G. Okuno:
Human-voice enhancement based on online RPCA for a hose-shaped rescue robot with a microphone array. SSRR 2015: 1-6 - 2014
- [j21]Hirofumi Nakajima, Keiko Kikuchi, Kazuhiro Nakadai, Yutaka Kaneda:
Sound Source Orientation Estimation Based on an Orientation-Extended Beamformer. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 97-A(9): 1875-1883 (2014) - [c142]Akira Hayamizu, Michita Imai, Keisuke Nakamura, Kazuhiro Nakadai:
Volume adaptation and visualization by modeling the volume level in noisy environments for telepresence system. HAI 2014: 67-74 - [c141]Randy Gomez, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai:
Improved hands-free automatic speech recognition in reverberant environment condition. HSCMA 2014: 67-71 - [c140]Taiki Tezuka, Takami Yoshida, Kazuhiro Nakadai:
Ego-motion noise suppression for robots based on Semi-Blind Infinite Non-negative Matrix Factorization. ICRA 2014: 6293-6298 - [c139]Kuniaki Noda, Yuki Yamaguchi, Kazuhiro Nakadai, Hiroshi G. Okuno, Tetsuya Ogata:
Lipreading using convolutional neural network. INTERSPEECH 2014: 1149-1153 - [c138]Randy Gomez, Koji Inoue, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai:
Speech-based human-robot interaction robust to acoustic reflections in real environment. IROS 2014: 1367-1373 - [c137]João Lobato Oliveira, Keisuke Nakamura, Thibault Langlois, Fabien Gouyon, Kazuhiro Nakadai, Angelica Lim, Luís Paulo Reis, Hiroshi G. Okuno:
Making a robot dance to diverse musical genre in noisy environments. IROS 2014: 1896-1901 - [c136]Takuma Ohata, Keisuke Nakamura, Takeshi Mizumoto, Taiki Tezuka, Kazuhiro Nakadai:
Improvement in outdoor sound source detection using a quadrotor-embedded microphone array. IROS 2014: 1902-1907 - [c135]Osamu Sugiyama, Katsutoshi Itoyama, Kazuhiro Nakadai, Hiroshi G. Okuno:
Sound annotation tool for multidirectional sounds based on spatial information extracted by HARK robot audition software. SMC 2014: 2335-2340 - [c134]Gautam Narang, Keisuke Nakamura, Kazuhiro Nakadai:
Auditory-aware navigation for mobile robots based on reflection-robust sound source localization and visual SLAM. SMC 2014: 4021-4026 - [c133]Yoshiaki Bando, Katsutoshi Itoyama, Masashi Konyo, Satoshi Tadokoro, Kazuhiro Nakadai, Kazuyoshi Yoshii, Hiroshi G. Okuno:
A sound-based online method for estimating the time-varying posture of a hose-shaped robot. SSRR 2014: 1-6 - 2013
- [j20]Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno:
A real-time super-resolution robot audition system that improves the robustness of simultaneous speech recognition. Adv. Robotics 27(12): 933-945 (2013) - [j19]Futoshi Asano, Hideki Asoh, Kazuhiro Nakadai:
Sound Source Localization Using Joint Bayesian Estimation With a Hierarchical Noise Model. IEEE Trans. Speech Audio Process. 21(9): 1953-1965 (2013) - [c132]Martin Heckmann, Keisuke Nakamura, Kazuhiro Nakadai:
Differences in the audio-visual detection of word prominence from Japanese and English speakers. AVSP 2013: 209-214 - [c131]Randy Gomez, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai:
Mitigating the effects of reverberation for effective human-robot interaction in the real world. Humanoids 2013: 177-182 - [c130]Randy Gomez, Keisuke Nakamura, Kazuhiro Nakadai:
Robustness to speaker position in distant-talking automatic speech recognition. ICASSP 2013: 7034-7038 - [c129]Mihoko Otake, Myagmarbayar Nergui, Seong-eun Moon, Kentaro Takagi, Tsutomu Kamashima, Kazuhiro Nakadai:
Development of a Sound Source Localization System for Assisting Group Conversation. ICIRA (1) 2013: 532-539 - [c128]Randy Gomez, Keisuke Nakamura, Kazuhiro Nakadai, Ui-Hyun Kim, Hiroshi G. Okuno, Tatsuya Kawahara:
Hands-free human-robot communication robust to speaker's radial position. ICRA 2013: 4329-4334 - [c127]Ui-Hyun Kim, Kazuhiro Nakadai, Hiroshi G. Okuno:
Improved Sound Source Localization and Front-Back Disambiguation for Humanoid Robots with Two Ears. IEA/AIE 2013: 282-291 - [c126]Randy Gomez, Keisuke Nakamura, Kazuhiro Nakadai:
Dereverberation robust to speaker's azimuthal orientation in multi-channel human-robot communication. IROS 2013: 3439-3444 - [c125]Yoshiaki Bando, Takeshi Mizumoto, Katsutoshi Itoyama, Kazuhiro Nakadai, Hiroshi G. Okuno:
Posture estimation of hose-shaped robot using microphone array localization. IROS 2013: 3446-3451 - [c124]Koutarou Furukawa, Keita Okutani, Kohei Nagira, Takuma Otsuka, Katsutoshi Itoyama, Kazuhiro Nakadai, Hiroshi G. Okuno:
Noise correlation matrix estimation for improving sound source localization by multirotor UAV. IROS 2013: 3943-3948 - [c123]Keisuke Nakamura, Randy Gomez, Kazuhiro Nakadai:
Real-time super-resolution three-dimensional sound source localization for robots. IROS 2013: 3949-3954 - [c122]Kazuhiro Nakadai, Yuta Fujii, Shigeki Sugano:
Footstep detection and classification using distributed microphones. WIAMIS 2013: 1-4 - 2012
- [j18]Takami Yoshida, Kazuhiro Nakadai:
Audio-Visual Voice Activity Detection Based on an Utterance State Transition Model. Adv. Robotics 26(10): 1183-1201 (2012) - [j17]Hiroaki Miura, Takami Yoshida, Keisuke Nakamura, Kazuhiro Nakadai:
SLAM-based Online Calibration for Asynchronous Microphone Array. Adv. Robotics 26(17): 1941-1965 (2012) - [j16]Kenta Yonekura, Chyon Hae Kim, Kazuhiro Nakadai, Hiroshi Tsujino, Shigeki Sugano:
A role of multi-modal rhythms in physical interaction and cooperation. EURASIP J. Audio Speech Music. Process. 2012: 12 (2012) - [j15]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Efficient Blind Dereverberation and Echo Cancellation Based on Independent Component Analysis for Actual Acoustic Signals. Neural Comput. 24(1): 234-272 (2012) - [c121]Futoshi Asano, Hideki Asoh, Kazuhiro Nakadai:
Estimation of the number of sources and their locations in colored noise using reversible jump MCMC. EUSIPCO 2012: 609-613 - [c120]Randy Gomez, Tatsuya Kawahara, Keisuke Nakamura, Kazuhiro Nakadai:
Multi-party human-robot interaction with distant-talking speech recognition. HRI 2012: 439-446 - [c119]Takami Yoshida, Kazuhiro Nakadai:
Active audio-visual integration for Voice Activity Detection based on a Causal Bayesian Network. Humanoids 2012: 370-375 - [c118]Tatsuhiko Itohara, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno:
Improvement of audio-visual score following in robot ensemble with human guitarist. Humanoids 2012: 574-579 - [c117]Futoshi Asano, Hideki Asoh, Kazuhiro Nakadai:
Sound source localization in spatially colored noise using a hierarchical Bayesian model. ICASSP 2012: 193-196 - [c116]João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai:
Online audio beat tracking for a dancing robot in the presence of ego-motion noise in a real environment. ICRA 2012: 403-408 - [c115]Keisuke Nakamura, Kazuhiro Nakadai, Gökhan Ince:
Real-time super-resolution Sound Source Localization for robots. IROS 2012: 694-699 - [c114]João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luís Paulo Reis, Fabien Gouyon:
Live assessment of beat tracking for robot audition. IROS 2012: 992-997 - [c113]Gökhan Ince, Kazuhiro Nakadai, Keisuke Nakamura:
Online learning for template-based multi-channel ego noise estimation. IROS 2012: 3282-3287 - [c112]Keita Okutani, Takami Yoshida, Keisuke Nakamura, Kazuhiro Nakadai:
Outdoor auditory scene analysis using a moving microphone array embedded in a quadrocopter. IROS 2012: 3288-3293 - [c111]João Lobato Oliveira, Gökhan Ince, Keisuke Nakamura, Kazuhiro Nakadai, Hiroshi G. Okuno, Luís Paulo Reis, Fabien Gouyon:
An active audition framework for auditory-driven HRI: Application to interactive robot dancing. RO-MAN 2012: 1078-1085 - 2011
- [j14]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura:
Ego noise cancellation of a robot using missing feature masks. Appl. Intell. 34(3): 360-371 (2011) - [j13]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura:
Whole Body Motion Noise Cancellation of a Robot for Improved Automatic Speech Recognition. Adv. Robotics 25(11-12): 1405-1426 (2011) - [j12]Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno:
Real-Time Audio-to-Score Alignment Using Particle Filter for Coplayer Music Robots. EURASIP J. Adv. Signal Process. 2011 (2011) - [j11]Mikio Nakano, Yuji Hasegawa, Kotaro Funakoshi, Johane Takeuchi, Toyotaka Torii, Kazuhiro Nakadai, Naoyuki Kanda, Kazunori Komatani, Hiroshi G. Okuno, Hiroshi Tsujino:
A multi-expert model for dialogue and behavior control of conversational robots and agents. Knowl. Based Syst. 24(2): 248-256 (2011) - [c110]Kenta Yonekura, Chyon Hae Kim, Kazuhiro Nakadai, Hiroshi Tsujino, Shigeki Sugano:
Rhythmic reference of a human while a rope turning task. HRI 2011: 289-290 - [c109]Keisuke Nakamura, Kazuhiro Nakadai, Hirofumi Nakajima, Gökhan Ince:
Correlation matrix interpolation in Sound Source Localization for a robot. ICASSP 2011: 4324-4327 - [c108]Takeshi Mizumoto, Kazuhiro Nakadai, Takami Yoshida, Ryu Takeda, Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno:
Design and implementation of selectable sound separation on the Texai telepresence system using HARK. ICRA 2011: 2130-2137 - [c107]Gökhan Ince, Keisuke Nakamura, Futoshi Asano, Hirofumi Nakajima, Kazuhiro Nakadai:
Assessment of general applicability of ego noise estimation. ICRA 2011: 3517-3522 - [c106]Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno:
Bayesian Extension of MUSIC for Sound Source Localization and Tracking. INTERSPEECH 2011: 3109-3112 - [c105]Martin Heckmann, Kazuhiro Nakadai, Hirofumi Nakajima:
Robust Intonation Pattern Classification in Human Robot Interaction. INTERSPEECH 2011: 3137-3140 - [c104]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Jun-ichi Imura, Keisuke Nakamura, Hirofumi Nakajima:
Assessment of single-channel ego noise estimation methods. IROS 2011: 106-111 - [c103]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Jun-ichi Imura, Keisuke Nakamura, Hirofumi Nakajima:
Incremental learning for ego noise estimation of a robot. IROS 2011: 131-136 - [c102]Keisuke Nakamura, Kazuhiro Nakadai, Futoshi Asano, Gökhan Ince:
Intelligent sound source localization and its application to multimodal human tracking. IROS 2011: 143-148 - [c101]Hiroaki Miura, Takami Yoshida, Keisuke Nakamura, Kazuhiro Nakadai:
SLAM-based online calibration of asynchronous microphone array for robot audition. IROS 2011: 524-529 - [c100]Zheng Gong, Kazuhiro Nakadai, Hirofumi Nakajima, Ichiro Hagiwara:
HARK based real-time single pane 3D auditory scene visualizer empowered by Speech Arrow. IROS 2011: 530-535 - [c99]Takuma Otsuka, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno:
Incremental Bayesian Audio-to-Score Alignment with Flexible Harmonic Structure Models. ISMIR 2011: 525-530 - 2010
- [j10]Kazuhiro Nakadai, Toru Takahashi, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
Design and Implementation of Robot Audition System 'HARK' - Open Source Software for Listening to Three Simultaneous Speakers. Adv. Robotics 24(5-6): 739-761 (2010) - [j9]Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Soft missing-feature mask generation for robot audition. Paladyn J. Behav. Robotics 1(1): 37-47 (2010) - [j8]Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Voice-awareness control for a humanoid robot consistent with its body posture and movements. Paladyn J. Behav. Robotics 1(1): 80-88 (2010) - [j7]Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino:
Blind Source Separation With Parameter-Free Adaptive Step-Size Method for Robot Audition. IEEE Trans. Speech Audio Process. 18(6): 1476-1485 (2010) - [c98]Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Design and Implementation of Two-level Synchronization for Interactive Music Robot. AAAI 2010: 1238-1244 - [c97]Takami Yoshida, Kazuhiro Nakadai:
Audio-visual speech recognition system for a robot. AVSP 2010: 1-2 - [c96]Randy Gomez, Tatsuya Kawahara, Kazuhiro Nakadai:
Robust hands-free Automatic Speech Recognition for human-machine interaction. Humanoids 2010: 138-143 - [c95]Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Improvement in listening capability for humanoid robot HRP-2. ICRA 2010: 470-475 - [c94]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Yuji Hasegawa, Hiroshi Tsujino, Jun-ichi Imura:
A hybrid framework for ego noise cancellation of a robot. ICRA 2010: 3623-3628 - [c93]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Upper-limit evaluation of robot audition based on ICA-BSS in multi-source, barge-in and highly reverberant conditions. ICRA 2010: 4366-4371 - [c92]Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno:
An Improvement in Audio-Visual Voice Activity Detection for Automatic Speech Recognition. IEA/AIE (1) 2010: 51-61 - [c91]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura:
Robust Ego Noise Suppression of a Robot. IEA/AIE (1) 2010: 62-71 - [c90]Takuma Otsuka, Takeshi Mizumoto, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Music-Ensemble Robot That Is Capable of Playing the Theremin While Listening to the Accompanied Music. IEA/AIE (1) 2010: 102-112 - [c89]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura:
A robust speech recognition system against the ego noise of a robot. INTERSPEECH 2010: 2070-2073 - [c88]Martin Heckmann, Claudius Gläser, Frank Joublin, Kazuhiro Nakadai:
Applying geometric source separation for improved pitch extraction in human-robot interaction. INTERSPEECH 2010: 2602-2605 - [c87]Takami Yoshida, Kazuhiro Nakadai:
Two-layered audio-visual integration in voice activity detection and automatic speech recognition for robots. INTERSPEECH 2010: 2702-2705 - [c86]Hirofumi Nakajima, Gökhan Ince, Kazuhiro Nakadai, Yuji Hasegawa:
An easily-configurable robot audition system using Histogram-based Recursive Level Estimation. IROS 2010: 958-963 - [c85]Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
An improvement in automatic speech recognition using soft missing feature masks for robot audition. IROS 2010: 964-969 - [c84]Kazuhiro Nakadai, Hirofumi Nakajima, Gökhan Ince, Yuji Hasegawa:
Sound source separation and automatic speech recognition for moving sources. IROS 2010: 976-981 - [c83]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Hiroshi Tsujino, Jun-ichi Imura:
Multi-talker speech recognition under ego-motion noise using Missing Feature Theory. IROS 2010: 982-987 - [c82]Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno:
Two-layered audio-visual speech recognition for robots in noisy environments. IROS 2010: 988-993 - [c81]Martin Heckmann, Frank Joublin, Kazuhiro Nakadai:
Pitch extraction in Human-Robot interaction. IROS 2010: 1482-1487 - [c80]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Speedup and performance improvement of ICA-based robot audition by parallel and resampling-based block-wise processing. IROS 2010: 1949-1956 - [c79]Takeshi Mizumoto, Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Human-robot ensemble between robot thereminist and human percussionist using coupled oscillator model. IROS 2010: 1957-1963 - [c78]Ryota Fujimura, Kazuhiro Nakadai, Michita Imai, Ren Ohmura:
PROT - An embodied agent for intelligible and user-friendly human-robot interaction. IROS 2010: 3860-3867 - [c77]Toshimasa Suzuki, Hirofumi Nakajima, Hideo Tsuru, Takayuki Arai, Kazuhiro Nakadai:
3D sound field recording and reproducing system including sound source orientation. IUCS 2010: 215-220
2000 – 2009
- 2009
- [c76]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Automatic estimation of reverberation time with robot speech to improve ICA-based robot audition. Humanoids 2009: 250-255 - [c75]Takuma Otsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Voice quality manipulation for humanoid robots consistent with their head movements. Humanoids 2009: 405-410 - [c74]Takami Yoshida, Kazuhiro Nakadai, Hiroshi G. Okuno:
Automatic speech recognition improved by two-layered audio-visual integration for robot audition. Humanoids 2009: 604-609 - [c73]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
ICA-based efficient blind dereverberation and echo cancellation method for barge-in-able robot audition. ICASSP 2009: 3677-3680 - [c72]Kazuhiro Nakadai, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
Sound source separation of moving speakers for robot audition. ICASSP 2009: 3685-3688 - [c71]Gökhan Ince, Kazuhiro Nakadai, Tobias Rodemann, Yuji Hasegawa, Hiroshi Tsujino, Jun-ichi Imura:
Ego noise suppression of a robot using template subtraction. IROS 2009: 199-204 - [c70]Keisuke Nakamura, Kazuhiro Nakadai, Futoshi Asano, Yuji Hasegawa, Hiroshi Tsujino:
Intelligent sound source localization for dynamic environments. IROS 2009: 664-669 - [c69]Hirofumi Nakajima, Keiko Kikuchi, Touru Daigo, Yutaka Kaneda, Kazuhiro Nakadai, Yuji Hasegawa:
Real-time sound source orientation estimation using a 96 channel microphone array. IROS 2009: 676-683 - [c68]Ryu Takeda, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Step-size parameter adaptation of multi-channel semi-blind ICA with piecewise linear model for barge-in-able robot audition. IROS 2009: 2277-2282 - [c67]Takuma Otsuka, Toru Takahashi, Hiroshi G. Okuno, Kazunori Komatani, Tetsuya Ogata, Kazumasa Murata, Kazuhiro Nakadai:
Incremental polyphonic audio to score alignment using beat tracking for singer robots. IROS 2009: 2289-2296 - [c66]Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model. IROS 2009: 2730-2735 - [c65]Hiroshi G. Okuno, Kazuhiro Nakadai, Hyun-Don Kim:
Robot Audition: Missing Feature Theory Approach and Active Audition. ISRR 2009: 227-244 - 2008
- [c64]Kazuhiro Nakadai, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
An open source software system for robot audition HARK and its evaluation. Humanoids 2008: 561-566 - [c63]Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino:
Adaptive step-size parameter control for real-world blind source separation. ICASSP 2008: 149-152 - [c62]Kazuhiro Nakadai, Shun'ichi Yamamoto, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
A robot referee for rock-paper-scissors sound games. ICRA 2008: 3469-3474 - [c61]Toru Takahashi, Shun'ichi Yamamoto, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Soft missing-feature mask generation for simultaneous speech recognition system in robots. INTERSPEECH 2008: 992-995 - [c60]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Barge-in-able robot audition based on ICA and missing feature theory under semi-blind situation. IROS 2008: 1718-1723 - [c59]Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino:
High performance sound source separation adaptable to environmental changes for robot audition. IROS 2008: 2165-2171 - [c58]Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotaka Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino:
A robot uses its own microphone to synchronize its steps to musical beats while scatting and singing. IROS 2008: 2459-2464 - [c57]Kazumasa Murata, Kazuhiro Nakadai, Kazuyoshi Yoshii, Ryu Takeda, Toyotaka Torii, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino:
A Robot Singer with Music Recognition Based on Real-Time Beat Tracking. ISMIR 2008: 199-204 - 2007
- [j6]Jean-Marc Valin, Seiichi Yamamoto, Jean Rouat, François Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno:
Robust Recognition of Simultaneous Speech by a Mobile Robot. IEEE Trans. Robotics 23(4): 742-752 (2007) - [c56]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech. ASRU 2007: 111-116 - [c55]Kentaro Ishii, Yukiko Yamamoto, Michita Imai, Kazuhiro Nakadai:
A Navigation System Using Ultrasonic Directional Speaker with Rotating Base. HCI (9) 2007: 526-535 - [c54]Kazuhiro Nakadai, Ryota Sumiya, Mikio Nakano, Koichi Ichige, Yasuo Hirose, Hiroshi Tsujino:
The Design of Phoneme Grouping for Coarse Phoneme Recognition. IEA/AIE 2007: 905-914 - [c53]Kazuyoshi Yoshii, Kazuhiro Nakadai, Toyotaka Torii, Yuji Hasegawa, Hiroshi Tsujino, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
A biped robot that keeps steps in time with musical beats while listening to music with its own ears. IROS 2007: 1743-1750 - [c52]Tomoaki Koiwa, Kazuhiro Nakadai, Jun-ichi Imura:
Coarse speech recognition by audio-visual integration based on missing feature theory. IROS 2007: 1751-1756 - [c51]Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition. IROS 2007: 1757-1762 - [c50]Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino:
Moving Sound Source Extraction by Time-Variant Beamforming. JSAI 2007: 47-53 - 2006
- [c49]Yoshitaka Nishimura, Mitsuru Ishizuka, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino:
Speech Recognition for a Humanoid with Motor Noise Utilizing Missing Feature Theory. Humanoids 2006: 26-33 - [c48]Mikio Nakano, Atsushi Hoshino, Johane Takeuchi, Yuji Hasegawa, Toyotaka Torii, Kazuhiro Nakadai, Kazuhiko Kato, Hiroshi Tsujino:
A Robot That Can Engage in Both Task-Oriented and Non-Task-Oriented Dialogues. Humanoids 2006: 404-411 - [c47]Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, Satoshi Kaijiri, Kentaro Yamada, Takahiro Nakamura, Yuji Hasegawa, Hiroshi G. Okuno, Hiroshi Tsujino:
Robust Tracking of Multiple Sound Sources by Spatial Integration of Room And Robot Microphone Arrays. ICASSP (4) 2006: 929-932 - [c46]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals. IEA/AIE 2006: 207-217 - [c45]Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Leak energy based missing feature mask generation for ICA and GSS and its evaluation with simultaneous speech recognition. SAPA@INTERSPEECH 2006: 42-47 - [c44]Yoshitaka Nishimura, Mikio Nakano, Kazuhiro Nakadai, Hiroshi Tsujino, Mitsuru Ishizuka:
Speech recognition for a robot under its motor noises by selective application of missing feature theory and MLLR. SAPA@INTERSPEECH 2006: 53-58 - [c43]Kazuhiro Nakadai, Hirofumi Nakajima, Masamitsu Murase, Hiroshi G. Okuno, Yuji Hasegawa, Hiroshi Tsujino:
Real-Time Tracking of Multiple Sound Sources by Integration of In-Room and Robot-Embedded Microphone Arrays. IROS 2006: 852-859 - [c42]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World. IROS 2006: 5333-5338 - [c41]Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition. PRICAI 2006: 484-494 - [c40]Kazunori Komatani, Naoyuki Kanda, Mikio Nakano, Kazuhiro Nakadai, Hiroshi Tsujino, Tetsuya Ogata, Hiroshi G. Okuno:
Multi-Domain Spoken Dialogue System with Extensibility and Robustness against Speech Recognition Errors. SIGDIAL Workshop 2006: 9-17 - 2005
- [c39]Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Jean Rouat, François Michaud, Tetsuya Ogata, Hiroshi G. Okuno:
Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory. ICRA 2005: 1477-1482 - [c38]Kazuhiro Nakadai, Hiroshi Tsujino:
Towards New Human-Humanoid Communication: Listening During Speaking by Using Ultrasonic Directional Speaker. ICRA 2005: 1483-1488 - [c37]Masamitsu Murase, Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Kentaro Yamada, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Multiple moving speaker tracking by microphone array on mobile robot. INTERSPEECH 2005: 249-252 - [c36]Kazuhiro Nakadai, Hirofumi Nakajima, Kentaro Yamada, Yuji Hasegawa, Takahiro Nakamura, Hiroshi Tsujino:
Sound source tracking with directivity pattern estimation using a 64 ch microphone array. IROS 2005: 1690-1696 - [c35]Shunsuke Kurotaki, Noriaki Suzuki, Kazuhiro Nakadai, Hiroshi G. Okuno, Hideharu Amano:
Implementation of active direction-pass filter on dynamically reconfigurable processor. IROS 2005: 3175-3180 - [c34]Mikio Nakano, Yuji Hasegawa, Kazuhiro Nakadai, Takahiro Nakamura, Johane Takeuchi, Toyotaka Torii, Hiroshi Tsujino, Naoyuki Kanda, Hiroshi G. Okuno:
A two-layer model for behavior and dialogue planning in conversational service robots. IROS 2005: 3329-3335 - [c33]Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Jean Rouat, François Michaud, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Making a robot recognize three simultaneous sentences in real-time. IROS 2005: 4040-4045 - 2004
- [j5]Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot. Appl. Intell. 20(3): 253-266 (2004) - [j4]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano:
Effects of increasing modalities in recognizing three simultaneous speeches. Speech Commun. 43(4): 347-359 (2004) - [j3]Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino:
Improvement of recognition of simultaneous speech signals using AV integration and scattering theory for humanoid robots. Speech Commun. 44(1-4): 97-112 (2004) - [c32]Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Toshio Yokoyama, Hiroshi G. Okuno:
Improvement of Robot Audition by Interfacing Sound Source Separation and Automatic Speech Recognition with Missing Feature Theory. ICRA 2004: 1517-1523 - [c31]Tokitomo Ariyoshi, Kazuhiro Nakadai, Hiroshi Tsujino:
Multimodal expression for humanoid robots by integration of human speech mimicking and facial color. INTERSPEECH 2004: 2305-2308 - [c30]Shun'ichi Yamamoto, Kazuhiro Nakadai, Hiroshi Tsujino, Hiroshi G. Okuno:
Assessment of general applicability of robot audition system by recognizing three simultaneous speeches. IROS 2004: 2111-2116 - 2003
- [j2]Hiroshi G. Okuno, Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano:
Human-robot non-verbal interaction empowered by real-time auditory and visual multiple-talker tracking. Adv. Robotics 17(2): 115-130 (2003) - [c29]Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino:
Improvement of three simultaneous speech recognition by using AV integration and scattering theory for humanoid. AVSP 2003: 157-162 - [c28]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano:
Realizing personality in audio-visually triggered non-verbal behaviors. ICRA 2003: 392-397 - [c27]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
Robot recognizes three simultaneous speech by active audition. ICRA 2003: 398-405 - [c26]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano:
Design and Implementation of Personality of Humanoids in Human Humanoid Non-verbal Interaction. IEA/AIE 2003: 662-673 - [c25]Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino:
Three simultaneous speech recognition by integration of active audition and face recognition for humanoid. INTERSPEECH 2003: 2705-2708 - [c24]Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroaki Kitano:
Applying scattering theory to robot audition system: robust sound source localization and extraction. IROS 2003: 1147-1152 - [c23]Hiroshi G. Okuno, Kazuhiro Nakadai:
Real-Time Sound Source Localization and Separation Based on Active Audio-Visual Integration. IWANN (1) 2003: 118-125 - 2002
- [j1]Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroshi Mizoguchi, Hiroaki Kitano:
Real-time Auditory and Visual Multiple-speaker Tracking For Human-robot Interaction. J. Robotics Mechatronics 14(5): 479-489 (2002) - [c22]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
Exploiting Auditory Fovea in Humanoid-Human Interaction. AAAI/IAAI 2002: 431-438 - [c21]Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano:
Real-Time Speaker Localization and Speech Separation by Audio-Visual Integration. ICRA 2002: 1043-1049 - [c20]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano:
Social Interaction of Humanoid Robot Based on Audio-Visual Tracking. IEA/AIE 2002: 725-735 - [c19]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
Real-time sound source localization and separation for robot audition. INTERSPEECH 2002: 193-196 - [c18]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
Auditory fovea based speech enhancement and its application to human-robot dialog system. INTERSPEECH 2002: 1817-1820 - [c17]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
Auditory fovea based speech separation and its application to dialog system. IROS 2002: 1320-1325 - [c16]Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano:
Realizing Audio-Visually Triggered ELIZA-Like Non-verbal Behaviors. PRICAI 2002: 552-562 - 2001
- [c15]Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
A computational model of monkey grating cells for oriented repetitive alternating patterns. ESANN 2001: 315-322 - [c14]Tino Lourens, Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
Graph extraction from color images. ESANN 2001: 329-334 - [c13]Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano:
Sound and Visual Tracking for Humanoid Robot. IEA/AIE 2001: 640-650 - [c12]Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroshi G. Okuno, Hiroaki Kitano:
Real-Time Auditory and Visual Multiple-Object Tracking for Humanoids. IJCAI 2001: 1425-1436 - [c11]Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano:
Real-time multiple speaker tracking by multi-modal integration for mobile robots. INTERSPEECH 2001: 1193-1196 - [c10]Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano:
Separating three simultaneous speeches with two microphones by integrating auditory and visual processing. INTERSPEECH 2001: 2643-2646 - [c9]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
Epipolar geometry based sound localization and extraction for humanoid audition. IROS 2001: 1395-1401 - [c8]Hiroshi G. Okuno, Kazuhiro Nakadai, Ken-ichi Hidai, Hiroshi Mizoguchi, Hiroaki Kitano:
Human-robot interaction through real-time auditory and visual multiple-talker tracking. IROS 2001: 1402-1409 - 2000
- [c7]Kazuhiro Nakadai, Tino Lourens, Hiroshi G. Okuno, Hiroaki Kitano:
Active Audition for Humanoid. AAAI/IAAI 2000: 832-839 - [c6]Hiroaki Kitano, Hiroshi G. Okuno, Kazuhiro Nakadai, Iris Fermin, Theo Sabisch, Yukiko Nakagawa, Tatsuya Matsui:
Designing a humanoid head for RoboCup challenge. Agents 2000: 17-18 - [c5]Hiroaki Kitano, Hiroshi G. Okuno, Kazuhiro Nakadai, Theo Sabisch, Tatsuya Matsui:
Design and architecture of SIG the humanoid: an experimental platform for integrated perception in RoboCup humanoid challenge. IROS 2000: 181-190 - [c4]Kazuhiro Nakadai, Tatsuya Matsui, Hiroshi G. Okuno, Hiroaki Kitano:
Active audition system and humanoid exterior design. IROS 2000: 1453-1461 - [c3]Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano:
Humanoid Active Audition System Improved by the Cover Acoustics. PRICAI 2000: 544-554 - [c2]Ian Frank, Kumiko Tanaka-Ishii, Hiroshi G. Okuno, Junichi Akita, Yukiko Nakagawa, Kazuaki Maeda, Kazuhiro Nakadai, Hiroaki Kitano:
And the Fans Are Going Wild! SIG plus MIKE. RoboCup 2000: 139-148
1990 – 1999
- 1995
- [c1]Kunio Kashino, Kazuhiro Nakadai, Tomoyoshi Kinoshita, Hidehiko Tanaka:
Organization of Hierarchical Perceptual Sounds: Music Scene Analysis with Autonomous Processing Modules and a Quantitative Information Integration Mechanism. IJCAI 1995: 158-164
last updated on 2024-12-15 02:18 CET by the dblp team
all metadata released as open data under CC0 1.0 license