![](https://dblp.uni-trier.de./img/logo.320x120.png)
![search dblp search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
default search action
Koichi Shinoda
Person information
Refine list
![note](https://dblp.uni-trier.de./img/note-mark.dark.12x12.png)
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c116]Nitish Jaiswal
, Vi Duc Huan
, Felix Limanta
, Koichi Shinoda
, Masahiro Wakasa
:
Domain-Specific Adaptation for Enhanced Gait Recognition in Practical Scenarios. IVSP 2024: 8-15 - [c115]Shinichi Ka
, Koichi Shinoda
:
Co-speech Gesture Generation with Variational Auto Encoder. MMM (3) 2024: 155-168 - [c114]Felix Limanta, Kuniaki Uto, Koichi Shinoda:
CAMOT: Camera Angle-aware Multi-Object Tracking. WACV 2024: 6465-6474 - [d1]Yuzhe Hao
, Koichi Shinoda
:
EvIs-Kitchen. IEEE DataPort, 2024 - [i14]Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda:
Pyramid Coder: Hierarchical Code Generator for Compositional Visual Question Answering. CoRR abs/2407.20563 (2024) - [i13]Felix Limanta, Kuniaki Uto, Koichi Shinoda:
CAMOT: Camera Angle-aware Multi-Object Tracking. CoRR abs/2409.17533 (2024) - 2023
- [c113]Mitali Ahuja, Shuji Komeiji, Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano, Koichi Shinoda, Toshihisa Tanaka:
Multimodal recognition of speech and electrocorticogram. APSIPA ASC 2023: 546-550 - [c112]Lei Yang, Yuzhe Hao, Koichi Shinoda:
Sensor Data Representation with Transformer-Based Contrastive Learning for Human Action Recognition and Detection. EUSIPCO 2023: 1703-1707 - [c111]Kai Shigemi, Shuji Komeiji, Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano, Koichi Shinoda, Kohei Yatabe, Toshihisa Tanaka:
Synthesizing Speech from ECoG with a Combination of Transformer-Based Encoder and Neural Vocoder. ICASSP 2023: 1-5 - [c110]Yuzhe Hao, Kuniaki Uto, Asako Kanezaki, Ikuro Sato, Rei Kawakami, Koichi Shinoda:
EvIs-Kitchen: Egocentric Human Activities Recognition with Video and Inertial Sensor Data. MMM (1) 2023: 373-384 - [c109]Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda:
Text-Guided Object Detector for Multi-modal Video Question Answering. WACV 2023: 1032-1042 - 2022
- [c108]Pablo Cervantes
, Yusuke Sekikawa
, Ikuro Sato
, Koichi Shinoda
:
Implicit Neural Representations for Variable Length Human Motion Generation. ECCV (17) 2022: 356-372 - [c107]Shuji Komeiji, Kai Shigemi, Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano, Koichi Shinoda, Toshihisa Tanaka:
Transformer-Based Estimation of Spoken Sentences Using Electrocorticography. ICASSP 2022: 1311-1315 - [c106]Takeru Ito, Kuniaki Uto, Koichi Shinoda:
RI-DC: Rotation-Invariant Detection and Classification for Wheat Head Detection. IGARSS 2022: 5750-5753 - [c105]Kengo Machida, Kuniaki Uto, Koichi Shinoda, Taiji Suzuki:
MSR-DARTS: Minimum Stable Rank of Differentiable Architecture Search. IJCNN 2022: 1-9 - [i12]Pablo Cervantes, Yusuke Sekikawa, Ikuro Sato, Koichi Shinoda:
Implicit Neural Representations for Variable Length Human Motion Generation. CoRR abs/2203.13694 (2022) - 2021
- [j28]Mariana Rodrigues Makiuchi, Tifani Warnita, Nakamasa Inoue, Koichi Shinoda, Michitaka Yoshimura, Momoko Kitazawa, Kei Funaki, Yoko Eguchi, Taishiro Kishimoto:
Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network. IEICE Trans. Inf. Syst. 104-D(11): 1930-1940 (2021) - [c104]Kohei Ozamoto, Kuniaki Uto, Koji Iwano, Koichi Shinoda:
Noise-Tolerant Time-Domain Speech Separation with Noise Bases. APSIPA ASC 2021: 624-629 - [c103]Mariana Rodrigues Makiuchi, Kuniaki Uto, Koichi Shinoda:
Multimodal Emotion Recognition with High-Level Speech and Text Features. ASRU 2021: 350-357 - [i11]Mariana Rodrigues Makiuchi, Kuniaki Uto, Koichi Shinoda:
Multimodal Emotion Recognition with High-level Speech and Text Features. CoRR abs/2111.10202 (2021) - 2020
- [j27]Kong Aik Lee
, Hitoshi Yamamoto, Koji Okabe, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda:
NEC-TT System for Mixed-Bandwidth and Multi-Domain Speaker Recognition. Comput. Speech Lang. 61: 101033 (2020) - [c102]Yang Lu, Asri Rizki Yuliani
, Keisuke Ishikawa, Ronaldo Prata Amorim, Roland Hartanto, Nakamasa Inoue, Kuniaki Uto, Koichi Shinoda:
Deep Video Understanding of Character Relationships in Movies. ICMI Companion 2020: 120-129 - [c101]Kuniaki Uto, Mauro Dalla Mura, Yuka Sasaki, Koichi Shinoda:
Estimation of Leaf Angle Distribution Based on Statistical Properties of Leaf Shading Distribution. IGARSS 2020: 5195-5198 - [c100]Kong Aik Lee
, Koji Okabe, Hitoshi Yamamoto, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Keisuke Ishikawa, Koichi Shinoda:
NEC-TT Speaker Verification System for SRE'19 CTS Challenge. INTERSPEECH 2020: 2227-2231 - [c99]Ronaldo Prata Amorim, Nakamasa Inoue, Koichi Shinoda:
Tokyo Tech at TRECVID 2020: Relation Modeling for Video Action Detection. TRECVID 2020 - [e1]Kong-Aik Lee, Takafumi Koshinaka, Koichi Shinoda:
Odyssey 2020: The Speaker and Language Recognition Workshop, 1-5 November 2020, Tokyo, Japan. ISCA 2020 [contents] - [i10]Tifani Warnita, Mariana Rodrigues Makiuchi, Nakamasa Inoue, Koichi Shinoda, Michitaka Yoshimura, Momoko Kitazawa, Kei Funaki, Yoko Eguchi, Taishiro Kishimoto:
Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network. CoRR abs/2004.07992 (2020) - [i9]Kengo Machida, Kuniaki Uto, Koichi Shinoda, Taiji Suzuki:
Neural Architecture Search Using Stable Rank of Convolutional Layers. CoRR abs/2009.09209 (2020)
2010 – 2019
- 2019
- [j26]Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda:
Recurrent out-of-vocabulary word detection based on distribution of features. Comput. Speech Lang. 58: 247-259 (2019) - [c98]Raden Mu'az Mun'im, Nakamasa Inoue, Koichi Shinoda:
Sequence-level Knowledge Distillation for Model Compression of Attention-based Sequence-to-sequence Speech Recognition. ICASSP 2019: 6151-6155 - [c97]Kuniaki Uto, Mauro Dalla Mura
, Jocelyn Chanussot, Koichi Shinoda:
Estimation of Diffuse Component of Global Radiation Based on Leaf-Scale Crop Images. IGARSS 2019: 6263-6266 - [c96]Kong Aik Lee
, Hitoshi Yamamoto, Koji Okabe, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda:
The NEC-TT 2018 Speaker Verification System. INTERSPEECH 2019: 4355-4359 - [c95]Dongxiao Wang, Hirokazu Kameoka, Koichi Shinoda:
A Modified Algorithm for Multiple Input Spectrogram Inversion. INTERSPEECH 2019: 4569-4573 - [c94]Mariana Rodrigues Makiuchi, Tifani Warnita, Kuniaki Uto, Koichi Shinoda:
Multimodal Fusion of BERT-CNN and Gated CNN Representations for Depression Detection. AVEC@MM 2019: 55-63 - [i8]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - 2018
- [c93]Thao Le Minh, Nakamasa Inoue, Koichi Shinoda:
A Fine-to-Coarse Convolutional Neural Network for 3D Human Action Recognition. BMVC 2018: 227 - [c92]Haoyi Zhang, Conggui Liu, Nakamasa Inoue, Koichi Shinoda
:
Multi-Task Autoencoder for Noise-Robust Speech Recognition. ICASSP 2018: 5599-5603 - [c91]Thao Le Minh
, Nobuyuki Shimizu, Takashi Miyazaki, Koichi Shinoda:
Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances. IJCAI 2018: 1546-1553 - [c90]Tifani Warnita, Nakamasa Inoue, Koichi Shinoda
:
Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data. INTERSPEECH 2018: 1706-1710 - [c89]Koji Okabe, Takafumi Koshinaka, Koichi Shinoda
:
Attentive Statistics Pooling for Deep Speaker Embedding. INTERSPEECH 2018: 2252-2256 - [c88]Jiacen Zhang, Nakamasa Inoue, Koichi Shinoda
:
I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification. INTERSPEECH 2018: 3613-3617 - [c87]Nakamasa Inoue, Koichi Shinoda:
Few-Shot Adaptation for Multimedia Semantic Indexing. ACM Multimedia 2018: 1110-1118 - [c86]Nakamasa Inoue, Chihiro Shiraishi, Aleksandr Drozd, Koichi Shinoda, Shi-wook Lee, Alex ChiChung Kot:
VANT at TRECVID 2018. TRECVID 2018 - [i7]Koji Okabe, Takafumi Koshinaka, Koichi Shinoda:
Attentive Statistics Pooling for Deep Speaker Embedding. CoRR abs/1803.10963 (2018) - [i6]Tifani Warnita, Nakamasa Inoue, Koichi Shinoda:
Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data. CoRR abs/1803.11344 (2018) - [i5]Jiacen Zhang, Nakamasa Inoue, Koichi Shinoda:
I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification. CoRR abs/1804.00290 (2018) - [i4]Thao Le Minh, Nakamasa Inoue, Koichi Shinoda:
A Fine-to-Coarse Convolutional Neural Network for 3D Human Action Recognition. CoRR abs/1805.11790 (2018) - [i3]Nakamasa Inoue, Koichi Shinoda:
Few-Shot Adaptation for Multimedia Semantic Indexing. CoRR abs/1807.07203 (2018) - [i2]Thao Le Minh, Nobuyuki Shimizu, Takashi Miyazaki, Koichi Shinoda:
Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances. CoRR abs/1809.04288 (2018) - [i1]Raden Mu'az Mun'im, Nakamasa Inoue, Koichi Shinoda:
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition. CoRR abs/1811.04531 (2018) - 2017
- [j25]Tommi Kerola
, Nakamasa Inoue, Koichi Shinoda
:
Cross-view human action recognition from depth maps using spectral graph sequences. Comput. Vis. Image Underst. 154: 108-126 (2017) - [c85]Yuki Yasui, Nakamasa Inoue, Koji Iwano, Koichi Shinoda
:
Multimodal speech recognition using mouth images from depth camera. APSIPA 2017: 1233-1236 - [c84]Conggui Liu, Nakamasa Inoue, Koichi Shinoda
:
A unified network for multi-speaker speech recognition with multi-channel recordings. APSIPA 2017: 1304-1307 - [c83]Shinya Matsui, Nakamasa Inoue, Yuko Akagi, Goshu Nagino, Koichi Shinoda
:
User adaptation of convolutional neural network for human activity recognition. EUSIPCO 2017: 753-757 - [c82]Mengxi Lin, Nakamasa Inoue, Koichi Shinoda
:
CTC Network with Statistical Language Modeling for Action Sequence Recognition in Videos. ACM Multimedia (Thematic Workshops) 2017: 393-401 - [c81]Yasuhiro Shibasaki, Kotaro Funakoshi, Koichi Shinoda:
Boredom Recognition Based on Users' Spontaneous Behaviors in Multiparty Human-Robot Interactions. MMM (1) 2017: 677-689 - [c80]Nakamasa Inoue, Ryosuke Yamamoto, Na Rong, Satoshi Kanai, Junsuke Masada, Chihiro Shiraishi, Shi-wook Lee, Koichi Shinoda:
TokyoTech-AIST at TRECVID 2017: Multimedia Event Detection Using Deep CNNs and Zero-Shot Classiers. TRECVID 2017 - 2016
- [j24]Johan Rohdin
, Sangeeta Biswas
, Koichi Shinoda
:
Robust discriminative training against data insufficiency in PLDA-based speaker verification. Comput. Speech Lang. 35: 32-57 (2016) - [j23]Ryan Price, Ken-ichi Iso, Koichi Shinoda
:
Wise teachers train better DNN acoustic models. EURASIP J. Audio Speech Music. Process. 2016: 10 (2016) - [j22]Nakamasa Inoue, Koichi Shinoda
:
Fast Coding of Feature Vectors Using Neighbor-to-Neighbor Search. IEEE Trans. Pattern Anal. Mach. Intell. 38(6): 1170-1184 (2016) - [c79]Tommi Kerola, Nakamasa Inoue, Koichi Shinoda
:
Graph regularized implicit pose for 3D human action recognition. APSIPA 2016: 1-4 - [c78]Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda:
Recurrent Out-of-Vocabulary Word Detection Using Distribution of Features. INTERSPEECH 2016: 1320-1324 - [c77]Fumito Nishi, Nakamasa Inoue, Koji Iwano, Koichi Shinoda:
Tokyo Tech at MediaEval 2016 Multimodal Person Discovery in Broadcast TV task. MediaEval 2016 - [c76]Nakamasa Inoue, Koichi Shinoda
:
Adaptation of Word Vectors using Tree Structure for Visual Semantics. ACM Multimedia 2016: 277-281 - [c75]Nakamasa Inoue, Ryosuke Yamamoto, Na Rong, Koichi Shinoda:
TokyoTech at TRECVID 2016. TRECVID 2016 - 2015
- [j21]Yuan Liang, Koji Iwano, Koichi Shinoda
:
Error Correction Using Long Context Match for Smartphone Speech Recognition. IEICE Trans. Inf. Syst. 98-D(11): 1932-1942 (2015) - [j20]Sangeeta Biswas
, Johan Rohdin
, Koichi Shinoda
:
Autonomous selection of i-vectors for PLDA modelling in speaker verification. Speech Commun. 72: 32-46 (2015) - [c74]Fumito Nishi, Nakamasa Inoue, Koichi Shinoda:
Combining Audio Features and Visual I-Vector @ MediaEval 2015 Multimodal Person Discovery in Broadcast TV. MediaEval 2015 - [c73]Nakamasa Inoue, Koichi Shinoda
:
Vocabulary Expansion Using Word Vectors for Video Semantic Indexing. ACM Multimedia 2015: 851-854 - [c72]Nakamasa Inoue, Hai Dang Tran, Ryosuke Yamamoto, Koichi Shinoda:
TokyoTech at TRECVID 2015. TRECVID 2015 - 2014
- [c71]Tommi Kerola, Nakamasa Inoue, Koichi Shinoda:
Spectral Graph Skeletons for 3D Action Recognition. ACCV (4) 2014: 417-432 - [c70]Florian Metze, Koichi Shinoda:
Semantics for Large-Scale Multimedia: New Challenges for NLP. ACL (Tutorial Abstracts) 2014: 6 - [c69]Johan Rohdin
, Sangeeta Biswas
, Koichi Shinoda:
Constrained discriminative PLDA training for speaker verification. ICASSP 2014: 1670-1674 - [c68]Yuan Liang, Koji Iwano, Koichi Shinoda:
Simple gesture-based error correction interface for smartphone speech recognition. INTERSPEECH 2014: 1194-1198 - [c67]Nakamasa Inoue, Koichi Shinoda:
n-gram Models for Video Semantic Indexing. ACM Multimedia 2014: 777-780 - [c66]Zhuolin Liang, Nakamasa Inoue, Koichi Shinoda
:
Event Detection by Velocity Pyramid. MMM (1) 2014: 353-364 - [c65]Johan Rohdin, Sangeeta Biswas, Koichi Shinoda:
Discriminative PLDA training with application-specific loss functions for speaker verification. Odyssey 2014: 26-32 - [c64]Johan Rohdin, Sangeeta Biswas, Koichi Shinoda:
i-Vector Selection for Effective PLDA Modeling in Speaker Recognition. Odyssey 2014: 100-105 - [c63]Ryan Price, Ken-ichi Iso, Koichi Shinoda:
Speaker adaptation of deep neural networks using a hierarchy of output layers. SLT 2014: 153-158 - [c62]Yuan Liang, Koji Iwano, Koichi Shinoda:
An efficient error correction interface for speech recognition on mobile touchscreen devices. SLT 2014: 454-459 - [c61]Nakamasa Inoue, Zhuolin Liang, Mengxi Lin, Hai Dang Tran, Koichi Shinoda, Xuefeng Zhang, Kazuya Ueki:
TokyoTech-Waseda at TRECVID 2014. TRECVID 2014 - 2013
- [j19]Felipe Gómez-Caballero, Takahiro Shinozaki, Sadaoki Furui, Koichi Shinoda:
A statistical approach for person verification using human behavioral patterns. EURASIP J. Image Video Process. 2013: 44 (2013) - [j18]Yusuke Kamishima, Nakamasa Inoue, Koichi Shinoda:
Event detection in consumer videos using GMM supervectors and SVMs. EURASIP J. Image Video Process. 2013: 51 (2013) - [j17]Hilman Ferdinandus Pardede
, Koji Iwano, Koichi Shinoda:
Spectral Subtraction Based on Non-extensive Statistics for Speech Recognition. IEICE Trans. Inf. Syst. 96-D(8): 1774-1782 (2013) - [j16]Nakamasa Inoue, Koichi Shinoda
:
q-Gaussian mixture models for image and video semantic indexing. J. Vis. Commun. Image Represent. 24(8): 1450-1457 (2013) - [j15]Hilman Ferdinandus Pardede
, Koji Iwano, Koichi Shinoda
:
Feature normalization based on non-extensive statistics for speech recognition. Speech Commun. 55(5): 587-599 (2013) - [j14]Ryo Yokoyama
, Yu Nasu, Koji Iwano, Koichi Shinoda
:
Detection of overlapped speech using lapel microphones in meeting. Speech Commun. 55(10): 941-949 (2013) - [j13]Koichi Shinoda
, Nakamasa Inoue:
Reusing Speech Techniques for Video Semantic Indexing [Applications Corner]. IEEE Signal Process. Mag. 30(2): 118-122 (2013) - [c60]Nakamasa Inoue, Koichi Shinoda
:
Neighbor-to-Neighbor Search for Fast Coding of Feature Vectors. ICCV 2013: 1233-1240 - [c59]Felipe Gómez-Caballero, Takahiro Shinozaki, Sadaoki Furui, Koichi Shinoda
:
Statistical Person Verification Using Behavioral Patterns from Complex Human Motion. ICIAP Workshops 2013: 550-558 - [c58]Ryan Price, Sangeeta Biswas, Koichi Shinoda:
Combining deep speaker specific representations with GMM-SVM for speaker verification. INTERSPEECH 2013: 2788-2792 - [c57]Nakamasa Inoue, Kotaro Mori, Zhuolin Liang, Mengxi Lin, Koichi Shinoda, Shunsuke Sato:
TokyoTechCanon at TRECVID 2013. TRECVID 2013 - 2012
- [j12]Muhammad Rasyid Aqmar, Koichi Shinoda, Sadaoki Furui:
Robust Gait-Based Person Identification against Walking Speed Variations. IEICE Trans. Inf. Syst. 95-D(2): 668-676 (2012) - [j11]Takafumi Koshinaka, Kentaro Nagatomo, Koichi Shinoda:
Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model. IEICE Trans. Inf. Syst. 95-D(10): 2469-2478 (2012) - [j10]Hiroko Murakami, Koichi Shinoda, Sadaoki Furui:
Active Learning Using Phone-Error Distribution for Speech Modeling. IEICE Trans. Inf. Syst. 95-D(10): 2486-2494 (2012) - [j9]Nakamasa Inoue, Koichi Shinoda
:
A Fast and Accurate Video Semantic-Indexing System Using Fast MAP Adaptation and GMM Supervectors. IEEE Trans. Multim. 14(4): 1196-1205 (2012) - [c56]Nakamasa Inoue, Koichi Shinoda
:
q-Gaussian Mixture Models Based on Non-extensive Statistics for Image and Video Semantic Indexing. ACCV (2) 2012: 499-510 - [c55]Muhammad Rasyid Aqmar, Koichi Shinoda, Sadaoki Furui:
Efficient model training for HMM-based person identification by gait. APSIPA 2012: 1-4 - [c54]Takuya Tsutaoka, Koichi Shinoda:
Acoustic model training using committee-based active and semi-supervised learning for speech recognition. APSIPA 2012: 1-4 - [c53]Yusuke Kamishima, Nakamasa Inoue, Koichi Shinoda
, Shunsuke Sato:
Multimedia event detection using GMM supervectors and SVMS. ICIP 2012: 3089-3092 - [c52]Hilman Ferdinandus Pardede, Koichi Shinoda, Koji Iwano:
Q-Gaussian based spectral subtraction for robust speech recognition. INTERSPEECH 2012: 1255-1258 - [c51]Ryo Yokoyama, Yu Nasu, Koichi Shinoda, Koji Iwano:
Overlapped Speech Detection in Meeting Using Cross-Channel Spectral Subtraction and Spectrum Similarity. INTERSPEECH 2012: 1500-1503 - [c50]Nakamasa Inoue, Yusuke Kamishima, Kotaro Mori, Koichi Shinoda:
TokyoTechCanon at TRECVID 2012. TRECVID 2012 - 2011
- [j8]Yuzo Hamanaka, Koichi Shinoda, Takuya Tsutaoka, Sadaoki Furui, Tadashi Emori, Takafumi Koshinaka:
Committee-Based Active Learning for Speech Recognition. IEICE Trans. Inf. Syst. 94-D(10): 2015-2023 (2011) - [j7]Koichi Shinoda, Yasushi Watanabe, Kenji Iwata, Yuan Liang, Ryuta Nakagawa, Sadaoki Furui:
Semi-synchronous speech and pen input for mobile user interfaces. Speech Commun. 53(3): 283-291 (2011) - [c49]Hiroko Murakami, Koichi Shinoda, Sadaoki Furui:
Designing text corpus using phone-error distribution for acoustic modeling. ASRU 2011: 191-195 - [c48]Yu Nasu, Koichi Shinoda, Sadaoki Furui:
Cross-Channel Spectral Subtraction for meeting speech recognition. ICASSP 2011: 4812-4815 - [c47]Marc Ferras, Koichi Shinoda, Sadaoki Furui:
Structural MAP adaptation in GMM-supervector based speaker recognition. ICASSP 2011: 5432-5435 - [c46]Hilman Ferdinandus Pardede, Koichi Shinoda:
Generalized-Log Spectral Mean Normalization for Speech Recognition. INTERSPEECH 2011: 1645-1648 - [c45]Marc Ferras, Koichi Shinoda, Sadaoki Furui:
Structural Joint Factor Analysis for Speaker Recognition. INTERSPEECH 2011: 2373-2376 - [c44]Sangeeta Biswas, Marc Ferras, Koichi Shinoda, Sadaoki Furui:
Acoustic Forest for SMAP-Based Speaker Verification. INTERSPEECH 2011: 2377-2380 - [c43]Felipe Gómez-Caballero, Takahiro Shinozaki, Sadaoki Furui, Koichi Shinoda:
Person authentication using 3D human motion. J-HGBU@MM 2011: 35-40 - [c42]Nakamasa Inoue, Koichi Shinoda
:
A fast MAP adaptation technique for gmm-supervector-based video semantic indexing systems. ACM Multimedia 2011: 1357-1360 - [c41]Nakamasa Inoue, Toshiya Wada, Yusuke Kamishima, Koichi Shinoda, Shunsuke Sato:
TokyoTech+Canon at TRECVID 2011. TRECVID 2011 - 2010
- [j6]Koichi Shinoda:
Acoustic Model Adaptation for Speech Recognition. IEICE Trans. Inf. Syst. 93-D(9): 2348-2362 (2010) - [c40]Yuzo Hamanaka, Koichi Shinoda, Sadaoki Furui, Tadashi Emori, Takafumi Koshinaka:
Speech modeling based on committee-based active learning. ICASSP 2010: 4350-4353 - [c39]Muhammad Rasyid Aqmar, Koichi Shinoda, Sadaoki Furui:
Robust Gait Recognition Against Speed Variation. ICPR 2010: 2190-2193 - [c38]Nakamasa Inoue, Tatsuhiko Saito, Koichi Shinoda, Sadaoki Furui:
High-Level Feature Extraction Using SIFT GMMs and Audio Models. ICPR 2010: 3220-3223 - [c37]Hitoshi Yamamoto, Ken Hanazawa, Kiyokazu Miki, Koichi Shinoda:
Dynamic language model adaptation using keyword category classification. INTERSPEECH 2010: 2426-2429 - [c36]Nakamasa Inoue, Toshiya Wada, Yusuke Kamishima, Koichi Shinoda, Ilseo Kim, Byungki Byun, Chin-Hui Lee:
TT+GT at TRECVID 2010 Workshop. TRECVID 2010
2000 – 2009
- 2009
- [c35]Takafumi Koshinaka, Kentaro Nagatomo, Koichi Shinoda:
Online speaker clustering using incremental learning of an ergodic hidden Markov model. ICASSP 2009: 4093-4096 - [c34]Hsin-Lung Hsieh, Jen-Tzung Chien
, Koichi Shinoda, Sadaoki Furui:
Independent component analysis for noisy speech recognition. ICASSP 2009: 4369-4372 - [c33]Koichi Shinoda, Hiroko Murakami, Sadaoki Furui:
Speaker adaptation based on two-step active learning. INTERSPEECH 2009: 576-579 - [c32]Agnieszka Betkowska Cavalcante, Koichi Shinoda, Sadaoki Furui:
Robust Speech Recognition in the Car Environment. LTC 2009: 24-34 - [c31]Nakamasa Inoue, Shanshan Hao, Tatsuhiko Saito, Koichi Shinoda, Ilseo Kim, Chin-Hui Lee:
TITGT at TRECVID 2009 Workshop. TRECVID 2009 - 2008
- [c30]Shutaro Tanji, Koichi Shinoda, Sadaoki Furui, Antonio Ortega:
Improvement of eigenvoice-based speaker adaptation by parameter space clustering. INTERSPEECH 2008: 1229-1232 - [c29]Kenji Iwata, Koichi Shinoda, Sadaoki Furui:
Robust spoken term detection using combination of phone-based and word-based recognition. INTERSPEECH 2008: 2195-2198 - [c28]Yasushi Watanabe, Koichi Shinoda, Sadaoki Furui:
Time-lag adaptation for semi-synchronous speech and pen input. INTERSPEECH 2008: 2675-2678 - [c27]Koichi Shinoda, Kazuki Ishihara, Sadaoki Furui, Takahiro Mochizuki:
Automatic Score Scene Detection for Baseball Video. LKR 2008: 226-240 - [c26]Koji Yamasaki, Koichi Shinoda, Sadaoki Furui:
Automatically estimating number of scenes for rushes summarization. TVS 2008: 129-133 - [c25]Shanshan Hao, Yusuke Yoshizawa, Koji Yamasaki, Koichi Shinoda, Sadaoki Furui:
Tokyo Tech at TRECVID 2008. TRECVID 2008 - 2007
- [j5]Agnieszka Betkowska, Koichi Shinoda
, Sadaoki Furui:
Robust Speech Recognition Using Factorial HMMs for Home Environments. EURASIP J. Adv. Signal Process. 2007 (2007) - [c24]Ryoichi Ando, Koichi Shinoda, Sadaoki Furui, Takahiro Mochizuki:
A robust scene recognition system for baseball broadcast using data-driven approach. CIVR 2007: 186-193 - [c23]Agnieszka Betkowska, Koichi Shinoda, Sadaoki Furui:
Home-environment adaptation of phoneme factorial hidden Markov models. EUSIPCO 2007: 2380-2384 - [c22]Yasushi Watanabe, Kenji Iwata, Ryuta Nakagawa, Koichi Shinoda, Sadaoki Furui:
Semi-Synchronous Speech and Pen Input. ICASSP (4) 2007: 409-412 - [c21]Agnieszka Betkowska, Koichi Shinoda, Sadaoki Furui:
Speech Recognition using FHMMS Robust Against Nonstationary Noise. ICASSP (4) 2007: 1029-1032 - [c20]Jen-Tzung Chien
, Koichi Shinoda, Sadaoki Furui:
Predictive minimum Bayes risk classification for robust speech recognition. INTERSPEECH 2007: 1062-1065 - [c19]Tadashi Emori, Yoshifumi Onishi, Koichi Shinoda:
Automatic estimation of scaling factors among probabilistic models in speech recognition. INTERSPEECH 2007: 1453-1456 - [c18]Hiroki Yamazaki, Koji Iwano, Koichi Shinoda, Sadaoki Furui, Haruo Yokota:
Dynamic language model adaptation using presentation slides for lecture speech recognition. INTERSPEECH 2007: 2349-2352 - [c17]Taichi Nakamura, Koichi Shinoda, Sadaoki Furui:
TokyoTech's TRECVID2007 Notebook. TRECVID 2007 - 2006
- [j4]Nguyen Huu Bach, Koichi Shinoda, Sadaoki Furui:
Robust Scene Extraction Using Multi-Stream HMMs for Baseball Broadcast. IEICE Trans. Inf. Syst. 89-D(9): 2553-2561 (2006) - [c16]Jen-Tzung Chien
, Chih-Hsien Huang, Koichi Shinoda, Sadaoki Furui:
Towards Optimal Bayes Decision for Speech Recognition. ICASSP (1) 2006: 45-48 - [c15]Ryoichi Ando, Koichi Shinoda, Sadaoki Furui, Takahiro Mochizuki:
Robust scene recognition using language models for scene contexts. Multimedia Information Retrieval 2006: 99-106 - [c14]Taichi Nakamura, Yuichi Miyamura, Koichi Shinoda, Sadaoki Furui:
TokyoTech's TRECVID2006 Notebook. TRECVID 2006 - 2005
- [c13]Nguyen Huu Bach, Koichi Shinoda, Sadaoki Furui:
Robust highlight extraction using multi-stream hidden Markov models for baseball video. ICIP (3) 2005: 173-176 - 2002
- [j3]Tadashi Emori, Koichi Shinoda:
Vocal tract length normalization using rapid maximum-likelihood estimation for speech recognition. Syst. Comput. Jpn. 33(5): 30-40 (2002) - [c12]Koichi Shinoda, Ken-ichi Iso:
Efficient reduction of Gaussian components using MDL criterion for HMM-based speech recognition. ICASSP 2002: 869-872 - 2001
- [j2]Koichi Shinoda, Chin-Hui Lee:
A structural Bayes approach to speaker adaptation. IEEE Trans. Speech Audio Process. 9(3): 276-287 (2001) - [c11]Tadashi Emori, Koichi Shinoda:
Rapid vocal tract length normalization using maximum likelihood estimation. INTERSPEECH 2001: 1649-1652 - 2000
- [j1]Koichi Shinoda, Mieko Yamada:
A family of Hadamard matrices of dihedral group type. Discret. Appl. Math. 102(1-2): 141-150 (2000)
1990 – 1999
- 1998
- [c10]Koichi Shinoda, Chin-Hui Lee:
Unsupervised adaptation using structural Bayes approach. ICASSP 1998: 793-796 - 1997
- [c9]Koichi Shinoda, Takao Watanabe:
Acoustic modeling based on the MDL principle for speech recognition. EUROSPEECH 1997: 99-102 - 1996
- [c8]Koichi Shinoda, Takao Watanabe:
Speaker adaptation with autonomous model complexity control by MDL principle. ICASSP 1996: 717-720 - [c7]Keizaburo Takagi, Koichi Shinoda, Hiroaki Hattori, Takao Watanabe:
Unsupervised and incremental speaker adaptation under adverse environmental conditions. ICSLP 1996: 2079-2082 - 1995
- [c6]Takao Watanabe, Koichi Shinoda, Keizaburo Takagi, Ken-ichi Iso:
High speed speech recognition using tree-structured probability density function. ICASSP 1995: 556-559 - [c5]Koichi Shinoda, Takao Watanabe:
Speaker adaptation with autonomous control using tree structure. EUROSPEECH 1995: 1143-1146 - 1994
- [c4]Takao Watanabe, Koichi Shinoda, Keizaburo Takagi, Eiko Yamada:
Speech recognition using tree-structured probability density function. ICSLP 1994: 223-226 - [c3]Koichi Shinoda, Takao Watanabe:
Unsupervised speaker adaptation for speech recognition using demi-syllable HMM. ICSLP 1994: 435-438 - 1991
- [c2]Koichi Shinoda, Ken-ichi Iso, Takao Watanabe:
Speaker adaptation for demi-syllable based continuous density HMM. ICASSP 1991: 857-860 - 1990
- [c1]Koichi Shinoda, Ken-ichi Iso, Takao Watanabe:
Speaker adaptation for demi-syllable based speech recognition using continuous HMM. ICSLP 1990: 261-264
Coauthor Index
![](https://dblp.uni-trier.de./img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-21 00:10 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint