default search action
Yoshihiko Nankaku
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c103]Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model. ICASSP 2024: 12782-12786 - [i10]Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model. CoRR abs/2402.14692 (2024) - 2023
- [c102]Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Singing Voice Synthesis Based on a Musical Note Position-Aware Attention Mechanism. ICASSP 2023: 1-5 - [c101]Takenori Yoshimura, Shinji Takaki, Kazuhiro Nakamura, Keiichiro Oura, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis System. ICASSP 2023: 1-5 - [i9]Miku Nishihara, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Singing voice synthesis based on frame-level sequence-to-sequence models considering vocal timing deviation. CoRR abs/2301.02262 (2023) - 2022
- [c100]Yasutaka Nakamura, Seiichi Harata, Takuto Sakuma, Yoshihiro Tanaka, Yoshihiko Nankaku, Shohei Kato:
Enhancing Social Telepresence on Text Communication Using Robot Avatar that Reflects User's Chatting States. GCCE 2022: 707-709 - [c99]Takato Fujimoto, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Autoregressive Variational Autoencoder with a Hidden Semi-Markov Model-Based Structured Attention for Speech Synthesis. ICASSP 2022: 7462-7466 - [c98]Kentaro Mitsui, Tianyu Zhao, Kei Sawada, Yukiya Hono, Yoshihiko Nankaku, Keiichi Tokuda:
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue. INTERSPEECH 2022: 2328-2332 - [i8]Kentaro Mitsui, Tianyu Zhao, Kei Sawada, Yukiya Hono, Yoshihiko Nankaku, Keiichi Tokuda:
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue. CoRR abs/2206.12040 (2022) - [i7]Takenori Yoshimura, Shinji Takaki, Kazuhiro Nakamura, Keiichiro Oura, Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System. CoRR abs/2211.11222 (2022) - [i6]Yukiya Hono, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Singing Voice Synthesis Based on a Musical Note Position-Aware Attention Mechanism. CoRR abs/2212.13703 (2022) - 2021
- [j21]Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components. IEEE Access 9: 137599-137612 (2021) - [j20]Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2803-2815 (2021) - [c97]Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Periodnet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components. ICASSP 2021: 6049-6053 - [i5]Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components. CoRR abs/2102.07786 (2021) - [i4]Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System. CoRR abs/2108.02776 (2021) - 2020
- [c96]Kazuhiro Nakamura, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Fast and High-Quality Singing Voice Synthesis System Based on Convolutional Neural Networks. ICASSP 2020: 7239-7243 - [c95]Takato Fujimoto, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Semi-Supervised Learning Based on Hierarchical Generative Models for End-to-End Speech Synthesis. ICASSP 2020: 7644-7648 - [c94]Yukiya Hono, Kazuna Tsuboi, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis. INTERSPEECH 2020: 3441-3445 - [i3]Yukiya Hono, Kazuna Tsuboi, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis. CoRR abs/2009.08474 (2020)
2010 – 2019
- 2019
- [c93]Yukiya Hono, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Singing Voice Synthesis Based on Generative Adversarial Networks. ICASSP 2019: 6955-6959 - [c92]Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Speaker-dependent Wavenet-based Delay-free Adpcm Speech Coding. ICASSP 2019: 7145-7149 - [c91]Keiichiro Oura, Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Deep neural network based real-time speech vocoder with periodic and aperiodic inputs. SSW 2019: 13-18 - [c90]Takato Fujimoto, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis. SSW 2019: 166-171 - [c89]Motoki Shimada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Low computational cost speech synthesis based on deep neural networks using hidden semi-Markov model structures. SSW 2019: 177-182 - [i2]Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Singing voice synthesis based on convolutional neural networks. CoRR abs/1904.06868 (2019) - [i1]Kazuhiro Nakamura, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks. CoRR abs/1910.11690 (2019) - 2018
- [j19]Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Mel-Cepstrum-Based Quantization Noise Shaping Applied to Neural-Network-Based Speech Waveform Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1173-1180 (2018) - [c88]Takayuki Kasugai, Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Image Recognition Based on Convolutional Neural Networks Using Features Generated from Separable Lattice Hidden Markov Models. APSIPA 2018: 324-328 - [c87]Kento Nakao, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Speaker Adaptation for Speech Synthesis Based on Deep Neural Networks Using Hidden Semi-Markov Model Structures. APSIPA 2018: 638-643 - [c86]Takato Fujimoto, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Speech Synthesis Using WaveNet Vocoder Based on Periodic/Aperiodic Decomposition. APSIPA 2018: 644-648 - [c85]Yukiya Hono, Shumma Murata, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Recent Development of the DNN-based Singing Voice Synthesis System - Sinsy. APSIPA 2018: 1003-1009 - [c84]Takenori Yoshimura, Natsumi Koike, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Discriminative Feature Extraction Based on Sequential Variational Autoencoder for Speaker Recognition. APSIPA 2018: 1742-1746 - [c83]Koki Senda, Yukiya Hono, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Singing Voice Conversion Using Posted Waveform Data on Music Social Media. APSIPA 2018: 1913-1917 - [c82]Kei Sawada, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
The NITech text-to-speech system for the Blizzard Challenge 2018. Blizzard Challenge 2018 - [c81]Eiji Ichikawa, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Image Recognition Based on Separable Lattice Hmms Using a Deep Neural Network for Output Probability Distributions. ICASSP 2018: 3021-3025 - [c80]Jumpei Niwa, Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Statistical Voice Conversion Based on Wavenet. ICASSP 2018: 5289-5293 - [c79]Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
WaveNet-Based Zero-Delay Lossless Speech Coding. SLT 2018: 153-158 - 2017
- [j18]Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 25(9): 1836-1845 (2017) - [c78]Yoshinari Tsuzuki, Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Image recognition based on discriminative models using features generated from separable lattice HMMS. ICASSP 2017: 2607-2611 - [c77]Amelia Jane Gully, Takenori Yoshimura, Damian T. Murphy, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Articulatory Text-to-Speech Synthesis Using the Digital Waveguide Mesh Driven by a Deep Neural Network. INTERSPEECH 2017: 234-238 - [p1]Keiichi Tokuda, Akinobu Lee, Yoshihiko Nankaku, Keiichiro Oura, Kei Hashimoto, Daisuke Yamamoto, Ichi Takumi, Takahiro Uchiya, Shuhei Tsutsumi, Steve Renals, Junichi Yamagishi:
User Generated Dialogue Systems: uDialogue. Human-Harmonized Information Technology (2) 2017: 77-114 - 2016
- [j17]Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
A Bayesian Approach to Image Recognition Based on Separable Lattice Hidden Markov Models. IEICE Trans. Inf. Syst. 99-D(12): 3119-3131 (2016) - [c76]Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Trajectory training considering global variance for speech synthesis based on neural networks. ICASSP 2016: 5600-5604 - [c75]Naoki Hosaka, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Voice Conversion Based on Trajectory Model Training of Neural Networks Considering Global Variance. INTERSPEECH 2016: 307-311 - [c74]Masanari Nishimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Singing Voice Synthesis Based on Deep Neural Networks. INTERSPEECH 2016: 2478-2482 - [c73]Rasmus Dall, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Redefining the Linguistic Context Feature Set for HMM and DNN TTS Through Position and Parsing. INTERSPEECH 2016: 2851-2855 - [c72]Keiichi Tokuda, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku:
Temporal modeling in neural network based statistical parametric speech synthesis. SSW 2016: 106-111 - 2015
- [c71]Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
The effect of neural networks in statistical parametric speech synthesis. ICASSP 2015: 4455-4459 - [c70]Takenori Yoshimura, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Simultaneous optimization of multiple tree structures for factor analyzed HMM-based speech synthesis. INTERSPEECH 2015: 1196-1200 - [c69]Siva Reddy Gangireddy, Steve Renals, Yoshihiko Nankaku, Akinobu Lee:
Prosodically-enhanced recurrent neural network language models. INTERSPEECH 2015: 2390-2394 - 2014
- [j16]Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Integration of Spectral Feature Extraction and Modeling for HMM-Based Speech Synthesis. IEICE Trans. Inf. Syst. 97-D(6): 1438-1448 (2014) - [j15]Akira Tamamori, Yoshihiko Nankaku, Keiichi Tokuda:
Image Recognition Based on Separable Lattice Trajectory 2-D HMMs. IEICE Trans. Inf. Syst. 97-D(7): 1842-1854 (2014) - [j14]Shinji Takaki, Yoshihiko Nankaku, Keiichi Tokuda:
Contextual Additive Structure for HMM-Based Speech Synthesis. IEEE J. Sel. Top. Signal Process. 8(2): 229-238 (2014) - [c68]Kazuhiro Nakamura, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
HMM-Based singing voice synthesis and its application to Japanese and English. ICASSP 2014: 265-269 - [c67]Kanako Shirota, Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis. ICASSP 2014: 2559-2563 - [c66]Kazuhiro Nakamura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech. INTERSPEECH 2014: 2494-2498 - 2013
- [j13]Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
A Bayesian Framework Using Multiple Model Structures for Speech Recognition. IEICE Trans. Inf. Syst. 96-D(4): 939-948 (2013) - [j12]Keiichi Tokuda, Yoshihiko Nankaku, Tomoki Toda, Heiga Zen, Junichi Yamagishi, Keiichiro Oura:
Speech Synthesis Based on Hidden Markov Models. Proc. IEEE 101(5): 1234-1252 (2013) - [c65]Kei Sawada, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Image recognition based on hidden Markov eigen-image models using variational Bayesian method. APSIPA 2013: 1-8 - [c64]Takaya Makino, Shinji Takaki, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Separable lattice 2-D HMMS introducing state duration control for recognition of images with various variations. ICASSP 2013: 3203-3207 - [c63]Akira Tamamori, Yoshihiko Nankaku, Keiichi Tokuda:
Image recognition based on separable lattice trajectory 2-D HMMS. ICASSP 2013: 3467-3471 - [c62]Shinji Takaki, Yoshihiko Nankaku, Keiichi Tokuda:
Contextual partial additive structure for HMM-based speech synthesis. ICASSP 2013: 7878-7882 - [c61]Kazuhiro Nakamura, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Integration of acoustic modeling and mel-cepstral analysis for HMM-based speech synthesis. ICASSP 2013: 7883-7887 - [c60]Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis. SSW 2013: 297-302 - 2012
- [j11]Akira Tamamori, Yoshihiko Nankaku, Keiichi Tokuda:
An Extension of Separable Lattice 2-D HMMs for Rotational Data Variations. IEICE Trans. Inf. Syst. 95-D(8): 2074-2083 (2012) - [j10]Heiga Zen, Mark J. F. Gales, Yoshihiko Nankaku, Keiichi Tokuda:
Product of Experts for Statistical Parametric Speech Synthesis. IEEE Trans. Speech Audio Process. 20(3): 794-805 (2012) - [c59]Kei Sawada, Akira Tamamori, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Face recognition based on separable lattice 2-D HMMS using variational bayesian method. ICASSP 2012: 2205-2208 - [c58]Keisuke Kumaki, Yoshihiko Nankaku, Keiichi Tokuda:
Face recognition based on extended separable lattice 2-D HMMS. ICASSP 2012: 2209-2212 - [c57]Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
A model structure integration based on a Bayesian framework for speech recognition. ICASSP 2012: 4813-4816 - [c56]Keiichiro Oura, Ayami Mase, Yoshihiko Nankaku, Keiichi Tokuda:
Pitch adaptive training for hmm-based singing voice synthesis. ICASSP 2012: 5377-5380 - [c55]Viviane de Franca Oliveira, Sayaka Shiota, Yoshihiko Nankaku, Keiichi Tokuda:
Cross-lingual Speaker Adaptation for HMM-based Speech Synthesis based on Perceptual Characteristics and Speaker Interpolation. INTERSPEECH 2012: 983-986 - [c54]Takafumi Hattori, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
A Bayesian Approach to Speaker Recognition Based on GMMs Using Multiple Model Structures. INTERSPEECH 2012: 1107-1110 - 2011
- [j9]Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Bayesian Context Clustering Using Cross Validation for Speech Recognition. IEICE Trans. Inf. Syst. 94-D(3): 668-678 (2011) - [j8]Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda:
Continuous Stochastic Feature Mapping Based on Trajectory HMMs. IEEE Trans. Speech Audio Process. 19(2): 417-430 (2011) - [c53]Shinji Takaki, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
An optimization algorithm of independent mean and variance parameter tying structures for HMM-based speech synthesis. ICASSP 2011: 4700-4703 - [c52]Shifeng Pan, Yoshihiko Nankaku, Keiichi Tokuda, Jianhua Tao:
Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis. ICASSP 2011: 4716-4719 - [c51]Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis. INTERSPEECH 2011: 113-116 - [c50]Lei Li, Yoshihiko Nankaku, Keiichi Tokuda:
A Bayesian Approach to Voice Conversion Based on GMMs Using Multiple Model Structures. INTERSPEECH 2011: 661-664 - [c49]Ulpu Remes, Yoshihiko Nankaku, Keiichi Tokuda:
GMM-Based Missing-Feature Reconstruction on Multi-Frame Windows. INTERSPEECH 2011: 1665-1668 - [c48]Ling-Hui Chen, Yoshihiko Nankaku, Heiga Zen, Keiichi Tokuda, Zhen-Hua Ling, Li-Rong Dai:
Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis. INTERSPEECH 2011: 1801-1804 - [c47]Naoaki Ito, Yoshihiko Nankaku, Akinobu Lee:
Evaluation of Tree-Trellis Based Decoding in Over-Million LVCSR. INTERSPEECH 2011: 1937-1940 - 2010
- [j7]Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
A Covariance-Tying Technique for HMM-Based Speech Synthesis. IEICE Trans. Inf. Syst. 93-D(3): 595-601 (2010) - [c46]Pascual Martínez-Gómez, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, Germán Sanchis-Trilles:
A Deterministic Annealing-Based Training Algorithm For Statistical Machine Translation Models. EAMT 2010 - [c45]Yoshiaki Takahashi, Akira Tamamori, Yoshihiko Nankaku, Keiichi Tokuda:
Face recognition based on separable lattice 2-D HMM with state duration modeling. ICASSP 2010: 2162-2165 - [c44]Akira Tamamori, Yoshihiko Nankaku, Keiichi Tokuda:
An extension of Separable Lattice 2-D HMMS for rotational data variations. ICASSP 2010: 2206-2209 - [c43]Kyosuke Kazumi, Yoshihiko Nankaku, Keiichi Tokuda:
Factor analyzed voice models for HMM-based speech synthesis. ICASSP 2010: 4234-4237 - [c42]Heiga Zen, Mark J. F. Gales, Yoshihiko Nankaku, Keiichi Tokuda:
Statistical parametric speech synthesis based on product of experts. ICASSP 2010: 4242-4245 - [c41]Toyohiro Hayashi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Speaker adaptation based on nonlinear spectral transform for speech recognition. INTERSPEECH 2010: 542-545 - [c40]Ayami Mase, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda:
HMM-based singing voice synthesis system using pitch-shifted pseudo training data. INTERSPEECH 2010: 845-848 - [c39]Akira Saito, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Voice activity detection based on conditional random fields using multiple features. INTERSPEECH 2010: 2086-2089 - [c38]Shinji Takaki, Yoshihiko Nankaku, Keiichi Tokuda:
Spectral modeling with contextual additive structure for HMM-based speech synthesis. SSW 2010: 100-105 - [c37]Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Bayesian speech synthesis framework integrating training and synthesis processes. SSW 2010: 106-111 - [c36]Keiichiro Oura, Ayami Mase, Tomohiko Yamada, Satoru Muto, Yoshihiko Nankaku, Keiichi Tokuda:
Recent development of the HMM-based singing voice synthesis system - Sinsy. SSW 2010: 211-216
2000 – 2009
- 2009
- [c35]Kaori Yutani, Yosuke Uto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Voice conversion based on simultaneous modelling of spectrum and F0. ICASSP 2009: 3897-3900 - [c34]Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Takashi Masuko, Keiichi Tokuda:
A Bayesian approach to HMM-based speech synthesis. ICASSP 2009: 4029-4032 - [c33]Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda:
Stereo-based stochastic noise compensation based on trajectory GMMS. ICASSP 2009: 4577-4580 - [c32]Yi-Jian Wu, Yoshihiko Nankaku, Keiichi Tokuda:
State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis. INTERSPEECH 2009: 528-531 - [c31]Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
Deterministic annealing based training algorithm for Bayesian speech recognition. INTERSPEECH 2009: 680-683 - [c30]Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda:
A Bayesian approach to Hidden Semi-Markov Model based speech synthesis. INTERSPEECH 2009: 1751-1754 - [c29]Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems. INTERSPEECH 2009: 1759-1762 - 2008
- [j6]Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System. IEICE Trans. Inf. Syst. 91-D(11): 2693-2700 (2008) - [c28]Yoshihiko Nankaku, Kazuhiro Nakamura, Heiga Zen, Keiichi Tokuda:
Acoustic modeling with contextual additive structure for HMM-based speech recognition. ICASSP 2008: 4469-4472 - [c27]Yoshitaka Yoshimi, Ryota Kakitsuba, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Probabilistic answer selection based on conditional random fields for spoken dialog system. INTERSPEECH 2008: 215-218 - [c26]Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Acoustic modeling based on model structure annealing for speech recognition. INTERSPEECH 2008: 932-935 - [c25]Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition. INTERSPEECH 2008: 936-939 - [c24]Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda:
Probabilistic feature mapping based on trajectory HMMs. INTERSPEECH 2008: 1068-1071 - [c23]Kaori Yutani, Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Keiichi Tokuda:
Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching. INTERSPEECH 2008: 1072-1075 - [c22]Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Speaker recognition based on variational Bayesian method. INTERSPEECH 2008: 1417-1420 - [c21]Keiichiro Oura, Yoshihiko Nankaku, Tomoki Toda, Keiichi Tokuda, Ranniery Maia, Shinsuke Sakai, Satoshi Nakamura:
Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems. ISCSLP 2008: 1-4 - 2007
- [c20]Yoshihiko Nankaku, Keiichi Tokuda:
Face Recognition using Hidden Markov Eigenface Models. ICASSP (2) 2007: 469-472 - [c19]Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda:
A trainable excitation model for HMM-based speech synthesis. INTERSPEECH 2007: 1909-1912 - [c18]Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda:
Model-space MLLR for trajectory HMMs. INTERSPEECH 2007: 2065-2068 - [c17]Ranniery Maia, Tomoki Toda, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda:
An excitation model for HMM-based speech synthesis based on residual modeling. SSW 2007: 131-136 - [c16]Yoshihiko Nankaku, Kenichi Nakamura, Tomoki Toda, Keiichi Tokuda:
Spectral conversion based on statistical models including time-sequence matching. SSW 2007: 333-338 - 2006
- [c15]Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer. ICASSP (1) 2006: 33-36 - [c14]Kenichi Nakamura, Tomoki Toda, Yoshihiko Nankaku, Keiichi Tokuda:
On the Use of Phonetic Information for Mapping from Articulatory Movements to Vocal Tract Spectrum. ICASSP (1) 2006: 93-96 - [c13]Daisuke Kurata, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura, Zoubin Ghahramani:
Face Recognition Based on Separable Lattice HMMS. ICASSP (5) 2006: 737-740 - [c12]Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura:
Estimating Trajectory Hmm Parameters Using Monte Carlo Em With Gibbs Sampler. ICASSP (1) 2006: 1173-1176 - [c11]Tomohiro Hakamata, Akinobu Lee, Yoshihiko Nankaku, Keiichi Tokuda:
Reducing computation on parallel decoding using frame-wise confidence scores. INTERSPEECH 2006 - [c10]Keijiro Saino, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
An HMM-based singing voice synthesis system. INTERSPEECH 2006 - [c9]Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Akinobu Lee, Keiichi Tokuda:
Voice conversion based on mixtures of factor analyzers. INTERSPEECH 2006 - [c8]Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura:
Speaker adaptation of trajectory HMMs using feature-space MLLR. INTERSPEECH 2006 - 2005
- [j5]Amaro A. de Lima, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura, Fernando Gil Resende:
Applying Sparse KPCA for Feature Extraction in Speech Recognition. IEICE Trans. Inf. Syst. 88-D(3): 401-409 (2005) - [j4]Hiroyuki Suzuki, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
Continuous Speech Recognition Based on General Factor Dependent Acoustic Models. IEICE Trans. Inf. Syst. 88-D(3): 410-417 (2005) - [j3]Hiroyoshi Yamamoto, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
Parameter Sharing in Mixture of Factor Analyzers for Speaker Identification. IEICE Trans. Inf. Syst. 88-D(3): 418-424 (2005) - [j2]Yohei Itaya, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
Deterministic Annealing EM Algorithm in Acoustic Modeling for Speaker and Speech Recognition. IEICE Trans. Inf. Syst. 88-D(3): 425-431 (2005) - [c7]Amaro A. de Lima, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura, Fernando Gil Resende:
Sparse KPCA for Feature Extraction in Speech Recognition. ICASSP (1) 2005: 353-356 - 2004
- [j1]Amaro A. de Lima, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
On the Use of Kernel PCA for Feature Extraction in Speech Recognition. IEICE Trans. Inf. Syst. 87-D(12): 2802-2811 (2004) - [c6]Hiroyoshi Yamamoto, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
Parameter sharing and minimum classification error training of mixtures of factor analyzers for speaker identification. ICASSP (1) 2004: 29-32 - [c5]Yohei Itaya, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
Deterministic annealing EM algorithm in parameter estimation for acoustic model. INTERSPEECH 2004: 433-436 - 2003
- [c4]Hiroyuki Suzuki, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
Speech recognition using voice-characteristic-dependent acoustic models. ICASSP (1) 2003: 740-743 - [c3]Amaro A. de Lima, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
On the use of kernel PCA for feature extraction in speech recognition. INTERSPEECH 2003: 2625-2628 - 2000
- [c2]Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura, Takao Kobayashi:
Normalized Training for HMM-Based Visual Speech Recognition. ICIP 2000: 234-237
1990 – 1999
- 1999
- [c1]Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura:
Intensity- and location-normalized training for HMM-based visual speech recognition. EUROSPEECH 1999: 1287-1290
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:09 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint