default search action

combined dblp search
author search
venue search
publication search

ask others

Masami Akamine

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2016
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/OhtaniTMA16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/OhtaniTMA16
Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Masami Akamine:
Statistical Bandwidth Extension for Speech Synthesis Based on Gaussian Mixture Model with Sub-Band Basis Spectrum Model. IEICE Trans. Inf. Syst. 99-D(10): 2481-2489 (2016)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/DrugmanSKA16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/DrugmanSKA16
Thomas Drugman, Yannis Stylianou, Yusuke Kida, Masami Akamine:
Voice Activity Detection: Merging Source and Filter-based Information. IEEE Signal Process. Lett. 23(2): 252-256 (2016)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZorilaSIA16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZorilaSIA16
Tudor-Catalin Zorila, Yannis Stylianou, Tatsuma Ishihara, Masami Akamine:
Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time-Frequency Energy Reallocation Approach. IEEE ACM Trans. Audio Speech Lang. Process. 24(10): 1808-1818 (2016)
2014
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/MaiaA14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/MaiaA14
Ranniery Maia, Masami Akamine:
On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis. Comput. Speech Lang. 28(5): 1209-1232 (2014)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/WanLYBCGA14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/WanLYBCGA14
Vincent Wan, Javier Latorre, Kayoko Yanagisawa, Norbert Braunschweiler, Langzhou Chen, Mark J. F. Gales, Masami Akamine:
Building HMM-TTS Voices on Diverse Data. IEEE J. Sel. Top. Signal Process. 8(2): 296-306 (2014)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ChenGBAK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ChenGBAK14
Langzhou Chen, Mark J. F. Gales, Norbert Braunschweiler, Masami Akamine, Kate M. Knill:
Integrated Expression Prediction and Speech Synthesis From Text. IEEE J. Sel. Top. Signal Process. 8(2): 323-335 (2014)
2013
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MaiaAG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MaiaAG13
Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum for statistical parametric speech synthesis. Speech Commun. 55(5): 606-618 (2013)
2012
[j4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/AkamineA12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/AkamineA12
Masami Akamine, Jitendra Ajmera:
Decision tree-based acoustic models for speech recognition. EURASIP J. Audio Speech Music. Process. 2012: 10 (2012)
2011
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/AkamineA11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/AkamineA11
Masami Akamine, Jitendra Ajmera:
Decision Tree-Based Acoustic Models for Speech Recognition with Improved Smoothness. IEICE Trans. Inf. Syst. 94-D(11): 2250-2258 (2011)
2007
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/scjapan/KagoshimaMSAS07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/scjapan/KagoshimaMSAS07
Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine, Yoshinori Shiga:
An F₀ contour control model using an F₀ contour codebook. Syst. Comput. Jpn. 38(1): 62-72 (2007)
1999
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/scjapan/KagoshimaA99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/scjapan/KagoshimaA99
Takehiko Kagoshima, Masami Akamine:
Automatic generation of synthesis units by unit selection based on closed-loop training. Syst. Comput. Jpn. 30(9): 1-7 (1999)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/IwataYFA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/IwataYFA19
Kenji Iwata, Takami Yoshida, Hiroshi Fujimura, Masami Akamine:
Transfer Learning for Unseen Slots in End-to-End Dialogue State Tracking. IWSDS 2019: 53-65
2018
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/iwsds/YoshidaIFA18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwsds/YoshidaIFA18
Takami Yoshida, Kenji Iwata, Hiroshi Fujimura, Masami Akamine:
Dialog State Tracking for Unseen Values Using an Extended Attention Mechanism. IWSDS 2018: 77-89
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/KobayashiYIFA18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/KobayashiYIFA18
Yuka Kobayashi, Takami Yoshida, Kenji Iwata, Hiroshi Fujimura, Masami Akamine:
Out-of-Domain Slot Value Detection for Spoken Dialogue Systems with Context Information. SLT 2018: 854-861
2015
[c36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OhtaniNMA15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OhtaniNMA15
Yamato Ohtani, Yu Nasu, Masahiro Morita, Masami Akamine:
Emotional transplant in statistical speech synthesis based on emotion additive model. INTERSPEECH 2015: 274-278
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaiaSA15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaiaSA15
Ranniery Maia, Yannis Stylianou, Masami Akamine:
A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization. INTERSPEECH 2015: 603-607
2014
[c34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OhtaniTMA14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OhtaniTMA14
Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Masami Akamine:
GMM-based bandwidth extension using sub-band basis spectrum model. INTERSPEECH 2014: 2489-2493
2013
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LatorreGKA13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LatorreGKA13
Javier Latorre, Mark J. F. Gales, Kate M. Knill, Masami Akamine:
Training a supra-segmental parametric F0 model without interpolating F0. ICASSP 2013: 6880-6884
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MaiaAG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaiaAG13
Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum analysis based on the minimum mean squared error. ICASSP 2013: 7972-7976
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenGBAK13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenGBAK13
Langzhou Chen, Mark J. F. Gales, Norbert Braunschweiler, Masami Akamine, Kate M. Knill:
Integrated automatic expression prediction and speech synthesis from text. ICASSP 2013: 7977-7981
[c30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaiaGSA13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaiaGSA13
Ranniery Maia, Mark J. F. Gales, Yannis Stylianou, Masami Akamine:
Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis. INTERSPEECH 2013: 2336-2340
[c29]
- view
  - electronic edition @ isca-speech.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/WanABBCKLMSYSAGC13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WanABBCKLMSYSAGC13
Vincent Wan, Robert Anderson, Art Blokland, Norbert Braunschweiler, Langzhou Chen, BalaKrishna Kolluru, Javier Latorre, Ranniery Maia, Björn Stenger, Kayoko Yanagisawa, Yannis Stylianou, Masami Akamine, Mark J. F. Gales, Roberto Cipolla:
Photo-realistic expressive text to talking head synthesis. INTERSPEECH 2013: 2667-2669
2012
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MaiaAG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaiaAG12
Ranniery Maia, Masami Akamine, Mark J. F. Gales:
Complex cepstrum as phase information in statistical parametric speech synthesis. ICASSP 2012: 4581-4584
[c27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenGWLA12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenGWLA12
Langzhou Chen, Mark J. F. Gales, Vincent Wan, Javier Latorre, Masami Akamine:
Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training. INTERSPEECH 2012: 959-962
[c26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LatorreWGCCKA12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LatorreWGCCKA12
Javier Latorre, Vincent Wan, Mark J. F. Gales, Langzhou Chen, K. K. Chin, Kate M. Knill, Masami Akamine:
Speech factorization for HMM-TTS based on cluster adaptive training. INTERSPEECH 2012: 971-974
[c25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WanLCCGZKA12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WanLCCGZKA12
Vincent Wan, Javier Latorre, K. K. Chin, Langzhou Chen, Mark J. F. Gales, Heiga Zen, Kate M. Knill, Masami Akamine:
Combining multiple high quality corpora for improving HMM-TTS. INTERSPEECH 2012: 1135-1138
[c24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OhtaniTMKA12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OhtaniTMKA12
Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP. INTERSPEECH 2012: 1155-1158
[c23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OhtaniTMKA12a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OhtaniTMKA12a
Yamato Ohtani, Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
HMM-based speech synthesis using sub-band basis spectrum model. INTERSPEECH 2012: 1440-1443
2011
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LatorreGBKTOA11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LatorreGBKTOA11
Javier Latorre, Mark J. F. Gales, Sabine Buchholz, Kate M. Knill, Masatsune Tamura, Yamato Ohtani, Masami Akamine:
Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification? ICASSP 2011: 4724-4727
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TamuraMKA11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TamuraMKA11
Masatsune Tamura, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model. ICASSP 2011: 5124-5127
2010
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShinoharaMA10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShinoharaMA10
Yusuke Shinohara, Takashi Masuko, Masami Akamine:
Covariance clustering on Riemannian manifolds for acoustic model compression. ICASSP 2010: 4326-4329
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TamuraBKA10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TamuraBKA10
Masatsune Tamura, Norbert Braunschweiler, Takehiko Kagoshima, Masami Akamine:
Unit selection speech synthesis using multiple speech units at non-adjacent segments for prosody and waveform generation. ICASSP 2010: 4802-4805
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TamuraKA10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TamuraKA10
Masatsune Tamura, Takehiko Kagoshima, Masami Akamine:
Sub-band basis spectrum model for pitch-synchronous log-spectrum and phase based on approximation of sparse coding. INTERSPEECH 2010: 2406-2409
2009
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShinoharaA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShinoharaA09
Yusuke Shinohara, Masami Akamine:
Bayesian feature enhancement using a mixture of unscented transformation for uncertainty decoding of noisy speech. ICASSP 2009: 4569-4572
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AjmeraA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AjmeraA09
Jitendra Ajmera, Masami Akamine:
Decision tree acoustic models for ASR. INTERSPEECH 2009: 1403-1406
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LatorreGA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LatorreGA09
Javier Latorre, Sergio Gracia, Masami Akamine:
Feedback loop for prosody prediction in concatenative speech synthesis. INTERSPEECH 2009: 2067-2070
2008
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShinoharaMA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShinoharaMA08
Yusuke Shinohara, Takashi Masuko, Masami Akamine:
Feature enhancement by speaker-normalized splice for robust speech recognition. ICASSP 2008: 4881-4884
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DingYA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DingYA08
Hongfei Ding, Koichi Yamamoto, Masami Akamine:
Comparative evaluation of different methods for voice activity detection. INTERSPEECH 2008: 107-110
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AjmeraA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AjmeraA08
Jitendra Ajmera, Masami Akamine:
Speech recognition using soft decision trees. INTERSPEECH 2008: 940-943
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LatorreA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LatorreA08
Javier Latorre, Masami Akamine:
Multilevel parametric-base F0 model for speech synthesis. INTERSPEECH 2008: 2274-2277
2007
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TeunenA07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TeunenA07
Remco Teunen, Masami Akamine:
HMM-based speech recognition using decision trees instead of GMMs. INTERSPEECH 2007: 2097-2100
1999
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AmadaMA99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AmadaMA99
Tadashi Amada, Kimio Miseki, Masami Akamine:
CELP speech coding based on an adaptive pulse position codebook. ICASSP 1999: 13-16
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/interspeech/SuhKMSA99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SuhKMSA99
Chang K. Suh, Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine:
Toshiba English text-to-speech synthesizer (TESS). EUROSPEECH 1999: 2111-2114
1998
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OshikiriA98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OshikiriA98
Masahiro Oshikiri, Masami Akamine:
A 2.4 kbps variable bit rate ADP-CELP speech coder. ICASSP 1998: 517-520
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AkamineK98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AkamineK98
Masami Akamine, Takehiko Kagoshima:
Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS). ICSLP 1998
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KagoshimaMSA98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KagoshimaMSA98
Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine:
An F0 contour control model for totally speaker driven text to speech system. ICSLP 1998
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SetoMKA98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SetoMKA98
Shigenobu Seto, Masahiro Morita, Takehiko Kagoshima, Masami Akamine:
Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS system. ICSLP 1998
1997
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KagoshimaA97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KagoshimaA97
Takehiko Kagoshima, Masami Akamine:
Automatic generation of speech synthesis units based on closed loop training. ICASSP 1997: 963-966
1991
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MisekiA91
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MisekiA91
Kimio Miseki, Masami Akamine:
Adaptive bit-allocation between the pole-zero synthesis filter and excitation in CELP. ICASSP 1991: 229-232
1990
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AkamineM90
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AkamineM90
Masami Akamine, Kimio Miseki:
CELP coding with an adaptive density pulse excitation model. ICASSP 1990: 29-32

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-02844
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-02844
Thomas Drugman, Yannis Stylianou, Yusuke Kida, Masami Akamine:
Voice Activity Detection: Merging Source and Filter-based Information. CoRR abs/1903.02844 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.