![](https://dblp.uni-trier.de./img/logo.320x120.png)
![search dblp search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
default search action
Speech Communication, Volume 54
Volume 54, Number 1, January 2012
- Abhishek Jaywant
, Marc D. Pell
:
Categorical processing of negative emotions from speech prosody. 1-10 - Elisabetta Fersini
, Enza Messina
, Francesco Archetti:
Emotional states in judicial courtrooms: An experimental investigation. 11-22 - Mouloud Djamah, Douglas D. O'Shaughnessy:
Fine granularity scalable speech coding using embedded tree-structured vector quantization. 23-39 - Abhijeet Sangwan, John H. L. Hansen:
Automatic analysis of Mandarin accented English using phonological features. 40-54 - Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features. 55-67 - Máire Ní Chiosáin, Pauline Welby
, Robert Espesser:
Is the syllabification of Irish a typological exception? An experimental study. 68-91 - Silke Paulmann, Debra Titone, Marc D. Pell
:
How emotional prosody guides your way: Evidence from eye movements. 92-107 - Peter Jancovic, Xin Zou, Münevver Köküer:
Speech enhancement based on Sparse Code Shrinkage employing multiple speech models. 108-118 - Cong-Thanh Do, Dominique Pastor
, André Goalic:
A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech. 119-133 - Keigo Nakamura, Tomoki Toda
, Hiroshi Saruwatari, Kiyohiro Shikano:
Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. 134-146 - Ying-Yee Kong, Ala Mullangi:
On the development of a frequency-lowering system that enhances place-of-articulation perception. 147-160
Volume 54, Number 2, February 2012
- Nigel G. Ward, Alejandro Vega, Timo Baumann
:
Prosodic and temporal features for language modeling for dialog. 161-174 - J. Sebastian Andersson, Junichi Yamagishi, Robert A. J. Clark:
Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis. 175-188 - Sophie Bouton
, Pascale Colé
, Willy Serniclaes:
The influence of lexical knowledge on phoneme discrimination in deaf children with cochlear implants. 189-198 - Jón Guðnason
, Mark R. P. Thomas, Daniel P. W. Ellis, Patrick A. Naylor
:
Data-driven voice source waveform analysis and synthesis. 199-211 - George Saon
, Hagen Soltau:
Boosting systems for large vocabulary continuous speech recognition. 212-218 - Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Ariya Rastrow, Nobuyasu Itoh, Masafumi Nishimura:
Acoustically discriminative language model training with pseudo-hypothesis. 219-228 - Masakiyo Fujimoto, Shinji Watanabe
, Tomohiro Nakatani:
Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection. 229-244 - Vataya Chunwijitra, Takashi Nose, Takao Kobayashi:
A tone-modeling technique using a quantized F0 context to improve tone correctness in average-voice-based speech synthesis. 245-255 - Hamid Reza Tohidypour, Seyyed Ali Seyyedsalehi
, Hossein Behbood, Hossein Roshandel:
A new representation for speech frame recognition based on redundant wavelet filter banks. 256-271 - Fei Chen
, Philipos C. Loizou:
Impact of SNR and gain-function over- and under-estimation on speech intelligibility. 272-281 - Kuldip K. Paliwal
, Belinda Schwerin
, Kamil K. Wójcicki:
Speech enhancement using a minimum mean-square error short-time spectral modulation magnitude estimator. 282-305 - Andrew Hines
, Naomi Harte
:
Speech intelligibility prediction using a Neurogram Similarity Index Measure. 306-320
Volume 54, Number 3, March 2012
- Bert Réveil, Jean-Pierre Martens, Henk van den Heuvel:
Improving proper name recognition by means of automatically learned pronunciation variants. 321-340 - Pandurangarao N. Kulkarni, Prem C. Pandey, Dakshayani S. Jangamashetti:
Multi-band frequency compression for improving speech perception by listeners with moderate sensorineural hearing loss. 341-350 - Antonio Moreno-Daniel, Jay G. Wilpon, Biing-Hwang Juang:
Index-based incremental language model for scalable directory assistance. 351-367 - Daniel Recasens:
A cross-language acoustic study of initial and final allophones of /l/. 368-383 - Takashi Nose, Takao Kobayashi:
Very low bit-rate F0 coding for phonetic vocoders using MSD-HMM with quantized F0 symbols. 384-392 - Amaro A. de Lima, Thiago de M. Prego
, Sergio L. Netto
, Bowon Lee, Amir Said, Ronald W. Schafer, Ton Kalker, Majid Fozunbal:
On the quality-assessment of reverberated speech. 393-401 - Peng Dai
, Ing Yann Soon:
A temporal frequency warped (TFW) 2D psychoacoustic filter for robust speech recognition system. 402-413 - Ioulia Grichkovtsova, Michel Morel, Anne Lacheret:
The role of voice quality and prosodic contour in affective speech perception. 414-429 - Frank Rudzicz
:
Using articulatory likelihoods in the recognition of dysarthric speech. 430-444 - Je Hun Jeon, Yang Liu:
Automatic prosodic event detection using a novel labeling and selection method in co-training. 445-458 - Jordi Adell, David Escudero Mancebo
, Antonio Bonafonte
:
Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence. 459-476 - Jae-Hun Choi, Joon-Hyuk Chang:
On using acoustic environment classification for statistical model-based speech enhancement. 477-490 - Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran:
Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech. 491-502 - Angel M. Gomez
, Belinda Schwerin
, Kuldip K. Paliwal
:
Improving objective intelligibility prediction by combining correlation and coherence based methods with a measure based on the negative distortion ratio. 503-515
Volume 54, Number 4, May 2012
- Anis Ben Aicha, Sofia Ben Jebara:
Perceptual speech quality measures separating speech distortion and additive noise degradations. 517-528 - Meihong Wu, Huahui Li, Zhiling Hong, Xinchi Xian, Jingyu Li, Xihong Wu, Liang Li:
Effects of aging on the ability to benefit from prior knowledge of message content in masked speech recognition. 529-542 - Md. Sahidullah
, Goutam Saha
:
Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. 543-565 - David Escudero Mancebo
, Lourdes Aguilar
, María Vanrell
, Pilar Prieto
:
Analysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system. 566-582
Volume 54, Number 5, June 2012
- William Ricardo Rodríguez
, Oscar Saz, Eduardo Lleida
:
A prelingual tool for the education of altered voices. 583-600 - Evaldas Vaiciukynas
, Antanas Verikas, Adas Gelzinis, Marija Bacauskiene, Virgilijus Uloza:
Exploring similarity-based classification of larynx disorders from human voice. 601-610 - David M. Howard
, Evelyn Abberton, Adrian Fourcin:
Disordered voice measurement and auditory analysis. 611-621 - Tiago H. Falk
, Wai-Yip Chan, Fraser Shein:
Characterization of atypical vocal source excitation, temporal dynamics and prosody for objective measurement of dysarthric word intelligibility. 622-631 - Marieke de Bruijn, Louis ten Bosch, Dirk J. Kuik, Birgit I. Witte, Johannes A. Langendijk
, C. René Leemans, Irma Verdonck-de Leeuw
:
Acoustic-phonetic and artificial neural network feature analysis to assess speech quality of stop consonants produced by patients treated for oral or oropharyngeal cancer. 632-640 - Sevasti-Zoi Karakozoglou, Nathalie Henrich
, Christophe d'Alessandro
, Yannis Stylianou:
Automatic glottal segmentation using local-based active contours and application to glottovibrography. 641-654 - Ali Alpan, Jean Schoentgen, Youri Maryn, Francis Grenez, P. Murphy:
Assessment of disordered voice via the first rahmonic. 655-663 - Alain Ghio
, Gilles Pouchoulin
, Bernard Teston, Serge Pinto, Corinne Fredouille, Céline De Looze, Danièle Robert, François Viallet, Antoine Giovanni:
How to manage sound, physiological and clinical data of 2500 dysphonic and dysarthric speakers? 664-679
Volume 54, Number 6, July 2012
- Pilar Prieto
, María Vanrell
, Lluïsa Astruc, Elinor Payne, Brechtje Post:
Phonotactic and phrasal properties of speech rhythm. Evidence from Catalan, English, and Spanish. 681-702 - Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King
, Keiichi Tokuda:
Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping. 703-714 - Tobias Kaufmann, Beat Pfister:
Syntactic language modeling with formal grammars. 715-731 - Petr Zelinka, Milan Sigmund
, Jiri Schimmel:
Impact of vocal effort variability on automatic speech recognition. 732-742 - Rigas Kotsakis, George Kalliris
, Charalampos Dimoulas
:
Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification. 743-762 - Mohammad Hossein Moattar
, Mohammad Mehdi Homayounpour
:
Variational conditional random fields for online speaker detection and tracking. 763-780 - Mirjam Wester:
Talker discrimination across languages. 781-790 - Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Efficient training of discriminative language models by sample selection. 791-800 - Herman Kamper
, Félicien Jeje Muamba Mukanya, Thomas Niesler:
Multi-accent acoustic modelling of South African English. 801-813 - Eduardo Pavez
, Jorge F. Silva
:
Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition. 814-835 - Ronan Flynn
, Edward Jones
:
Feature selection for reduced-bandwidth distributed speech recognition. 836-843 - David M. Howard
, Evelyn Abberton, Adrian Fourcin:
Erratum to "Disordered voice measurement and auditory analysis" [Speech Comm. 54(2012) 611-621]. 844
Volume 54, Number 7, September 2012
- Lan Wang, Hui Chen, Sheng Li
, Helen M. Meng:
Phoneme-level articulatory animation in pronunciation training. 845-856 - Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King
, Keiichi Tokuda:
Impacts of machine translation and speech synthesis on speech-to-speech translation. 857-866 - Shajith Ikbal, Hemant Misra, Hynek Hermansky
, Mathew Magimai-Doss
:
Phase AutoCorrelation (PAC) features for noise robust speech recognition. 867-880 - Ronan Flynn
, Edward Jones
:
Reducing bandwidth for robust distributed speech recognition in conditions of packet loss. 881-892 - Thorsten Smit, Friedrich Türckheim, Robert Mores:
Fast and robust formant detection from LP data. 893-902 - Ali Hassan
, Robert I. Damper:
Classification of emotional speech using 3DEC hierarchical classifier. 903-916 - Hugo Quené
, Gün Refik Semin
, Francesco Foroni
:
Audible smiles and frowns affect speech comprehension. 917-922
Volume 54, Number 8, October 2012
- Yana Yunusova
, Melanie Baljko
, Grigore Pintilie, Krista Rudy, Petros Faloutsos, John Daskalogiannakis:
Acquisition of the 3D surface of the palate by in-vivo digitization with Wave. 923-931 - Qinghua Sun, Keikichi Hirose, Nobuaki Minematsu:
A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model. 932-945 - Peggy P. K. Mok:
Effects of consonant cluster syllabification on vowel-to-vowel coarticulation in English. 946-956 - Zhongbo Li, Shenghui Zhao, Stefan Bruhn, Jing Wang, Jingming Kuang:
Comparison and optimization of packet loss recovery methods based on AMR-WB for VoIP. 957-974
Volume 54, Number 9, November 2012
- Okko Räsänen
:
Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions. 975-997 - Toshio Irino, Yoshie Aoki, Hideki Kawahara
, Roy D. Patterson:
Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination. 998-1013 - Atsunori Ogawa, Atsushi Nakamura:
Joint estimation of confidence and error causes in speech recognition. 1014-1028 - Irene Ayllón Clemente, Martin Heckmann
, Britta Wrede
:
Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation. 1029-1048 - Khiet P. Truong, David A. van Leeuwen, Franciska M. G. de Jong:
Speech-based recognition of self-reported and observed emotion in a dimensional space. 1049-1063
Volume 54, Number 10, December 2012
- Mohammad Hossein Moattar
, Mohammad Mehdi Homayounpour
:
A review on speaker diarization systems and approaches. 1065-1103 - Veena Karjigi
, Preeti Rao:
Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling. 1104-1120 - Edward Ozimek, Dariusz Kutzner, Pawel Libiszewski:
Speech intelligibility tested by the Pediatric Matrix Sentence test in 3-6 year old children. 1121-1131 - Doris Baum:
Recognising speakers from the topics they talk about. 1132-1142
![](https://dblp.uni-trier.de./img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.