


default search action
SAPA@INTERSPEECH 2012: Portland, OR, USA
- ISCA Workshop on Statistical And Perceptual Audition, SAPA 2012, Portland, OR, USA, September 7-8, 2012. ISCA 2012
Keynote Paper
- Tuomas Virtanen:
Human sound perception - what can we learn from it when developing audio analysis algorithms?
Contributed Papers
- Majid Mirbagheri, Yanbo Xu, Shihab A. Shamma:
Pitch estimation using mutual information. 1-4 - Mauro Nicolao, Roger K. Moore:
Establishing some principles of human speech production through two-dimensional computational models. 5-10 - Tomoyasu Nakano, Masataka Goto:
A spectral envelope estimation method based on F0-adaptive multi-frame integration analysis. 11-16 - Cong-Thanh Do, Claude Barras:
Cochlear implant-like processing of speech signal for speaker verification. 17-21 - Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King:
Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise. 22-27 - Sunder Ram Krishnan, Chandra Sekhar Seelamantula:
A generalized Stein's estimation approach for speech enhancement based on perceptual criteria. 28-33 - Zoltán Tüske, Friedhelm R. Drepper, Ralf Schlüter:
Non-stationary signal processing and its application in speech recognition. 34-39 - Liang Lu, Arnab Ghoshal, Steve Renals:
Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture models. 40-45 - M. Ali Basha Shaik, David Rybach, Stefan Hahn, Ralf Schlüter, Hermann Ney:
Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFST. 46-51 - Serena Soldo, Mathew Magimai-Doss, Hervé Bourlard:
Template-based ASR using posterior features and synthetic references: comparing different TTS systems. 52-57 - Kalu U. Ogbureke, João P. Cabral, Julie Carson-Berndsen:
Explicit duration modelling in HMM-based speech synthesis using a hybrid hidden Markov model-multilayer perceptron. 58-63 - Deepu Vijayasenan, Fabio Valente:
Dimensionality reduction of large TDOA vectors for speaker diarization. 64-67 - Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel, Dietrich Klakow:
Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power. 68-73 - Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard, Volkan Cevher:
Structured sparse coding for microphone array location calibration. 74-79 - Takuya Yoshioka, Daichi Sakaue:
Log-normal matrix factorization with application to speech-music separation. 80-85 - Rahil Mahdian Toroghi, Friedrich Faubel, Dietrich Klakow:
Multi-channel speech separation with soft time-frequency masking. 86-91 - Heyun Huang, Louis ten Bosch, Bert Cranen, Lou Boves:
Smoothing speech trajectories by regularization. 92-97 - Joris Driesen, Jort F. Gemmeke, Hugo Van hamme:
Data-driven speech representations for NMF-based word learning. 98-103 - Samuel K. Ngouoko M, Martin Heckmann, Britta Wrede:
Spectro-temporal features with distribution equalization. 104-109 - Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj:
Language identification using spectro-temporal patch features. 110-113 - Josh H. McDermott, Daniel P. W. Ellis, Hideki Kawahara:
Inharmonic speech: a tool for the study of speech perception and separation. 114-117

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.