default search action
IEEE Transactions on Audio, Speech & Language Processing, Volume 19
Volume 19, Number 1, January 2011
- Tobias May, Steven van de Par, Armin Kohlrausch:
A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End. 1-13 - Stefan Strahl, Heiko Hansen, Alfred Mertins:
A Dynamic Fine-Grain Scalable Compression Scheme With Application to Progressive Audio Coding. 14-23 - Albertus C. den Brinker, Harish Krishnamoorthi, E. A. Verbitskiy:
Similarities and Differences Between Warped Linear Prediction and Laguerre Linear Prediction. 24-33 - Konrad Kowalczyk, Maarten van Walstijn:
Room Acoustics Simulation Using 3-D Compact Explicit FDTD Schemes. 34-46 - Philipos C. Loizou, Gibak Kim:
Reasons why Current Speech-Enhancement Algorithms do not Improve Speech Intelligibility and Suggested Solutions. 47-56 - Mikel Gainza, Eugene Coyle:
Tempo Detection Using a Hybrid Multiband Approach. 57-68 - Takuya Yoshioka, Tomohiro Nakatani, Masato Miyoshi, Hiroshi G. Okuno:
Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization. 69-84 - Yun Lei, John H. L. Hansen:
Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese. 85-96 - Luis Antonio Azpicueta-Ruiz, Marcus Zeller, Aníbal R. Figueiras-Vidal, Jerónimo Arenas-García, Walter Kellermann:
Adaptive Combination of Volterra Kernels and Its Application to Nonlinear Acoustic Echo Cancellation. 97-110 - Jayme G. A. Barbedo, George Tzanetakis:
Musical Instrument Classification Using Individual Partials. 111-122 - Maarten Van Segbroeck, Hugo Van hamme:
Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition. 123-137 - Hélène Papadopoulos, Geoffroy Peeters:
Joint Estimation of Chords and Downbeats From an Audio Signal. 138-152 - Tuomo Raitio, Antti Suni, Junichi Yamagishi, Hannu Pulakka, Jani Nurminen, Martti Vainio, Paavo Alku:
HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering. 153-165 - Mohsen A. Rashwan, Mohamed Al-Badrashiny, Mohamed Attia, Sherif M. Abdou, Ahmed Rafea:
A Stochastic Arabic Diacritizer Based on a Hybrid of Factorized and Unfactorized Textual Features. 166-175 - Andre Holzapfel, Yannis Stylianou:
Scale Transform in Rhythmic Similarity of Music. 176-185 - Suhadi Suhadi, Carsten Last, Tim Fingscheidt:
A Data-Driven Approach to A Priori SNR Estimation. 186-195 - Ning Wang, P. C. Ching, Nengheng Zheng, Tan Lee:
Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features. 196-205 - Alexander Krueger, Ernst Warsitz, Reinhold Haeb-Umbach:
Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation. 206-219
Volume 19, Number 2, February 2011
- Joel Pinto, Garimella S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky, Hervé Bourlard:
Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator. 225-241 - Michael Stark, Michael Wohlmayr, Franz Pernkopf:
Source-Filter-Based Single-Channel Speech Separation Using Pitch Information. 242-255 - Etan Fisher, Boaz Rafaely:
Near-Field Spherical Microphone Array Processing With Radial Filtering. 256-265 - Weiqiang Zhang, Liang He, Yan Deng, Jia Liu, Michael T. Johnson:
Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition. 266-276 - Colin Breithaupt, Rainer Martin:
Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions. 277-289 - Yannis Pantazis, Olivier Rosec, Yannis Stylianou:
Adaptive AM-FM Signal Decomposition With Application to Speech Analysis. 290-300 - Mitsuko Aramaki, Mireille Besson, Richard Kronland-Martinet, Sølvi Ystad:
Controlling the Perceived Material in an Impact Sound Synthesizer. 301-314 - D. K. Kim, Mark J. F. Gales:
Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition. 315-325 - Namgook Cho, C.-C. Jay Kuo:
Sparse Music Representation With Source-Specific Dictionaries and Its Application to Signal Separation. 326-337 - Ben Milner, Jonathan Darch:
Robust Acoustic Speech Feature Prediction From Noisy Mel-Frequency Cepstral Coefficients. 338-347 - Joseph Tepperman, Sungbok Lee, Shrikanth S. Narayanan, Abeer Alwan:
A Generative Student Model for Scoring Word Reading Skills. 348-360 - Shefeng Yan, Haohai Sun, U. Peter Svensson, Xiaochuan Ma, J. M. Hovem:
Optimal Modal Beamforming for Spherical Microphone Arrays. 361-371 - Marco Kühne, Roberto Togneri, Sven Nordholm:
A New Evidence Model for Missing Data Speech Recognition With Applications in Reverberant Multi-Source Environments. 372-384 - Dinh-Quy Nguyen, Woon-Seng Gan, Andy W. H. Khong:
Time-Reversal Approach to the Stereophonic Acoustic Echo Cancellation Problem. 385-395 - Fritz Menzer, Christof Faller, Hervé Lissek:
Obtaining Binaural Room Impulse Responses From B-Format Impulse Responses Using Frequency-Dependent Coherence Matching. 396-405 - Zbynek Koldovský, Petr Tichavský:
Time-Domain Blind Separation of Audio Sources on the Basis of a Complete ICA Decomposition of an Observation Space. 406-416 - Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda:
Continuous Stochastic Feature Mapping Based on Trajectory HMMs. 417-430 - Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization. 431-438
Volume 19, Number 3, March 2011
- Mohammad A. Dmour, Mike E. Davies:
A New Framework for Underdetermined Speech Extraction Using Mixture of Beamformers. 445-457 - Miroslav Zivanovic, Johan Schoukens:
On The Polynomial Approximation for Time-Variant Harmonic Signal Modeling. 458-467 - Ana I. García-Moral, Rubén Solera-Ureña, Carmen Peláez-Moreno, Fernando Díaz-de-María:
Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems. 468-481 - Jen-Tzung Chien, Chuang-Hua Chueh:
Dirichlet Class Language Models for Speech Recognition. 482-495 - Feipeng Li, Jont B. Allen:
Manipulation of Consonants in Natural Speech. 496-504 - Donglai Zhu, Bin Ma, Haizhou Li:
Speaker Verification With Feature-Space MAPLR Parameters. 505-515 - Hiroshi Sawada, Shoko Araki, Shoji Makino:
Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment. 516-527 - Konrad Kowalczyk, Maarten van Walstijn, Damian T. Murphy:
A Phase Grating Approach to Modeling Surface Diffusion in FDTD Room Acoustics Simulations. 528-537 - Fei Liu, Feifan Liu, Yang Liu:
A Supervised Framework for Keyword Extraction From Meeting Transcripts. 538-548 - Lin Wang, Heping Ding, Fuliang Yin:
A Region-Growing Permutation Alignment Approach in Frequency-Domain Blind Source Separation of Speech Mixtures. 549-557 - L. Anders Ekman, Volodya Grancharov, W. Bastiaan Kleijn:
Double-Ended Quality Assessment System for Super-Wideband Speech. 558-569 - Jia Jia, Shen Zhang, Fanbo Meng, Yongxin Wang, Lianhong Cai:
Emotional Audio-Visual Speech Synthesis Based on PAD. 570-582 - Francesco Nesta, Ted S. Wada, Biing-Hwang Juang:
Batch-Online Semi-Blind Source Separation Applied to Multi-Channel Acoustic Echo Cancellation. 583-599 - Prasanta Kumar Ghosh, Andreas Tsiartas, Shrikanth S. Narayanan:
Robust Voice Activity Detection Using Long-Term Signal Variability. 600-613 - Sheng Wu, Xiaojun Qiu, Ming Wu:
Stereo Acoustic Echo Cancellation Employing Frequency-Domain Preprocessing and Adaptive Filter. 614-623 - Francesco Nesta, Piergiorgio Svaizer, Maurizio Omologo:
Convolutive BSS of Short Mixtures by ICA Recursively Regularized Across Frequencies. 624-639 - Juan Andres Morales-Cordovilla, Antonio M. Peinado, Victoria E. Sánchez, José A. González:
Feature Extraction Based on Pitch-Synchronous Averaging for Robust Speech Recognition. 640-651 - Miguel Ferrer, Alberto González, Maria de Diego, Gema Piñero:
Transient Analysis of the Conventional Filtered-x Affine Projection Algorithm for Active Noise Control. 652-657
Volume 19, Number 4, May 2011
- Ivan Himawan, Iain McCowan, Sridha Sridharan:
Clustered Blind Beamforming From Ad-Hoc Microphone Arrays. 661-676 - Guilin Ma, Fredrik Gran, Finn Jacobsen, Finn T. Agerkvist:
Adaptive Feedback Cancellation With Band-Limited LPC Vocoder in Digital Hearing Aids. 677-687 - Dong Wang, Simon King, Joe Frankel:
Stochastic Pronunciation Modeling for Out-of-Vocabulary Spoken Term Detection. 688-698 - Ashutosh Pandey, V. John Mathews:
Low-Delay Signal Processing for Digital Hearing Aids. 699-710 - Charles D. Creusere, Joseph C. Hardin:
Assessing the Quality of Audio Containing Temporally Varying Distortions. 711-720 - Evgeny Matusov, Hermann Ney:
Lattice-Based ASR-MT Interface for Speech Translation. 721-732 - Rogier C. van Dalen, Mark J. F. Gales:
Extended VTS for Noise-Robust Speech Recognition. 733-743 - Romain Hennequin, Roland Badeau, Bertrand David:
NMF With Time-Frequency Activations to Model Nonstationary Audio Events. 744-753 - Roberto Barra-Chicote, José Manuel Pardo, Javier Ferreiros, Juan Manuel Montero:
Speaker Diarization Based on Intensity Channel Contribution. 754-761 - Yi-Hsuan Yang, Homer H. Chen:
Ranking-Based Emotion Recognition for Music Organization and Retrieval. 762-774 - Mohammed Ariful Haque, Toufiqul Islam, Md. Kamrul Hasan:
Robust Speech Dereverberation Based on Blind Adaptive Estimation of Acoustic Channels. 775-787 - Najim Dehak, Patrick Kenny, Réda Dehak, Pierre Dumouchel, Pierre Ouellet:
Front-End Factor Analysis for Speaker Verification. 788-798 - Michael Wohlmayr, Michael Stark, Franz Pernkopf:
A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models. 799-810 - Perry Groot, Tom Heskes, Tjeerd Dijkstra, James M. Kates:
Predicting Preference Judgments of Individual Normal and Hearing-Impaired Listeners With Gaussian Processes. 811-821 - Ji Ming, Ramji Srinivasan, Danny Crookes:
A Corpus-Based Approach to Speech Enhancement From Nonstationary Noise. 822-836 - Arshia Cont, Shlomo Dubnov, Gérard Assayag:
On the Information Geometry of Audio Streams With Applications to Similarity Computing. 837-846 - Hayley Hung, Yan Huang, Gerald Friedland, Daniel Gatica-Perez:
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization. 847-860 - Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, Khe Chai Sim:
Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification. 861-870 - Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen:
Leveraging Kullback-Leibler Divergence Measures and Information-Rich Cues for Speech Summarization. 871-882 - Chi Zhang, John H. L. Hansen:
Whisper-Island Detection Based on Unsupervised Segmentation With Entropy-Based Speech Feature Processing. 883-894 - Matthew Gibson, William Byrne:
Unsupervised Intralingual and Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthesis Using Two-Pass Decision Tree Construction. 895-904 - Min-Seok Choi, Hong-Goo Kang:
A Two-Channel Noise Estimator for Speech Enhancement in a Highly Nonstationary Environment. 905-915 - Saman Mousazadeh, Israel Cohen:
AR-GARCH in Presence of Noise: Parameter Estimation and Its Application to Voice Activity Detection. 916-926 - Qiang Wu, Liqing Zhang, Guangchuan Shi:
Robust Multifactor Speech Feature Extraction Based on Gabor Analysis. 927-936 - Peifeng Ji, Ee-Leng Tan, Woon-Seng Gan, Jun Yang:
A Comparative Analysis of Preprocessing Methods for the Parametric Loudspeaker Based on the Khokhlov-Zabolotskaya-Kuznetsov Equation for Speech Reproduction. 937-946 - Frank Rudzicz:
Articulatory Knowledge in the Recognition of Dysarthric Speech. 947-960 - Bin Gao, Wai Lok Woo, Satnam Singh Dlay:
Single-Channel Source Separation Using EMD-Subband Variable Regularized Sparse Features. 961-976 - Daniel Rudoy, Thomas F. Quatieri, Patrick J. Wolfe:
Time-Varying Autoregressions in Speech: Detection Theory and Applications. 977-989 - Hyeon-Jin Jeon, Tae-Gyu Chang, Sungwook Yu, Sen M. Kuo:
A Narrowband Active Noise Control System With Frequency Corrector. 990-1002 - Emiru Tsunoo, George Tzanetakis, Nobutaka Ono, Shigeki Sagayama:
Beyond Timbral Statistics: Improving Music Classification Using Percussive Patterns and Bass Lines. 1003-1014 - Matthew P. Black, Joseph Tepperman, Shrikanth S. Narayanan:
Automatic Prediction of Children's Reading Ability for High-Level Literacy Assessment. 1015-1028 - Dongho Kim, Jin H. Kim, Kee-Eung Kim:
Robust Performance Evaluation of POMDP-Based Dialogue Systems. 1029-1040 - Lifu Wu, Hongsen He, Xiaojun Qiu:
An Active Impulsive Noise Control Algorithm With Logarithmic Transformation. 1041-1044 - Haohai Sun, Shefeng Yan, U. Peter Svensson:
Robust Minimum Sidelobe Beamforming for Spherical Microphone Arrays. 1045-1051
Volume 19, Number 5, July 2011
- Emily Mower, Maja J. Mataric, Shrikanth S. Narayanan:
A Framework for Automatic Human Emotion Classification Using Emotion Profiles. 1057-1070 - Kai Yu, Steve J. Young:
Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis. 1071-1079 - Gilles Degottex, Axel Röbel, Xavier Rodet:
Phase Minimization for Glottal Model Estimation. 1080-1090 - Zhaozhang Jin, DeLiang Wang:
HMM-Based Multipitch Tracking for Noisy and Reverberant Speech. 1091-1102 - Ralf Schlüter, Markus Nußbaum-Thom, Hermann Ney:
On the Relationship Between Bayes Risk and Word Error Rate in ASR. 1103-1112 - Jerome R. Bellegarda:
A Data-Driven Affective Analysis Framework Toward Naturally Expressive Speech Synthesis. 1113-1122 - Yang Lu, Philipos C. Loizou:
Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty. 1123-1137 - Georg Heigold, Hermann Ney, Patrick Lehnen, Tobias Gass, Ralf Schlüter:
Equivalence of Generative and Log-Linear Models. 1138-1148 - Aastha Gupta, Thushara D. Abhayapala:
Three-Dimensional Sound Field Reproduction Using Multiple Circular Loudspeaker Arrays. 1149-1159 - Shasha Xie, Yang Liu:
Using N-Best Lists and Confusion Networks for Meeting Summarization. 1160-1169 - Werayuth Charoenruengkit, Nurgun Erdol:
The Effect of Spectral Estimation on Speech Enhancement Performance. 1170-1179 - I. Yücel Özbek, Mark Hasegawa-Johnson, Mübeccel Demirekler:
Estimation of Articulatory Trajectories Based on Gaussian Mixture Model (GMM) With Audio-Visual Information Fusion and Dynamic Kalman Smoothing. 1180-1195 - Wei-Ho Tsai, Hao-Ping Lin:
Background Music Removal Based on Cepstrum Transformation for Popular Singer Identification. 1196-1205 - José A. González, Antonio M. Peinado, Angel M. Gomez, José L. Carmona:
Efficient MMSE Estimation and Uncertainty Processing for Multienvironment Robust Speech Recognition. 1206-1220 - Shefeng Yan, Haohai Sun, Xiaochuan Ma, U. Peter Svensson, Chaohuan Hou:
Time-Domain Implementation of Broadband Beamformer in Spherical Harmonics Domain. 1221-1230 - Vladimir Britanak:
On Properties, Relations, and Simplified Implementation of Filter Banks in the Dolby Digital (Plus) AC-3 Audio Coding Standards. 1231-1241 - Geoffroy Peeters:
Spectral and Temporal Periodicity Representations of Rhythm for the Automatic Classification of Music Audio Signal. 1242-1252 - C.-Y. Lin, H.-C. Wang:
Burst Onset Landmark Detection and Its Application to Speech Recognition. 1253-1264 - Pejman Mowlaee, Mads Græsbøll Christensen, Søren Holdt Jensen:
New Results on Single-Channel Speech Separation Using Sinusoidal Modeling. 1265-1277 - Stas Tiomkin, David Malah, Slava Shechtman, Zvi Kons:
A Hybrid Text-to-Speech System That Combines Concatenative and Statistical Synthesis Units. 1278-1288 - Han-Ping Shen, Jui-Feng Yeh, Chung-Hsien Wu:
Speaker Clustering Using Decision Tree-Based Phone Cluster Models With Multi-Space Probability Distributions. 1289-1300 - T. Etame, Régine Le Bouquin-Jeannès, Catherine Quinquis, Lætitia Gros, Gérard Faucon:
Towards a New Reference Impairment System in the Subjective Evaluation of Speech Codecs. 1301-1315 - Chengyuan Ma, Chin-Hui Lee:
A Regularized Maximum Figure-of-Merit (rMFoM) Approach to Supervised and Semi-Supervised Learning. 1316-1327 - Fabien Ringeval, Jean Demouy, György Szaszák, Mohamed Chetouani, L. Robel, Jean Xavier, David Cohen, Monique Plaza:
Automatic Intonation Recognition for the Prosodic Assessment of Language-Impaired Children. 1328-1342 - Emanuele Coviello, Antoni B. Chan, Gert R. G. Lanckriet:
Time Series Models for Semantic Music Annotation. 1343-1359 - Maider Lehr, Izhak Shafran:
Learning a Discriminative Weighted Finite-State Transducer for Speech Recognition. 1360-1367 - Bram Cornelis, Marc Moonen, Jan Wouters:
Performance Analysis of Multichannel Wiener Filter-Based Noise Reduction in Hearing Aids Under Second Order Statistics Estimation Errors. 1368-1381 - Anthony Griffin, Toni Hirvonen, Christos Tzagkarakis, Athanasios Mouchtaris, Panagiotis Tsakalides:
Single-Channel and Multi-Channel Sinusoidal Audio Coding Using Compressed Sensing. 1382-1395 - Jibran Yousafzai, Peter Sollich, Zoran Cvetkovic, Bin Yu:
Combined Features and Kernel Design for Noise Robust Phoneme Classification Using Support Vector Machines. 1396-1407 - Xing Fan, John H. L. Hansen:
Speaker Identification Within Whispered Speech Audio Streams. 1408-1421 - Peter Birkholz, Bernd J. Kröger, Christiane Neuschaefer-Rube:
Model-Based Reproduction of Articulatory Trajectories for Consonant-Vowel Sequences. 1422-1433 - Wooil Kim, John H. L. Hansen:
A Novel Mask Estimation Method Employing Posterior-Based Representative Mean Estimate for Missing-Feature Speech Recognition. 1434-1443 - Kishore Prahallad, Alan W. Black:
Segmentation of Monologues in Audio Books for Building Synthetic Voices. 1444-1449
Volume 19, Number 6, August 2011
- Hiroshi Saruwatari, Yohei Ishikawa, Yu Takahashi, Takayuki Inoue, Kiyohiro Shikano, Kazunobu Kondo:
Musical Noise Controllable Algorithm of Channelwise Spectral Subtraction and Adaptive Beamforming Based on Higher Order Statistics. 1457-1466 - Andrea Andò:
Conversion of Multichannel Sound Signal Maintaining Physical Properties of Sound in Reproduced Sound Field. 1467-1475 - Antonio Miguel, Alfonso Ortega, Luis Buera, Eduardo Lleida:
Bayesian Networks for Discrete Observation Distributions in Speech Recognition. 1476-1489 - Anthony Lombard, Yuanhang Zheng, Herbert Buchner, Walter Kellermann:
TDOA Estimation for Multiple Sound Sources in Noisy and Reverberant Environments Using Broadband Independent Component Analysis. 1490-1503 - Dimitrios Dimitriadis, Petros Maragos, Alexandros Potamianos:
On the Effects of Filterbank Design and Energy Computation on Robust Speech Recognition. 1504-1516 - Ciira Wa Maina, John MacLaren Walsh:
Joint Speech Enhancement and Speaker Identification Using Approximate Bayesian Inference. 1517-1529 - Stefania Cecchi, Laura Romoli, Paolo Peretti, Francesco Piazza:
A Combined Psychoacoustic Approach for Stereo Acoustic Echo Cancellation. 1530-1539 - A. Levy, Sharon Gannot, Emanuël A. P. Habets:
Multiple-Hypothesis Extended Particle Filter for Acoustic Source Localization in Reverberant Environments. 1540-1555 - H. D. Tran, Haizhou Li:
Sound Event Recognition With Probabilistic Distance SVMs. 1556-1568 - Stefan Hahn, Marco Dinarelli, Christian Raymond, Fabrice Lefèvre, Patrick Lehnen, Renato de Mori, Alessandro Moschitti, Hermann Ney, Giuseppe Riccardi:
Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages. 1569-1583 - Ronen Talmon, Israel Cohen, Sharon Gannot:
Transient Noise Reduction Using Nonlocal Diffusion Filters. 1584-1599 - Ke Hu, DeLiang Wang:
Unvoiced Speech Segregation From Nonspeech Interference via CASA and Spectral Subtraction. 1600-1609 - Fabrizio Argenti, Paolo Nesi, Gianni Pantaleo:
Automatic Transcription of Polyphonic Music Based on the Constant-Q Bispectral Analysis. 1610-1630 - Hyunson Seo, Chi-Sang Jung, Hong-Goo Kang:
Robust Session Variability Compensation for SVM Speaker Verification. 1631-1641 - Ibrahim Almajai, Ben Milner:
Visually Derived Wiener Filters for Speech Enhancement. 1642-1651 - Dalei Wu, Yan Yin, Hui Jiang:
Large-Margin Estimation of Hidden Markov Models With Second-Order Cone Programming for Speech Recognition. 1652-1664 - Haitian Xu, Mark J. F. Gales, K. K. Chin:
Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition. 1665-1676 - Roy Wallace, Brendan Baker, Robbie Vogt, Sridha Sridharan:
Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection. 1677-1687 - Peter Grosche, Meinard Müller:
Extracting Predominant Local Pulse Information From Music Recordings. 1688-1701 - Yao Qian, Zhizheng Wu, Boyang Gao, Frank K. Soong:
Improved Prosody Generation by Maximizing Joint Probability of State and Longer Units. 1702-1710 - Yan Jennifer Wu, Thushara D. Abhayapala:
Spatial Multizone Soundfield Reproduction: Theory and Design. 1711-1720 - Mathieu Parvaix, Laurent Girin:
Informed Source Separation of Linear Instantaneous Under-Determined Audio Mixtures by Source Index Embedding. 1721-1733 - Jacob Benesty, Constantin Paleologu, Silviu Ciochina:
On Regularization in Adaptive Filtering. 1734-1742 - Mehdi Bekrani, Andy W. H. Khong, Mojtaba Lotfizad:
A Linear Neural Network-Based Approach to Stereophonic Acoustic Echo Cancellation. 1743-1753 - Geoffroy Peeters, Hélène Papadopoulos:
Simultaneous Beat and Downbeat-Tracking Using a Probabilistic Framework: Theory and Large-Scale Evaluation. 1754-1769 - Takayuki Inoue, Hiroshi Saruwatari, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo:
Theoretical Analysis of Musical Noise in Generalized Spectral Subtraction Based on Higher Order Statistics. 1770-1779 - Ki-Seung Lee, Seok-Pil Lee:
A Relevant Distance Criterion for Interpolation of Head-Related Transfer Functions. 1780-1790 - Qi Li, Yan Huang:
An Auditory-Based Feature Extraction Algorithm for Robust Speaker Identification Under Mismatched Conditions. 1791-1801 - Pasi Saari, Tuomas Eerola, Olivier Lartillot:
Generalizability and Simplicity as Criteria in Feature Selection: Application to Mood Classification in Music. 1802-1812 - Saikat Chatterjee, W. Bastiaan Kleijn:
Auditory Model-Based Design and Optimization of Feature Vectors for Automatic Speech Recognition. 1813-1825 - Mehdi Bekrani, Andy W. H. Khong, Mojtaba Lotfizad:
A Clipping-Based Selective-Tap Adaptive Filtering Approach to Stereophonic Acoustic Echo Cancellation. 1826-1836 - Hélène Lachambre, Régine André-Obrecht, Julien Pinquier:
Distinguishing Monophonies From Polyphonies Using Weibull Bivariate Distributions. 1837-1842 - Joshua D. Reiss:
Design of Audio Parametric Equalizer Filters Directly in the Digital Domain. 1843-1848
Volume 19, Number 7, September 2011
- Guruprasad Seshadri, Bayya Yegnanarayana:
Performance of an Event-Based Instantaneous Fundamental Frequency Estimator for Distant Speech Signals. 1853-1864 - Yegui Xiao:
A New Efficient Narrowband Active Noise Control System and its Performance Analysis. 1865-1874 - Sheng-yi Kong, Lin-Shan Lee:
Semantic Analysis and Organization of Spoken Documents Based on Parameters Derived From Latent Topics. 1875-1889 - Taufiq Hasan, John H. L. Hansen:
A Study on Universal Background Model Training in Speaker Verification. 1890-1899 - Nilesh Madhu, Rainer Martin:
A Versatile Framework for Speaker Separation Using a Model-Based Speaker Localization Approach. 1900-1912 - Vikramjit Mitra, Hosung Nam, Carol Y. Espy-Wilson, Elliot Saltzman, Louis Goldstein:
Articulatory Information for Noise Robust Speech Recognition. 1913-1924 - Qiang Huang, Stephen J. Cox:
Inferring the Structure of a Tennis Game Using Audio Information. 1925-1937 - Maria E. Markaki, Yannis Stylianou:
Voice Pathology Detection and Discrimination Based on Modulation Spectral Features. 1938-1948 - Eleftheria Georganti, Tobias May, Steven van de Par, Aki Härmä, John Mourjopoulos:
Speaker Distance Detection Using a Single Microphone. 1949-1961 - Tacksung Choi, Young-Cheol Park, Dae Hee Youn, Seok-Pil Lee:
Virtual Sound Rendering in a Stereophonic Loudspeaker Setup. 1962-1974 - Gil Dobry, Ron M. Hecht, Mireille Avigal, Yaniv Zigel:
Supervector Dimension Reduction for Efficient Speaker Age Estimation Based on the Acoustic Speech Signal. 1975-1985 - Jesper Kjær Nielsen, Mads Græsbøll Christensen, Ali Taylan Cemgil, Simon J. Godsill, Søren Holdt Jensen:
Bayesian Interpolation and Parameter Estimation in a Dynamic Sinusoidal Model. 1986-1998 - Sungwoong Kim, Sungrack Yun, Chang D. Yoo:
Large Margin Discriminative Semi-Markov Model for Phonetic Recognition. 1999-2012 - Juan Pablo Bello:
Measuring Structural Similarity in Music. 2013-2025 - Iain McCowan, David Dean, Mitchell McLaren, Robert Vogt, Sridha Sridharan:
The Delta-Phase Spectrum With Application to Voice Activity Detection and Speaker Recognition. 2026-2038 - Chris Hummersone, Russell Mason, Tim Brookes:
Ideal Binary Mask Ratio: A Novel Metric for Assessing Binary-Mask-Based Sound Source Separation Algorithms. 2039-2045 - Valentin Emiya, Emmanuel Vincent, Niklas Harlander, Volker Hohmann:
Subjective and Objective Quality Assessment of Audio Source Separation. 2046-2057 - Muhammad Tahir Akhtar, Wataru Mitsuhashi:
Improving Performance of Hybrid Active Noise Control Systems for Uncorrelated Narrowband Disturbances. 2058-2066 - Jort F. Gemmeke, Tuomas Virtanen, Antti Hurmalainen:
Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition. 2067-2080 - Brian Roark, Margaret Mitchell, John-Paul Hosom, Kristy Hollingshead, Jeffrey A. Kaye:
Spoken Language Derived Measures for Detecting Mild Cognitive Impairment. 2081-2090 - Jun Du, Yu Hu, Hui Jiang:
Boosted Mixture Learning of Gaussian Mixture Hidden Markov Models Based on Maximum Likelihood for Speech Recognition. 2091-2100 - Nobutaka Ito, Hikaru Shimizu, Nobutaka Ono, Shigeki Sagayama:
Diffuse Noise Suppression Using Crystal-Shaped Microphone Arrays. 2101-2110 - Serajul Haque, Roberto Togneri, Anthony Zaknich:
An Auditory Motivated Asymmetric Compression Technique for Speech Recognition. 2111-2124 - Cees H. Taal, Richard C. Hendriks, Richard Heusdens, Jesper Jensen:
An Algorithm for Intelligibility Prediction of Time-Frequency Weighted Noisy Speech. 2125-2136 - Ryouichi Nishimura, Parham Mokhtari, Hironori Takemoto, Hiroaki Kato:
An Attempt to Calibrate Headphones for Reproduction of Sound Pressure at the Eardrum. 2137-2145 - Stefano Papetti, Federico Avanzini, Davide Rocchesso:
Numerical Methods for a Nonlinear Impact Model: A Comparative Study With Closed-Form Corrections. 2146-2158 - Mehrez Souden, Jingdong Chen, Jacob Benesty, Sofiène Affes:
An Integrated Solution for Online Multichannel Noise Tracking and Reduction. 2159-2169 - Hannu Pulakka, Paavo Alku:
Bandwidth Extension of Telephone Speech Using a Neural Network and a Filter Bank Implementation for Highband Mel Spectrum. 2170-2183 - Yi-Hsuan Yang, Homer H. Chen:
Prediction of the Distribution of Perceived Music Emotions Using Discrete Samples. 2184-2196 - Behnaz Ghoraani, Sridhar Krishnan:
Time-Frequency Matrix Feature Extraction and Classification of Environmental Audio Signals. 2197-2209 - Amitai Koretz, Joseph Tabrikian:
Maximum A Posteriori Probability Multiple-Pitch Tracking Using the Harmonic Model. 2210-2221 - Laurent Oudre, Yves Grenier, Cédric Févotte:
Chord Recognition by Fitting Rescaled Chroma Vectors to Chord Templates. 2222-2233 - Boaz Rafaely, Dima Khaykin:
Optimal Model-Based Beamforming and Independent Steering for Spherical Loudspeaker Arrays. 2234-2238 - Mads Græsbøll Christensen, Søren Holdt Jensen:
New Results on Perceptual Distortion Minimization and Nonlinear Least-Squares Frequency Estimation. 2239-2244
Volume 19, Number 8, November 2011
- Laurent Oudre, Cédric Févotte, Yves Grenier:
Probabilistic Template-Based Chord Recognition. 2249-2259 - Jacob Benesty, Jingdong Chen, Yiteng Huang:
Binaural Noise Reduction in the Time Domain With a Stereo Setup. 2260-2272 - Zengli Yang, Yahong Rosa Zheng, Steven L. Grant:
Proportionate Affine Projection Sign Algorithms for Network Echo Cancellation. 2273-2284 - Jun Du, Qiang Huo:
A Feature Compensation Approach Using High-Order Vector Taylor Series Approximation of an Explicit Distortion Model for Noisy Speech Recognition. 2285-2293 - J. Reed, C.-H. Lee:
Preference Music Ratings Prediction Using Tokenization and Minimum Classification Error Training. 2294-2303 - Han-Wen Hsu, Chi-Min Liu:
Decimation-Whitening Filter in Spectral Band Replication. 2304-2313 - Theodore Petsatodis, Christos Boukis, Fotios Talantzis, Zheng-Hua Tan, Ramjee Prasad:
Convex Combination of Multiple Statistical Models With Application to VAD. 2314-2327 - Zhaozhang Jin, DeLiang Wang:
Reverberant Speech Segregation Based on Multipitch Tracking and Classification. 2328-2337 - Dogan Can, Murat Saraclar:
Lattice Indexing for Spoken Term Detection. 2338-2347 - Mikel Peñagarikano, Amparo Varona, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Improved Modeling of Cross-Decoder Phone Co-Occurrences in SVM-Based Phonotactic Language Recognition. 2348-2363 - Trevor Burton, Rafik A. Goubran:
A Generalized Proportionate Subband Adaptive Second-Order Volterra Filter for Acoustic Echo Cancellation in Changing Environments. 2364-2373 - Yusuke Hioka, Kenta Niwa, Sumitaka Sakauchi, Ken'ichi Furuya, Youichi Haneda:
Estimating Direct-to-Reverberant Energy Ratio Using D/R Spatial Correlation Matrix Model. 2374-2384 - Cyril Joder, Slim Essid, Gaël Richard:
A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching. 2385-2397 - Richard E. Turner, Maneesh Sahani:
Demodulation as Probabilistic Inference. 2398-2411 - Giovanni L. Sicuranza, Alberto Carini:
A Generalized FLANN Filter for Nonlinear Active Noise Control. 2412-2417 - Qun Feng Tan, Panayiotis G. Georgiou, Shrikanth Narayanan:
Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End. 2418-2429 - Boaz Rafaely:
Bessel Nulls Recovery in Spherical Microphone Arrays for Time-Limited Signals. 2430-2438 - Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Suman V. Ravuri, Wen Wang:
Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features. 2439-2450 - Timothy J. Hazen:
MCE Training Techniques for Topic Identification of Spoken Audio Documents. 2451-2460 - Dong Yu, Jinyu Li, Li Deng:
Calibration of Confidence Measures in Speech Recognition. 2461-2473 - Cong Liu, Yu Hu, Li-Rong Dai, Hui Jiang:
Trust Region-Based Optimization for Maximum Mutual Information Estimation of HMMs in Speech Recognition. 2474-2485 - Simo Särkkä, Antti Huovilainen:
Accurate Discretization of Analog Audio Filters With Application to Parametric Equalizer Design. 2486-2493 - Deyi Xiong, Min Zhang, Haizhou Li:
A Maximum-Entropy Segmentation Model for Statistical Machine Translation. 2494-2505 - Magnus Berggren, Markus Borgh, Christian Schüldt, Fredric Lindström, Ingvar Claesson:
Low-Complexity Network Echo Cancellation Approach for Systems Equipped With External Memory. 2506-2515 - Leonardo O. Nunes, Luiz W. P. Biscainho, Bowon Lee, Amir Said, Ton Kalker, Ronald W. Schafer:
Degradation Type Classifier for Full Band Speech Contaminated With Echo, Broadband Noise, and Reverberation. 2516-2526 - Jorge I. Marin-Hurtado, David V. Anderson:
FFT-Based Block Processing in Speech Enhancement: Potential Artifacts and Solutions. 2527-2537 - Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard, Mathew Magimai-Doss:
Privacy-Sensitive Audio Features for Speech/Nonspeech Detection. 2538-2551 - S. R. Mahadeva Prasanna, Gayadhar Pradhan:
Significance of Vowel-Like Regions for Speaker Verification Under Degraded Conditions. 2552-2565 - Julio Vargas, Steve McLaughlin:
Speech Analysis and Synthesis Based on Dynamic Modes. 2566-2578 - Bengt Jonas Borgstrom, Abeer Alwan:
A Unified Framework for Designing Optimal STSA Estimators Assuming Maximum Likelihood Phase Equivalence of Speech and Noise. 2579-2590 - Brian King, Les Atlas:
Single-Channel Source Separation Using Complex Matrix Factorization. 2591-2597 - Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, David Nahamoo, Dimitri Kanevsky:
Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR. 2598-2613 - Huijun Ding, Ing Yann Soon, Chai Kiat Yeo:
A DCT-Based Speech Enhancement System With Pitch Synchronous Analysis. 2614-2623 - Dongwen Ying, Yonghong Yan, Jianwu Dang, Frank K. Soong:
Voice Activity Detection Based on an Unsupervised Learning Framework. 2624-2633
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.