default search action
Richard M. Stern
Person information
- affiliation: Carnegie Mellon University, Electrical and Computer Engineering, Pittsburgh, PA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i9]Hanzhi Yin, Gang Cheng, Christian J. Steinmetz, Ruibin Yuan, Richard M. Stern, Roger B. Dannenberg:
Modeling Analog Dynamic Range Compressors using Deep Learning and State-space Models. CoRR abs/2403.16331 (2024) - 2023
- [j27]Eduardo Alvarado, Nicolás Grágeda, Alejandro Luzanto, Rodrigo Mahú, Jorge Wuth, Laura Mendoza, Richard M. Stern, Néstor Becerra Yoma:
Automatic Detection of Dyspnea in Real Human-Robot Interaction Scenarios. Sensors 23(17): 7590 (2023) - [c129]Mark Lindsey, Nathaniel R. Robinson, Francis Kubala, Richard M. Stern:
Reducing the Cost of Spoof Detection Labeling using Mixed-Strategy Active Learning and Pretrained Models. ASRU 2023: 1-7 - [c128]Mark Lindsey, Tyler Vuong, Richard M. Stern:
Unsupervised Voice Type Discrimination Score Adaptation Using X-Vector Clusters. ICASSP 2023: 1-5 - [c127]Eduardo Alvarado, Nicolás Grágeda, Alejandro Luzanto, Rodrigo Mahú, Jorge Wuth, Laura Mendoza, Richard M. Stern, Néstor Becerra Yoma:
Respiratory distress estimation in human-robot interaction scenario. INTERSPEECH 2023: 1763-1767 - [i8]Mark Lindsey, Ankit Shah, Francis Kubala, Richard M. Stern:
Online Active Learning For Sound Event Detection. CoRR abs/2309.14460 (2023) - 2022
- [c126]Tyler Vuong, Richard M. Stern:
Improved Modulation-Domain Loss for Neural-Network-based Speech Enhancement. INTERSPEECH 2022: 206-210 - [c125]Tyler Vuong, Nikhil Madaan, Rohan Panda, Richard M. Stern:
Investigating the Important Temporal Modulations for Deep-Learning-Based Speech Activity Detection. SLT 2022: 525-531 - [i7]Yinghao Ma, Richard M. Stern:
Learnable Front Ends Based on Temporal Modulation for Music Tagging. CoRR abs/2211.15254 (2022) - 2021
- [c124]Tyler Vuong, Yangyang Xia, Richard M. Stern:
A Modulation-Domain Loss for Neural-Network-Based Real-Time Speech Enhancement. ICASSP 2021: 6643-6647 - [c123]Yangyang Xia, Li-Wei Chen, Alexander Rudnicky, Richard M. Stern:
Temporal Context in Speech Emotion Recognition. Interspeech 2021: 3370-3374 - [c122]Tyler Vuong, Yangyang Xia, Richard M. Stern:
The Application of Learnable STRF Kernels to the 2021 Fearless Steps Phase-03 SAD Challenge. Interspeech 2021: 4364-4368 - 2020
- [c121]Tyler Vuong, Yangyang Xia, Richard M. Stern:
Learnable Spectro-Temporal Receptive Fields for Robust Voice Type Discrimination. INTERSPEECH 2020: 1957-1961 - [i6]Jorge Wuth, Richard M. Stern, Néstor Becerra Yoma:
Non causal deep learning based dereverberation. CoRR abs/2009.02832 (2020)
2010 – 2019
- 2019
- [c120]Anjali Menon, Chanwoo Kim, Richard M. Stern:
Robust Recognition of Reverberant and Noisy Speech Using Coherence-based Processing. ICASSP 2019: 6775-6779 - [i5]José Novoa, Rodrigo Mahú, Alejandro Díaz, Jorge Wuth, Richard M. Stern, Néstor Becerra Yoma:
Weighted delay-and-sum beamforming guided by visual tracking for human-robot interaction. CoRR abs/1906.07298 (2019) - [i4]José Novoa, Josué Fredes, Jorge Wuth, Fernando Huenupán, Richard M. Stern, Néstor Becerra Yoma:
On combining features for single-channel robust speech recognition in reverberant environments. CoRR abs/1906.07299 (2019) - 2018
- [c119]Chanwoo Kim, Anjali Menon, Michiel Bacchiani, Richard M. Stern:
Sound Source Separation Using Phase Difference and Reliable Mask Selection Selection. ICASSP 2018: 5559-5563 - [c118]Yangyang Xia, Richard M. Stern:
A Priori SNR Estimation Based on a Recurrent Neural Network for Robust Speech Enhancement. INTERSPEECH 2018: 3274-3278 - [c117]Xinhui Zhou, Chiman Kwan, Bulent Ayhan, Chanwoo Kim, Kshitiz Kumar, Richard M. Stern:
A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition. ISNN 2018: 494-502 - [i3]Juan Pablo Escudero, Víctor Poblete, José Novoa, Jorge Wuth, Josué Fredes, Rodrigo Mahú, Richard M. Stern, Néstor Becerra Yoma:
Highly-Reverberant Real Environment database: HRRE. CoRR abs/1801.09651 (2018) - [i2]José Novoa, Juan Pablo Escudero, Jorge Wuth, Víctor Poblete, Simon King, Richard M. Stern, Néstor Becerra Yoma:
Exploring the robustness of features and enhancement on speech recognition systems in highly-reverberant real environments. CoRR abs/1803.09013 (2018) - [i1]Juan Pablo Escudero, José Novoa, Rodrigo Mahú, Jorge Wuth, Fernando Huenupán, Richard M. Stern, Néstor Becerra Yoma:
An improved DNN-based spectral feature mapping that removes noise and reverberation for robust automatic speech recognition. CoRR abs/1803.09016 (2018) - 2017
- [j26]Josué Fredes, José Novoa, Simon King, Richard M. Stern, Néstor Becerra Yoma:
Locally Normalized Filter Banks Applied to Deep Neural-Network-Based Robust Speech Recognition. IEEE Signal Process. Lett. 24(4): 377-381 (2017) - [j25]Fernando de-la-Calle-Silos, Richard M. Stern:
Synchrony-Based Feature Extraction for Robust Automatic Speech Recognition. IEEE Signal Process. Lett. 24(8): 1158-1162 (2017) - [c116]Anjali Menon, Chanwoo Kim, Umpei Kurokawa, Richard M. Stern:
Binaural processing for robust recognition of degraded speech. ASRU 2017: 24-31 - [c115]José Novoa, Jorge Wuth, Juan Pablo Escudero, Josué Fredes, Rodrigo Mahú, Richard M. Stern, Néstor Becerra Yoma:
Robustness Over Time-Varying Channels in DNN-HMM ASR Based Human-Robot Interaction. INTERSPEECH 2017: 839-843 - [c114]Anjali Menon, Chanwoo Kim, Richard M. Stern:
Robust Speech Recognition Based on Binaural Auditory Processing. INTERSPEECH 2017: 3872-3876 - [p3]Vikramjit Mitra, Horacio Franco, Richard M. Stern, Julien van Hout, Luciana Ferrer, Martin Graciarena, Wen Wang, Dimitra Vergyri, Abeer Alwan, John H. L. Hansen:
Robust Features in Deep-Learning-Based Speech Recognition. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 187-217 - 2016
- [j24]Byung Joon Cho, Haeyong Kwon, Ji-Won Cho, Chanwoo Kim, Richard M. Stern, Hyung-Min Park:
A Subband-Based Stationary-Component Suppression Method Using Harmonics and Power Ratio for Reverberant Speech Recognition. IEEE Signal Process. Lett. 23(6): 780-784 (2016) - [j23]Chanwoo Kim, Richard M. Stern:
Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(7): 1315-1329 (2016) - [c113]Víctor Poblete, Juan Pablo Escudero, Josué Fredes, José Novoa, Richard M. Stern, Simon King, Néstor Becerra Yoma:
The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms. INTERSPEECH 2016: 2373-2377 - [c112]Vikramjit Mitra, Julien van Hout, Wen Wang, Chris Bartels, Horacio Franco, Dimitra Vergyri, Abeer Alwan, Adam Janin, John H. L. Hansen, Richard M. Stern, Abhijeet Sangwan, Nelson Morgan:
Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech. INTERSPEECH 2016: 3683-3687 - 2015
- [j22]Víctor Poblete, Felipe Espic, Simon King, Richard M. Stern, Fernando Huenupán, Josué Fredes, Néstor Becerra Yoma:
A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification. Comput. Speech Lang. 31(1): 1-27 (2015) - [j21]Griffin D. Romigh, Douglas S. Brungart, Richard M. Stern, Brian D. Simpson:
Efficient Real Spherical Harmonic Representation of Head-Related Transfer Functions. IEEE J. Sel. Top. Signal Process. 9(5): 921-930 (2015) - [c111]Mark J. Harvilla, Richard M. Stern:
Efficient audio declipping using regularized least squares. ICASSP 2015: 221-225 - [c110]Hynek Hermansky, Lukás Burget, Jordan Cohen, Emmanuel Dupoux, Naomi Feldman, John Godfrey, Sanjeev Khudanpur, Matthew Maciejewski, Sri Harish Reddy Mallidi, Anjali Menon, Tetsuji Ogawa, Vijayaditya Peddinti, Richard C. Rose, Richard M. Stern, Matthew Wiesner, Karel Veselý:
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop. ICASSP 2015: 5009-5013 - [c109]Mark J. Harvilla, Richard M. Stern:
Robust parameter estimation for audio declipping in noise. INTERSPEECH 2015: 2459-2463 - [c108]Josué Fredes, José Novoa, Víctor Poblete, Simon King, Richard M. Stern, Néstor Becerra Yoma:
Robustness to additive noise of locally-normalized cepstral coefficients in speaker verification. INTERSPEECH 2015: 3011-3015 - [c107]Shoou-I Yu, Lu Jiang, Zhongwen Xu, Zhenzhong Lan, Shicheng Xu, Xiaojun Chang, Xuanchong Li, Zexi Mao, Chuang Gan, Yajie Miao, Xingzhong Du, Yang Cai, Lara J. Martin, Nikolas Wolfe, Anurag Kumar, Huan Li, Ming Lin, Zhigang Ma, Yi Yang, Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard M. Stern, Alexander G. Hauptmann:
CMU Informedia@TRECVID 2015: MED/SIN/LNK/SED. TRECVID 2015 - 2014
- [j20]Víctor Poblete, Néstor Becerra Yoma, Richard M. Stern:
Optimization of the parameters characterizing sigmoidal rate-level functions based on acoustic features. Speech Commun. 56: 19-34 (2014) - [c106]Amir R. Moghimi, Richard M. Stern:
An analysis of binaural spectro-temporal masking as nonlinear beamforming. ICASSP 2014: 835-839 - [c105]Mark J. Harvilla, Richard M. Stern:
Least squares signal declipping for robust speech recognition. INTERSPEECH 2014: 2073-2077 - [c104]Amir R. Moghimi, Bhiksha Raj, Richard M. Stern:
Post-masking: a hybrid approach to array processing for speech recognition. INTERSPEECH 2014: 2425-2429 - [c103]Hyung-Min Park, Matthew Maciejewski, Chanwoo Kim, Richard M. Stern:
Robust speech recognition in reverberant environments using subband-based steady-state monaural and binaural suppression. INTERSPEECH 2014: 2715-2718 - [c102]Chanwoo Kim, Kean K. Chin, Michiel Bacchiani, Richard M. Stern:
Robust speech recognition using temporal masking and thresholding algorithm. INTERSPEECH 2014: 2734-2738 - [c101]Shoou-I Yu, Lu Jiang, Zhongwen Xu, Zhenzhong Lan, Shicheng Xu, Xiaojun Chang, Xuanchong Li, Zexi Mao, Chuang Gan, Yajie Miao, Xingzhong Du, Yang Cai, Lara J. Martin, Nikolas Wolfe, Anurag Kumar, Huan Li, Ming Lin, Zhigang Ma, Yi Yang, Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard M. Stern, Alexander G. Hauptmann, Anil Armagan, Yicheng Zhao:
Informedia @ TRECVID 2014. TRECVID 2014 - 2013
- [j19]Hynek Hermansky, Jordan R. Cohen, Richard M. Stern:
Perceptual Properties of Current Speech Recognition Technology. Proc. IEEE 101(9): 1968-1985 (2013) - [c100]Víctor Poblete, Néstor Becerra Yoma, Richard M. Stern:
Optimization of sigmoidal rate-level function based on acoustic features. INTERSPEECH 2013: 896-900 - [c99]Zhenzhong Lan, Lu Jiang, Shoou-I Yu, Chenqiang Gao, Shourabh Rawat, Yang Cai, Shicheng Xu, Haoquan Shen, Xuanchong Li, Yipei Wang, Waito Sze, Yan Yan, Zhigang Ma, Nicolas Ballas, Deyu Meng, Wei Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Richard M. Stern, Teruko Mitamura, Eric Nyberg, Alexander G. Hauptmann:
Informedia@TRECVID 2013. TRECVID 2013 - 2012
- [j18]Richard M. Stern, Nelson Morgan:
Hearing Is Believing: Biologically Inspired Methods for Robust Automatic Speech Recognition. IEEE Signal Process. Mag. 29(6): 34-43 (2012) - [j17]Yu-Hsiang Bosco Chiu, Bhiksha Raj, Richard M. Stern:
Learning-Based Auditory Encoding for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 20(3): 900-914 (2012) - [c98]Chanwoo Kim, Richard M. Stern:
Power-Normalized Cepstral Coefficients (PNCC) for robust speech recognition. ICASSP 2012: 4101-4104 - [c97]Chanwoo Kim, Charbel El Khawand, Richard M. Stern:
Two-microphone source separation algorithm based on statistical modeling of angle distributions. ICASSP 2012: 4629-4632 - [c96]Mark Harvilla, Richard M. Stern:
Histogram-based subband powerwarping and spectral averaging for robust speech recognition under matched and multistyle training. ICASSP 2012: 4697-4700 - [c95]L. Paola García-Perera, Juan Arturo Nolazco-Flores, Bhiksha Raj, Richard M. Stern:
Optimization of the DET curve in speaker verification. SLT 2012: 318-323 - [c94]Shoou-I Yu, Zhongwen Xu, Duo Ding, Waito Sze, Francisco Vicente, Zhenzhong Lan, Yang Cai, Shourabh Rawat, Peter F. Schulam, Nisarga Markandaiah, Sohail Bahmani, Antonio Juárez, Wei Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Richard M. Stern, Teruko Mitamura, Eric Nyberg, Lu Jiang, Qiang Chen, Lisa M. Brown, Ankur Datta, Quanfu Fan, Rogério Schmidt Feris, Shuicheng Yan, Alexander G. Hauptmann, Sharath Pankanti:
Informedia @TRECVID 2012. TRECVID 2012 - [p2]Richard M. Stern, Nelson Morgan:
Features Based on Auditory Physiology and Perception. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 193-227 - 2011
- [j16]Wooil Kim, Richard M. Stern:
Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise. Speech Commun. 53(1): 1-11 (2011) - [c93]Kshitiz Kumar, Rita Singh, Bhiksha Raj, Richard M. Stern:
Gammatone sub-band magnitude-domain dereverberation for ASR. ICASSP 2011: 4604-4607 - [c92]Kshitiz Kumar, Chanwoo Kim, Richard M. Stern:
Delta-spectral cepstral coefficients for robust speech recognition. ICASSP 2011: 4784-4787 - [c91]Chanwoo Kim, Kshitiz Kumar, Richard M. Stern:
Binaural sound source separation motivated by auditory processing. ICASSP 2011: 5072-5075 - [c90]Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard M. Stern:
An iterative least-squares technique for dereverberation. ICASSP 2011: 5488-5491 - 2010
- [c89]Ziad Al Bawab, Bhiksha Raj, Richard M. Stern:
A hybrid physical and statistical dynamic articulatory framework incorporating analysis-by-synthesis for improved phone classification. ICASSP 2010: 4194-4197 - [c88]Yu-Hsiang Bosco Chiu, Bhiksha Raj, Richard M. Stern:
Learning-based auditory encoding for robust speech recognition. ICASSP 2010: 4278-4281 - [c87]Kshitiz Kumar, Richard M. Stern:
Maximum-likelihood-based cepstral inverse filtering for blind speech dereverberation. ICASSP 2010: 4282-4285 - [c86]Chanwoo Kim, Richard M. Stern:
Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring. ICASSP 2010: 4574-4577 - [c85]Chanwoo Kim, Richard M. Stern, Kiwan Eom, Jaewon Lee:
Automatic selection of thresholds for signal separation algorithms based on interaural delay. INTERSPEECH 2010: 729-732 - [c84]Chanwoo Kim, Richard M. Stern:
Nonlinear enhancement of onset for robust speech recognition. INTERSPEECH 2010: 2058-2061
2000 – 2009
- 2009
- [j15]Hyung-Min Park, Richard M. Stern:
Spatial separation of speech signals using amplitude estimation based on interaural comparisons of zero-crossings. Speech Commun. 51(1): 15-25 (2009) - [c83]Chanwoo Kim, Richard M. Stern:
Power function-based power distribution normalization algorithm for robust speech recognition. ASRU 2009: 188-193 - [c82]Chanwoo Kim, Kshitiz Kumar, Richard M. Stern:
Robust speech recognition using a Small Power Boosting algorithm. ASRU 2009: 243-248 - [c81]Yu-Hsiang Bosco Chiu, Richard M. Stern:
Minimum variance modulation filter for robust speech recognition. ICASSP 2009: 3917-3920 - [c80]Chanwoo Kim, Richard M. Stern:
Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction. INTERSPEECH 2009: 28-31 - [c79]Yu-Hsiang Bosco Chiu, Bhiksha Raj, Richard M. Stern:
Towards fusion of feature extraction and acoustic model training: a top down process for robust speech recognition. INTERSPEECH 2009: 32-35 - [c78]Luis Buera, Antonio Miguel, Alfonso Ortega, Eduardo Lleida, Richard M. Stern:
Unsupervised training scheme with non-stereo data for empirical feature vector compensation. INTERSPEECH 2009: 1247-1250 - [c77]Ziad Al Bawab, Lorenzo Turicchia, Richard M. Stern, Bhiksha Raj:
Deriving vocal tract shapes from electromagnetic articulograph data via geometric adaptation and matching. INTERSPEECH 2009: 2051-2054 - [c76]Chanwoo Kim, Kshitiz Kumar, Bhiksha Raj, Richard M. Stern:
Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain. INTERSPEECH 2009: 2495-2498 - [c75]Lingyun Gu, Richard M. Stern:
Speaker segmentation and clustering for simultaneously presented speech. INTERSPEECH 2009: 2551-2554 - 2008
- [c74]Lingyun Gu, Richard M. Stern:
Single-channel speech separation based on modulation frequency. ICASSP 2008: 25-28 - [c73]Kshitiz Kumar, Richard M. Stern:
Environment-invariant compensation for reverberation using linear post-filtering for minimum distortion. ICASSP 2008: 4121-4124 - [c72]Ziad Al Bawab, Bhiksha Raj, Richard M. Stern:
Analysis-by-synthesis features for speech recognition. ICASSP 2008: 4185-4188 - [c71]Yu-Hsiang Bosco Chiu, Richard M. Stern:
Analysis of physiologically-motivated signal processing for robust speech recognition. INTERSPEECH 2008: 1000-1003 - [c70]Chanwoo Kim, Richard M. Stern:
Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis. INTERSPEECH 2008: 2598-2601 - 2007
- [c69]Hyung-Min Park, Richard M. Stern:
Missing Feature Speech Recognition using Dereverberation and Echo Suppression in Reverberant Environments. ICASSP (4) 2007: 381-384 - [c68]Kshitiz Kumar, Tsuhan Chen, Richard M. Stern:
Profile View Lip Reading. ICASSP (4) 2007: 429-432 - [c67]Richard M. Stern, Evandro B. Gouvêa, Govindarajan Thattai:
"polyaural" array processing for automatic speech recognition in degraded environments. INTERSPEECH 2007: 926-929 - 2006
- [j14]Michael L. Seltzer, Richard M. Stern:
Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments. IEEE Trans. Speech Audio Process. 14(6): 2109-2121 (2006) - [c66]Wooil Kim, Richard M. Stern:
Band-Independent Mask Estimation for Missing-Feature Reconstruction in the Presence of Unknown Background Noise. ICASSP (1) 2006: 305-308 - [c65]Hyung-Min Park, Richard M. Stern:
Spatial Separation of Speech Signals Using Continuously-Variable Masks Estimated From Comparisons of Zero Crossings. ICASSP (4) 2006: 1165-1168 - [c64]Yunbin Deng, Xiaokun Li, Chiman Kwan, Roger Xu, Bhiksha Raj, Richard M. Stern, David Williamson:
An integrated approach to improve speech recognition rate for non-native speakers. INTERSPEECH 2006 - [c63]Chanwoo Kim, Yu-Hsiang Bosco Chiu, Richard M. Stern:
Physiologically-motivated synchrony-based processing for robust automatic speech recognition. INTERSPEECH 2006 - [c62]Narayanaswamy Balakrishnan, Rashmi Gangadharaiah, Richard M. Stern:
Voting for two speaker segmentation. INTERSPEECH 2006 - 2005
- [j13]Nam Soo Kim, Woohyung Lim, Richard M. Stern:
Feature compensation based on switching linear dynamic model. IEEE Signal Process. Lett. 12(6): 473-476 (2005) - [j12]Bhiksha Raj, Richard M. Stern:
Missing-feature approaches in speech recognition. IEEE Signal Process. Mag. 22(5): 101-116 (2005) - [c61]Wooil Kim, Richard M. Stern, Hanseok Ko:
Environment-independent mask estimation for missing-feature reconstruction. INTERSPEECH 2005: 2637-2640 - [c60]Chiman Kwan, Xiaokun Li, Debang Lao, Yunbin Deng, Zhubing Ren, Bhiksha Raj, Rita Singh, Richard M. Stern:
Voice driven applications in non-stationary and chaotic environment. ROBIO 2005: 127-132 - [p1]Richard M. Stern:
Signal Separation Motivated by Human Auditory Perception: Applications to Automatic Speech Recognition. Speech Separation by Humans and Machines 2005: 135-154 - 2004
- [j11]Yasunari Obuchi, Nobuo Hataoka, Richard M. Stern:
Normalization of Time-Derivative Parameters for Robust Speech Recognition in Small Devices. IEICE Trans. Inf. Syst. 87-D(4): 1004-1011 (2004) - [j10]Bhiksha Raj, Michael L. Seltzer, Richard M. Stern:
Reconstruction of missing features for robust speech recognition. Speech Commun. 43(4): 275-296 (2004) - [j9]Michael L. Seltzer, Bhiksha Raj, Richard M. Stern:
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition. Speech Commun. 43(4): 379-393 (2004) - [j8]Michael L. Seltzer, Bhiksha Raj, Richard M. Stern:
Likelihood-maximizing beamforming for robust hands-free speech recognition. IEEE Trans. Speech Audio Process. 12(5): 489-498 (2004) - [c59]Xiang Li, Richard M. Stern:
Feature generation based on maximum normalized acoustic likelihood for improved speech recognition. ICASSP (1) 2004: 545-548 - [c58]Michael L. Seltzer, Richard M. Stern:
Parameter sharing in subband likelihood-maximizing beamforming for speech recognition using microphone arrays. ICASSP (1) 2004: 881-884 - [c57]Bhiksha Raj, Rita Singh, Richard M. Stern:
On tracking noise with linear dynamical system models. ICASSP (1) 2004: 965-968 - [c56]Xiang Li, Richard M. Stern:
Parallel feature generation based on maximizing normalized acoustic likelihood. INTERSPEECH 2004: 953-956 - [c55]Luis R. Salgado-Garza, Richard M. Stern, Juan Arturo Nolazco-Flores:
N-Best List Rescoring Using Syntactic Trigrams. MICAI 2004: 79-88 - 2003
- [c54]Michael L. Seltzer, Richard M. Stern:
Subband parameter optimization of microphone arrays for speech recognition in reverberant environments. ICASSP (1) 2003: 408-411 - [c53]Xiang Li, Richard M. Stern:
Training of stream weights for the decoding of speech using parallel feature streams. ICASSP (1) 2003: 832-835 - [c52]Yasunari Obuchi, Richard M. Stern:
Normalization of time-derivative parameters using histogram equalization. INTERSPEECH 2003: 665-668 - [c51]Xiang Li, Richard M. Stern:
Feature generation based on maximum classification probability for improved speech recognition. INTERSPEECH 2003: 845-848 - [c50]Jon P. Nedel, Richard M. Stern:
Duration normalization and hypothesis combination for improved spontaneous speech recognition. INTERSPEECH 2003: 1509-1512 - 2002
- [j7]Rita Singh, Bhiksha Raj, Richard M. Stern:
Automatic generation of subword units for speech recognition systems. IEEE Trans. Speech Audio Process. 10(2): 89-99 (2002) - [c49]Michael L. Seltzer, Bhiksha Raj, Richard M. Stern:
Speech recognizer-based microphone array processing for robust hands-free speech recognition. ICASSP 2002: 897-900 - [c48]Xiang Li, Rita Singh, Richard M. Stern:
Combining search spaces of heterogeneous recognizers for improved speech recogniton. INTERSPEECH 2002: 405-408 - 2001
- [j6]Juan M. Huerta, Richard M. Stern:
Distortion-class modeling for robust speech recognition under GSM RPE-LTP coding. Speech Commun. 34(1-2): 213-225 (2001) - [c47]Rita Singh, Michael L. Seltzer, Bhiksha Raj, Richard M. Stern:
Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination. ICASSP 2001: 273-276 - [c46]Jon P. Nedel, Richard M. Stern:
Duration normalization for improved recognition of spontaneous and read speech via missing feature methods. ICASSP 2001: 313-316 - 2000
- [c45]Sam-Joo Doh, Richard M. Stern:
Inter-class MLLR for speaker adaptation. ICASSP 2000: 1543-1546 - [c44]Rita Singh, Bhiksha Raj, Richard M. Stern:
Automatic generation of phone sets and lexical transcriptions. ICASSP 2000: 1691-1694 - [c43]Rita Singh, Bhiksha Raj, Richard M. Stern:
Structured redefinition of sound units by merging and splitting for improved speech recognition. INTERSPEECH 2000: 151-154 - [c42]Bhiksha Raj, Michael L. Seltzer, Richard M. Stern:
Reconstruction of damaged spectrographic features for robust speech recognition. INTERSPEECH 2000: 357-360 - [c41]Michael L. Seltzer, Bhiksha Raj, Richard M. Stern:
Classifier-based mask estimation for missing feature methods of robust speech recognition. INTERSPEECH 2000: 538-541 - [c40]Sam-Joo Doh, Richard M. Stern:
Using class weighting in inter-class MLLR. INTERSPEECH 2000: 560-563 - [c39]Jon P. Nedel, Rita Singh, Richard M. Stern:
Phone transition acoustic modeling: application to speaker independent and spontaneous speech systems. INTERSPEECH 2000: 572-575 - [c38]Jon P. Nedel, Rita Singh, Richard M. Stern:
Automatic subword unit refinement for spontaneous speech recognition via phone splitting. INTERSPEECH 2000: 588-591 - [c37]Juan M. Huerta, Richard M. Stern:
Instantaneous-distortion based weighted acoustic modeling for robust recognition of coded speech. INTERSPEECH 2000: 842-845
1990 – 1999
- 1999
- [c36]Rita Singh, Bhiksha Raj, Richard M. Stern:
Automatic clustering and generation of contextual questions for tied states in hidden Markov models. ICASSP 1999: 117-120 - [c35]Rita Singh, Bhiksha Raj, Richard M. Stern:
Domain adduced state tying for cross-domain acoustic modelling. EUROSPEECH 1999: 1707-1710 - 1998
- [j5]Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
Data-driven environmental compensation for speech recognition: A unified approach. Speech Commun. 24(4): 267-285 (1998) - [c34]Juan M. Huerta, Richard M. Stern:
Speech recognition from GSM codec parameters. ICSLP 1998 - [c33]Bhiksha Raj, Rita Singh, Richard M. Stern:
Inference of missing spectrographic features for robust speech recognition. ICSLP 1998 - 1997
- [c32]Bhiksha Raj, Vipul N. Parikh, Richard M. Stern:
The effects of background music on speech recognition accuracy. ICASSP 1997: 851-854 - [c31]Evandro B. Gouvêa, Richard M. Stern:
Speaker normalization through formant-based warping of the frequency scale. EUROSPEECH 1997: 1139-1142 - [c30]Juan M. Huerta, Richard M. Stern:
Compensation for environmental and speaker variability by normalization of pole locations. EUROSPEECH 1997: 1147-1150 - 1996
- [c29]Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
A vector Taylor series approach for environment-independent speech recognition. ICASSP 1996: 733-736 - [c28]Bhiksha Raj, Evandro Bacci Gouvêa, Pedro J. Moreno, Richard M. Stern:
Cepstral compensation by polynomial approximation for environment-independent speech recognition. ICSLP 1996: 2340-2343 - 1995
- [c27]Pedro J. Moreno, Bhiksha Raj, Evandro B. Gouvêa, Richard M. Stern:
Multivariate-Gaussian-based cepstral normalization for robust speech recognition. ICASSP 1995: 137-140 - [c26]Matthew A. Siegler, Richard M. Stern:
On the effects of speech rate in large vocabulary speech recognition systems. ICASSP 1995: 612-615 - [c25]Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
A unified approach for robust speech recognition. EUROSPEECH 1995: 481-484 - 1994
- [c24]Fu-Hua Liu, Richard M. Stern, Alejandro Acero, Pedro J. Moreno:
Environment normalization for robust speech recognition using direct cepstral comparison. ICASSP (2) 1994: 61-64 - [c23]Pedro J. Moreno, Richard M. Stern:
Sources of degradation of speech recognition in the telephone network. ICASSP (1) 1994: 109-112 - [c22]Richard M. Stern, Fu-Hua Liu, Pedro J. Moreno, Alejandro Acero:
Signal processing for robust speech recognition. ICSLP 1994: 1027-1030 - [c21]Nobutoshi Hanai, Richard M. Stern:
Robust speech recognition in the automobile. ICSLP 1994: 1339-1342 - [c20]Yoshiaki Ohshima, Richard M. Stern:
Environmental robustness in automatic speech recognition using physiologic ally-motivated signal processing. ICSLP 1994: 1347-1350 - [c19]Fu-Hua Liu, Pedro J. Moreno, Richard M. Stern, Alejandro Acero:
Signal Processing for Robust Speech Recognition. HLT 1994 - 1993
- [c18]Thomas M. Sullivan, Richard M. Stern:
Multi-microphone correlation-based processing for robust speech recognition. ICASSP (2) 1993: 91-94 - [c17]Fu-Hua Liu, Richard M. Stern, Xuedong Huang, Alejandro Acero:
Efficient Cepstral Normalization For Robust Speech Recognition. HLT 1993 - 1992
- [c16]Fu-Hua Liu, Alejandro Acero, Richard M. Stern:
Efficient joint compensation of speech for the effects of additive noise and linear filtering. ICASSP 1992: 257-260 - [c15]Richard M. Stern, Fu-Hua Liu, Yoshiaki Ohshima, Thomas M. Sullivan, Alejandro Acero:
Multiple approaches to robust speech recognition. ICSLP 1992: 695-698 - [c14]Richard M. Stern, Fu-Hua Liu, Yoshiaki Ohshima, Thomas M. Sullivan, Alejandro Acero:
Multiple Approaches to Robust Speech Recognition. HLT 1992 - [c13]Wayne H. Ward, Sunil lssar, Xuedong Huang, Hsiao-Wuen Hon, Mei-Yuh Hwang, Sheryl Young, Michael Matessa, Fu-Hua Liu, Richard M. Stern:
Speech Understanding in Open Tasks. HLT 1992 - 1991
- [c12]William A. Rozzi, Richard M. Stern:
Speaker adaptation in continuous speech recognition via estimation of correlated mean vectors. ICASSP 1991: 865-868 - [c11]Alejandro Acero, Richard M. Stern:
Robust speech recognition by normalization of the acoustic space. ICASSP 1991: 893-896 - 1990
- [j4]Douglas A. Coast, Richard M. Stern, Gerald G. Cano, Stanley A. Briller:
An approach to cardiac arrhythmia analysis using hidden Markov models. IEEE Trans. Biomed. Eng. 37(9): 826-836 (1990) - [c10]Alejandro Acero, Richard M. Stern:
Environmental robustness in automatic speech recognition. ICASSP 1990: 849-852 - [c9]Alejandro Acero, Richard M. Stern:
Acoustical pre-processing for robust spoken language systems. ICSLP 1990: 1121-1124 - [c8]Alejandro Acero, Richard M. Stern:
Towards Environment-Independent Spoken Language Systems. HLT 1990 - [c7]Richard M. Stern:
Overview of the Third DARPA Speech and Natural Language Workshop. HLT 1990
1980 – 1989
- 1989
- [c6]Richard M. Stern, Alejandro Acero:
ACOUSTICAL PRE-PROCESSING FOR ROBUST SPEECH RECOGNITION. HLT (2) 1989 - 1988
- [c5]Wayne H. Ward, Alexander G. Hauptmann, Richard M. Stern, Thomas Chanak:
Parsing spoken phrases despite missing words. ICASSP 1988: 275-278 - 1987
- [j3]Richard M. Stern, Moshé J. Lasry:
Dynamic speaker adaptation for feature-based isolated word recognition. IEEE Trans. Acoust. Speech Signal Process. 35(6): 751-763 (1987) - [c4]Richard M. Stern, Wayne H. Ward, Alexander G. Hauptmann, Juan Leon:
Sentence parsing with weak grammatical constraints. ICASSP 1987: 380-383 - 1984
- [j2]James L. Crowley, Richard M. Stern:
Fast Computation of the Difference of Low-Pass Transform. IEEE Trans. Pattern Anal. Mach. Intell. 6(2): 212-222 (1984) - [j1]Moshé J. Lasry, Richard M. Stern:
A Posteriori Estimation of Correlated Jointly Gaussian Mean Vectors. IEEE Trans. Pattern Anal. Mach. Intell. 6(4): 530-535 (1984) - [c3]Moshé J. Lasry, Richard M. Stern:
Unsupervised adaptation to new speakers in feature-based letter recognition. ICASSP 1984: 21-24 - 1983
- [c2]Ronald A. Cole, Richard M. Stern, Michael S. Phillips, Scott M. Brill, Andrew P. Pilant, Philippe Specker:
Feature-based speaker-independent recognition of isolated english letters. ICASSP 1983: 731-733 - [c1]Richard M. Stern, Moshé J. Lasry:
Dynamic speaker adaptation for isolated letter recognition using MAP estimation. ICASSP 1983: 734-737
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-26 01:51 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint