default search action
Bhuvana Ramabhadran
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c211]Takaaki Saeki, Gary Wang, Nobuyuki Morioka, Isaac Elias, Kyle Kastner, Andrew Rosenberg, Bhuvana Ramabhadran, Heiga Zen, Françoise Beaufays, Hadar Shemtov:
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data. ICASSP 2024: 11546-11550 - [c210]Gowtham Ramesh, Kartik Audhkhasi, Bhuvana Ramabhadran:
Task Vector Algebra for ASR Models. ICASSP 2024: 12256-12260 - [i44]Takaaki Saeki, Gary Wang, Nobuyuki Morioka, Isaac Elias, Kyle Kastner, Andrew Rosenberg, Bhuvana Ramabhadran, Heiga Zen, Françoise Beaufays, Hadar Shemtov:
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data. CoRR abs/2402.18932 (2024) - [i43]Zhong Meng, Zelin Wu, Rohit Prabhavalkar, Cal Peyser, Weiran Wang, Nanxin Chen, Tara N. Sainath, Bhuvana Ramabhadran:
Text Injection for Neural Contextual Biasing. CoRR abs/2406.02921 (2024) - [i42]Neeraj Gaur, Rohan Agrawal, Gary Wang, Parisa Haghani, Andrew Rosenberg, Bhuvana Ramabhadran:
ASTRA: Aligning Speech and Text Representations for Asr without Sampling. CoRR abs/2406.06664 (2024) - [i41]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Neeraj Gaur, Zhong Meng:
Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions. CoRR abs/2406.14701 (2024) - [i40]Bolaji Yusuf, Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran:
Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. CoRR abs/2407.04641 (2024) - [i39]Shikhar Vashishth, Harman Singh, Shikhar Bharadwaj, Sriram Ganapathy, Chulayuth Asawaroengchai, Kartik Audhkhasi, Andrew Rosenberg, Ankur Bapna, Bhuvana Ramabhadran:
STAB: Speech Tokenizer Assessment Benchmark. CoRR abs/2409.02384 (2024) - [i38]Fadi Biadsy, Youzheng Chen, Isaac Elias, Kyle Kastner, Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran:
Zero-shot Cross-lingual Voice Transfer for TTS. CoRR abs/2409.13910 (2024) - [i37]Christopher Richardson, Roshan Sharma, Neeraj Gaur, Parisa Haghani, Anirudh Sundar, Bhuvana Ramabhadran:
Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking. CoRR abs/2411.00150 (2024) - 2023
- [j19]Dong Yu, Yifan Gong, Michael A. Picheny, Bhuvana Ramabhadran, Dilek Hakkani-Tür, Rohit Prasad, Heiga Zen, Jan Skoglund, Jan Honza Cernocký, Lukás Burget, Abdelrahman Mohamed:
Twenty-Five Years of Evolution in Speech and Language Processing. IEEE Signal Process. Mag. 40(5): 27-39 (2023) - [c209]Yosuke Higuchi, Andrew Rosenberg, Yuan Wang, Murali Karthick Baskar, Bhuvana Ramabhadran:
Mask-Conformer: Augmenting Conformer with Mask-Predict Decoder. ASRU 2023: 1-8 - [c208]Kartik Audhkhasi, Brian Farris, Bhuvana Ramabhadran, Pedro J. Moreno:
Modular Conformer Training for Flexible End-to-End ASR. ICASSP 2023: 1-5 - [c207]Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel S. Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley:
Large-Scale Language Model Rescoring on Long-Form Data. ICASSP 2023: 1-5 - [c206]Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. ICASSP 2023: 1-5 - [c205]Takaaki Saeki, Heiga Zen, Zhehuai Chen, Nobuyuki Morioka, Gary Wang, Yu Zhang, Ankur Bapna, Andrew Rosenberg, Bhuvana Ramabhadran:
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech. ICASSP 2023: 1-5 - [c204]Gary Wang, Kyle Kastner, Ankur Bapna, Zhehuai Chen, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang:
Understanding Shared Speech-Text Representations. ICASSP 2023: 1-5 - [c203]Mohammad Zeineldeen, Kartik Audhkhasi, Murali Karthick Baskar, Bhuvana Ramabhadran:
Robust Knowledge Distillation from RNN-T Models with Noisy Training Labels Using Full-Sum Loss. ICASSP 2023: 1-5 - [c202]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Kartik Audhkhasi:
O-1: Self-training with Oracle and 1-best Hypothesis. INTERSPEECH 2023: 77-81 - [c201]Yochai Blau, Rohan Agrawal, Lior Madmony, Gary Wang, Andrew Rosenberg, Zhehuai Chen, Zorik Gekhman, Genady Beryozkin, Parisa Haghani, Bhuvana Ramabhadran:
Using Text Injection to Improve Recognition of Personal Identifiers in Speech. INTERSPEECH 2023: 191-195 - [i36]Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. CoRR abs/2302.08583 (2023) - [i35]Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023) - [i34]Mohammad Zeineldeen, Kartik Audhkhasi, Murali Karthick Baskar, Bhuvana Ramabhadran:
Robust Knowledge Distillation from RNN-T Models With Noisy Training Labels Using Full-Sum Loss. CoRR abs/2303.05958 (2023) - [i33]Gary Wang, Kyle Kastner, Ankur Bapna, Zhehuai Chen, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang:
Understanding Shared Speech-Text Representations. CoRR abs/2304.14514 (2023) - [i32]Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel S. Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley:
Large-scale Language Model Rescoring on Long-form Data. CoRR abs/2306.08133 (2023) - [i31]Yochai Blau, Rohan Agrawal, Lior Madmony, Gary Wang, Andrew Rosenberg, Zhehuai Chen, Zorik Gekhman, Genady Beryozkin, Parisa Haghani, Bhuvana Ramabhadran:
Using Text Injection to Improve Recognition of Personal Identifiers in Speech. CoRR abs/2308.07393 (2023) - [i30]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Kartik Audhkhasi:
O-1: Self-training with Oracle and 1-best Hypothesis. CoRR abs/2308.07486 (2023) - 2022
- [j18]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro J. Moreno:
Ask2Mask: Guided Data Selection for Masked Speech Modeling. IEEE J. Sel. Top. Signal Process. 16(6): 1357-1366 (2022) - [j17]Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. IEEE J. Sel. Top. Signal Process. 16(6): 1519-1532 (2022) - [c200]Neeraj Gaur, Tongzhou Chen, Ehsan Variani, Parisa Haghani, Bhuvana Ramabhadran, Pedro J. Moreno:
Multilingual Second-Pass Rescoring for Automatic Speech Recognition Systems. ICASSP 2022: 6407-6411 - [c199]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Gary Wang:
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses. ICASSP 2022: 7677-7681 - [c198]Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno:
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition. INTERSPEECH 2022: 1026-1030 - [c197]Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. INTERSPEECH 2022: 1031-1035 - [c196]Ehsan Variani, Michael Riley, David Rybach, Cyril Allauzen, Tongzhou Chen, Bhuvana Ramabhadran:
On Adaptive Weight Interpolation of the Hybrid Autoregressive Transducer. INTERSPEECH 2022: 1646-1650 - [c195]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Nicolás Serrano:
Reducing Domain mismatch in Self-supervised speech pre-training. INTERSPEECH 2022: 3028-3032 - [c194]Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Jesse Emond, Yinghui Huang, Pedro J. Moreno:
Non-Parallel Voice Conversion for ASR Augmentation. INTERSPEECH 2022: 3408-3412 - [c193]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Ankur Bapna, Heiga Zen:
MAESTRO: Matched Speech Text Representations through Modality Matching. INTERSPEECH 2022: 4093-4097 - [c192]Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park:
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. SLT 2022: 23-30 - [c191]Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno, Nanxin Chen:
Maestro-U: Leveraging Joint Speech-Text Representation Learning for Zero Supervised Speech ASR. SLT 2022: 68-75 - [c190]Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. SLT 2022: 197-204 - [i29]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro J. Moreno:
Ask2Mask: Guided Data Selection for Masked Speech Modeling. CoRR abs/2202.12719 (2022) - [i28]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Ankur Bapna, Heiga Zen:
MAESTRO: Matched Speech Text Representations through Modality Matching. CoRR abs/2204.03409 (2022) - [i27]Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. CoRR abs/2204.07553 (2022) - [i26]Alëna Aksënova, Zhehuai Chen, Chung-Cheng Chiu, Daan van Esch, Pavel Golik, Wei Han, Levi King, Bhuvana Ramabhadran, Andrew Rosenberg, Suzan Schwartz, Gary Wang:
Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data. CoRR abs/2205.08014 (2022) - [i25]Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno:
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition. CoRR abs/2209.06096 (2022) - [i24]Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Yinghui Huang, Jesse Emond, Pedro Moreno Mengibar:
Non-Parallel Voice Conversion for ASR Augmentation. CoRR abs/2209.06987 (2022) - [i23]Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno, Nanxin Chen:
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR. CoRR abs/2210.10027 (2022) - [i22]Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park:
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. CoRR abs/2210.10879 (2022) - [i21]Takaaki Saeki, Heiga Zen, Zhehuai Chen, Nobuyuki Morioka, Gary Wang, Yu Zhang, Ankur Bapna, Andrew Rosenberg, Bhuvana Ramabhadran:
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech. CoRR abs/2210.15447 (2022) - [i20]Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. CoRR abs/2210.17049 (2022) - 2021
- [c189]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro J. Moreno:
Injecting Text in Self-Supervised Speech Pretraining. ASRU 2021: 251-258 - [c188]Hainan Xu, Yinghui Huang, Yun Zhu, Kartik Audhkhasi, Bhuvana Ramabhadran:
Convolutional Dropout and Wordpiece Augmentation for End-to-End Speech Recognition. ICASSP 2021: 5984-5988 - [c187]Neeraj Gaur, Brian Farris, Parisa Haghani, Isabel Leal, Pedro J. Moreno, Manasa Prasad, Bhuvana Ramabhadran, Yun Zhu:
Mixture of Informed Experts for Multilingual Speech Recognition. ICASSP 2021: 6234-6238 - [c186]Rohan Doshi, Youzheng Chen, Liyang Jiang, Xia Zhang, Fadi Biadsy, Bhuvana Ramabhadran, Fang Chu, Andrew Rosenberg, Pedro J. Moreno:
Extending Parrotron: An End-to-End, Speech Conversion and Speech Recognition Model for Atypical Speech. ICASSP 2021: 6988-6992 - [c185]Zhehuai Chen, Andrew Rosenberg, Yu Zhang, Heiga Zen, Mohammadreza Ghodsi, Yinghui Huang, Jesse Emond, Gary Wang, Bhuvana Ramabhadran, Pedro J. Moreno:
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation. Interspeech 2021: 736-740 - [c184]Kartik Audhkhasi, Tongzhou Chen, Bhuvana Ramabhadran, Pedro J. Moreno:
Mixture Model Attention: Flexible Streaming and Non-Streaming Automatic Speech Recognition. Interspeech 2021: 1812-1816 - [c183]Isabel Leal, Neeraj Gaur, Parisa Haghani, Brian Farris, Pedro J. Moreno, Manasa Prasad, Bhuvana Ramabhadran, Yun Zhu:
Self-Adaptive Distillation for Multilingual Speech Recognition: Leveraging Student Independence. Interspeech 2021: 2556-2560 - [c182]Hainan Xu, Kartik Audhkhasi, Yinghui Huang, Jesse Emond, Bhuvana Ramabhadran:
Regularizing Word Segmentation by Creating Misspellings. Interspeech 2021: 2561-2565 - [c181]Zhehuai Chen, Bhuvana Ramabhadran, Fadi Biadsy, Xia Zhang, Youzheng Chen, Liyang Jiang, Fang Chu, Rohan Doshi, Pedro J. Moreno:
Conformer Parrotron: A Faster and Stronger End-to-End Speech Conversion and Recognition Model for Atypical Speech. Interspeech 2021: 4828-4832 - [i19]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro J. Moreno:
Injecting Text in Self-Supervised Speech Pretraining. CoRR abs/2108.12226 (2021) - [i18]Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. CoRR abs/2109.13226 (2021) - 2020
- [c180]Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu:
Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior. ICASSP 2020: 6699-6703 - [c179]Gary Wang, Andrew Rosenberg, Zhehuai Chen, Yu Zhang, Bhuvana Ramabhadran, Yonghui Wu, Pedro J. Moreno:
Improving Speech Recognition Using Consistent Predictions on Synthesized Speech. ICASSP 2020: 7029-7033 - [c178]Ehsan Variani, Tongzhou Chen, James Apfel, Bhuvana Ramabhadran, Seungji Lee, Pedro J. Moreno:
Neural Oracle Search on N-BEST Hypotheses. ICASSP 2020: 7824-7828 - [c177]Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Anjuli Kannan, Brian Roark:
Language-Agnostic Multilingual Modeling. ICASSP 2020: 8239-8243 - [c176]Zhehuai Chen, Andrew Rosenberg, Yu Zhang, Gary Wang, Bhuvana Ramabhadran, Pedro J. Moreno:
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection. INTERSPEECH 2020: 556-560 - [c175]Gary Wang, Andrew Rosenberg, Zhehuai Chen, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno:
SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR. INTERSPEECH 2020: 2832-2836 - [c174]Yun Zhu, Parisa Haghani, Anshuman Tripathi, Bhuvana Ramabhadran, Brian Farris, Hainan Xu, Han Lu, Hasim Sak, Isabel Leal, Neeraj Gaur, Pedro J. Moreno, Qian Zhang:
Multilingual Speech Recognition with Self-Attention Structured Parameterization. INTERSPEECH 2020: 4741-4745 - [i17]Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu:
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior. CoRR abs/2002.03788 (2020) - [i16]Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Anjuli Kannan, Brian Roark:
Language-agnostic Multilingual Modeling. CoRR abs/2004.09571 (2020) - [i15]Arindrima Datta, Guanlong Zhao, Bhuvana Ramabhadran, Eugene Weinstein:
LSTM Acoustic Models Learn to Align and Pronounce with Graphemes. CoRR abs/2008.06121 (2020)
2010 – 2019
- 2019
- [c173]Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu:
Speech Recognition with Augmented Synthesized Speech. ASRU 2019: 996-1002 - [c172]Min Ma, Bhuvana Ramabhadran, Jesse Emond, Andrew Rosenberg, Fadi Biadsy:
Comparison of Data Augmentation and Adaptation Strategies for Code-switched Automatic Speech Recognition. ICASSP 2019: 6081-6085 - [c171]Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, R. J. Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran:
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning. INTERSPEECH 2019: 2080-2084 - [c170]Anjuli Kannan, Arindrima Datta, Tara N. Sainath, Eugene Weinstein, Bhuvana Ramabhadran, Yonghui Wu, Ankur Bapna, Zhifeng Chen, Seungji Lee:
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model. INTERSPEECH 2019: 2130-2134 - [i14]Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, R. J. Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran:
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning. CoRR abs/1907.04448 (2019) - [i13]Anjuli Kannan, Arindrima Datta, Tara N. Sainath, Eugene Weinstein, Bhuvana Ramabhadran, Yonghui Wu, Ankur Bapna, Zhifeng Chen, Seungji Lee:
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model. CoRR abs/1909.05330 (2019) - [i12]Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu:
Speech Recognition with Augmented Synthesized Speech. CoRR abs/1909.11699 (2019) - 2018
- [c169]Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny:
Building Competitive Direct Acoustics-to-Word Models for English Conversational Speech Recognition. ICASSP 2018: 4759-4763 - [c168]Andrew Rosenberg, Raul Fernandez, Bhuvana Ramabhadran:
Measuring the Effect of Linguistic Resources on Prosody Modeling for Speech Synthesis. ICASSP 2018: 5114-5118 - [c167]Xuesong Yang, Kartik Audhkhasi, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson:
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition. ICASSP 2018: 5989-5993 - [c166]Yinghui Huang, Abhinav Sethy, Kartik Audhkhasi, Bhuvana Ramabhadran:
Whole Sentence Neural Language Models. ICASSP 2018: 6089-6093 - [c165]Bhuvana Ramabhadran:
Open Problems in Speech Recognition. INTERSPEECH 2018: 1766 - [c164]Takashi Fukuda, Raul Fernandez, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Alexander Sorin, Gakuto Kurata:
Data Augmentation Improves Recognition of Foreign Accented Speech. INTERSPEECH 2018: 2409-2413 - [c163]Jesse Emond, Bhuvana Ramabhadran, Brian Roark, Pedro J. Moreno, Min Ma:
Transliteration Based Approaches to Improve Code-Switched Speech Recognition Performance. SLT 2018: 448-455 - [i11]Xuesong Yang, Kartik Audhkhasi, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson:
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition. CoRR abs/1802.02656 (2018) - 2017
- [j16]Kartik Audhkhasi, Andrew Rosenberg, George Saon, Abhinav Sethy, Bhuvana Ramabhadran, Stanley F. Chen, Michael Picheny:
Recent progress in deep end-to-end models for spoken language processing. IBM J. Res. Dev. 61(4-5): 2:1-2:10 (2017) - [j15]Bhuvana Ramabhadran, Nancy F. Chen, Mary P. Harper, Brian Kingsbury, Kate M. Knill:
Introduction to the Special Issue on End-to-End Speech and Language Processing. IEEE J. Sel. Top. Signal Process. 11(8): 1237-1239 (2017) - [j14]Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, Brian Kingsbury:
End-to-End ASR-Free Keyword Search From Speech. IEEE J. Sel. Top. Signal Process. 11(8): 1351-1359 (2017) - [j13]I-Hsin Chung, Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Vernon Austel, Upendra V. Chaudhari, Brian Kingsbury:
Parallel Deep Neural Network Training for Big Data on Blue Gene/Q. IEEE Trans. Parallel Distributed Syst. 28(6): 1703-1714 (2017) - [c162]Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy:
Language modeling with highway LSTM. ASRU 2017: 244-251 - [c161]Ewout van den Berg, Bhuvana Ramabhadran, Michael Picheny:
Training variance and performance evaluation of neural networks in speech. ICASSP 2017: 2287-2291 - [c160]Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Tom Sercu, Kartik Audhkhasi, Abhinav Sethy, Markus Nußbaum-Thom, Andrew Rosenberg:
Knowledge distillation across ensembles of multilingual models for low-resource languages. ICASSP 2017: 4825-4829 - [c159]Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, Brian Kingsbury:
End-to-end ASR-free keyword search from speech. ICASSP 2017: 4840-4844 - [c158]Takashi Fukuda, Osamu Ichikawa, Gakuto Kurata, Ryuki Tachibana, Samuel Thomas, Bhuvana Ramabhadran:
Effective joint training of denoising feature space transforms and Neural Network based acoustic models. ICASSP 2017: 5190-5194 - [c157]Osamu Ichikawa, Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata, Bhuvana Ramabhadran:
Harmonic feature fusion for robust neural network-based acoustic modeling. ICASSP 2017: 5195-5199 - [c156]Andrew Rosenberg, Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Michael Picheny:
End-to-end speech recognition and keyword search on low-resource languages. ICASSP 2017: 5280-5284 - [c155]Tom Sercu, George Saon, Jia Cui, Xiaodong Cui, Bhuvana Ramabhadran, Brian Kingsbury, Abhinav Sethy:
Network architectures for multilingual speech representation learning. ICASSP 2017: 5295-5299 - [c154]Raul Fernandez, Andrew Rosenberg, Alexander Sorin, Bhuvana Ramabhadran, Ron Hoory:
Voice-transformation-based data augmentation for prosodic classification. ICASSP 2017: 5530-5534 - [c153]George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall:
English Conversational Telephone Speech Recognition by Humans and Machines. INTERSPEECH 2017: 132-136 - [c152]Yinghui Huang, Abhinav Sethy, Bhuvana Ramabhadran:
Fast Neural Network Language Model Lookups at N-Gram Speeds. INTERSPEECH 2017: 274-278 - [c151]Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, George Saon:
Empirical Exploration of Novel Architectures and Objectives for Language Models. INTERSPEECH 2017: 279-283 - [c150]Asaf Rendel, Raul Fernandez, Zvi Kons, Andrew Rosenberg, Ron Hoory, Bhuvana Ramabhadran:
Weakly-Supervised Phrase Assignment from Text in a Speech-Synthesis System Using Noisy Labels. INTERSPEECH 2017: 759-763 - [c149]Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo:
Direct Acoustics-to-Word Models for English Conversational Speech Recognition. INTERSPEECH 2017: 959-963 - [c148]Masayuki Suzuki, Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Kenneth Ward Church, Mark Drake:
Symbol Sequence Search from Telephone Conversation. INTERSPEECH 2017: 3612-3616 - [c147]Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata, Samuel Thomas, Jia Cui, Bhuvana Ramabhadran:
Efficient Knowledge Distillation from an Ensemble of Teachers. INTERSPEECH 2017: 3697-3701 - [c146]Andrew Rosenberg, Bhuvana Ramabhadran:
Bias and Statistical Significance in Evaluating Speech Synthesis with Mean Opinion Scores. INTERSPEECH 2017: 3976-3980 - [i10]Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, Brian Kingsbury:
End-to-End ASR-free Keyword Search from Speech. CoRR abs/1701.04313 (2017) - [i9]George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall:
English Conversational Telephone Speech Recognition by Humans and Machines. CoRR abs/1703.02136 (2017) - [i8]Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo:
Direct Acoustics-to-Word Models for English Conversational Speech Recognition. CoRR abs/1703.07754 (2017) - [i7]Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy:
Language Modeling with Highway LSTM. CoRR abs/1709.06436 (2017) - [i6]Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny:
Building competitive direct acoustics-to-word models for English conversational speech recognition. CoRR abs/1712.03133 (2017) - 2016
- [c145]Jie Chen, Lingfei Wu, Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran:
Efficient one-vs-one kernel ridge regression for speech recognition. ICASSP 2016: 2454-2458 - [c144]Asaf Rendel, Raul Fernandez, Ron Hoory, Bhuvana Ramabhadran:
Using continuous lexical embeddings to improve symbolic-prosody prediction in a text-to-speech front-end. ICASSP 2016: 5655-5659 - [c143]Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran:
Semantic word embedding neural network language models for automatic speech recognition. ICASSP 2016: 5995-5999 - [c142]Markus Nußbaum-Thom, Jia Cui, Bhuvana Ramabhadran, Vaibhava Goel:
Acoustic Modeling Using Bidirectional Gated Recurrent Convolutional Units. INTERSPEECH 2016: 390-394 - [c141]Masayuki Suzuki, Ryuki Tachibana, Samuel Thomas, Bhuvana Ramabhadran, George Saon:
Domain Adaptation of CNN Based Acoustic Models Under Limited Resource Settings. INTERSPEECH 2016: 1588-1592 - [c140]Samuel Thomas, Kartik Audhkhasi, Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran:
Multilingual Data Selection for Low Resource Speech Recognition. INTERSPEECH 2016: 3853-3857 - [i5]Ewout van den Berg, Bhuvana Ramabhadran, Michael Picheny:
Training variance and performance evaluation of neural networks in speech. CoRR abs/1606.04521 (2016) - [i4]Dmitriy Serdyuk, Kartik Audhkhasi, Philemon Brakel, Bhuvana Ramabhadran, Samuel Thomas, Yoshua Bengio:
Invariant Representations for Noisy Speech Recognition. CoRR abs/1612.01928 (2016) - 2015
- [j12]Tara N. Sainath, Brian Kingsbury, George Saon, Hagen Soltau, Abdel-rahman Mohamed, George E. Dahl, Bhuvana Ramabhadran:
Deep Convolutional Neural Networks for Large-scale Speech Tasks. Neural Networks 64: 39-48 (2015) - [c139]Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, Abhinav Sethy, Kartik Audhkhasi, Xiaodong Cui, Ellen Kislal, Lidia Mangu, Markus Nußbaum-Thom, Michael Picheny, Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang, Philip C. Woodland:
Multilingual representations for low resource speech recognition and keyword search. ASRU 2015: 259-266 - [c138]Abhinav Sethy, Stanley F. Chen, Ebru Arisoy, Bhuvana Ramabhadran:
Unnormalized exponential and neural network language models. ICASSP 2015: 5416-5420 - [c137]Ebru Arisoy, Abhinav Sethy, Bhuvana Ramabhadran, Stanley F. Chen:
Bidirectional recurrent neural network language models for automatic speech recognition. ICASSP 2015: 5421-5425 - [c136]Ewout van den Berg, Daniel Brand, Rajesh Bordawekar, Leonid Rachevsky, Bhuvana Ramabhadran:
Efficient GPU implementation of convolutional neural networks for speech recognition. INTERSPEECH 2015: 1483-1487 - [c135]Raul Fernandez, Asaf Rendel, Bhuvana Ramabhadran, Ron Hoory:
Using deep bidirectional recurrent neural networks for prosodic-target prediction in a unit-selection text-to-speech system. INTERSPEECH 2015: 1606-1610 - [c134]Andrew Rosenberg, Raul Fernandez, Bhuvana Ramabhadran:
Modeling phrasing and prominence using deep recurrent learning. INTERSPEECH 2015: 3066-3070 - [c133]Jia Cui, George Saon, Bhuvana Ramabhadran, Brian Kingsbury:
A multi-region deep neural network model in speech recognition. INTERSPEECH 2015: 3244-3248 - [c132]Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran:
Diverse Embedding Neural Network Language Models. ICLR (Workshop) 2015 - 2014
- [j11]Murat Saraclar, Ciprian Chelba, Bhuvana Ramabhadran:
Editorial for the special issue on spoken content retrieval. Comput. Speech Lang. 28(5): 1019-1020 (2014) - [j10]Ebru Arisoy, Stanley F. Chen, Bhuvana Ramabhadran, Abhinav Sethy:
Converting Neural Network Language Models into Back-off Language Models for Efficient Decoding in Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(1): 184-192 (2014) - [c131]Po-Sen Huang, Haim Avron, Tara N. Sainath, Vikas Sindhwani, Bhuvana Ramabhadran:
Kernel methods match Deep Neural Networks on TIMIT. ICASSP 2014: 205-209 - [c130]Vijayaditya Peddinti, Tara N. Sainath, Shay Maymon, Bhuvana Ramabhadran, David Nahamoo, Vaibhava Goel:
Deep Scattering Spectrum with deep neural networks. ICASSP 2014: 210-214 - [c129]Abhinav Sethy, Stanley F. Chen, Bhuvana Ramabhadran, Paul Vozila:
Static interpolation of exponential n-gram models using features of features. ICASSP 2014: 4878-4882 - [c128]Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George Saon, Bhuvana Ramabhadran:
Improvements to filterbank and delta learning within a deep neural network framework. ICASSP 2014: 6839-6843 - [c127]Jia Cui, Jonathan Mamou, Brian Kingsbury, Bhuvana Ramabhadran:
Automatic keyword selection for keyword search development and tuning. ICASSP 2014: 7839-7843 - [c126]Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Shrikanth S. Narayanan:
Semi-supervised term-weighted value rescoring for keyword search. ICASSP 2014: 7869-7873 - [c125]Raul Fernandez, Jia Cui, Andrew Rosenberg, Bhuvana Ramabhadran, Xiaodong Cui:
Exploiting vocal-source features to improve ASR accuracy for low-resource languages. INTERSPEECH 2014: 805-809 - [c124]Jia Cui, Bhuvana Ramabhadran, Xiaodong Cui, Andrew Rosenberg, Brian Kingsbury, Abhinav Sethy:
Recent improvements in neural network acoustic modeling for LVCSR in low resource languages. INTERSPEECH 2014: 840-844 - [c123]Tara N. Sainath, Vijayaditya Peddinti, Brian Kingsbury, Petr Fousek, Bhuvana Ramabhadran, David Nahamoo:
Deep scattering spectra with deep neural networks for LVCSR tasks. INTERSPEECH 2014: 900-904 - [c122]Tara N. Sainath, I-Hsin Chung, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Brian Kingsbury, George Saon, Vernon Austel, Upendra V. Chaudhari:
Parallel deep neural network training for LVCSR tasks using blue gene/Q. INTERSPEECH 2014: 1048-1052 - [c121]Ewout van den Berg, Bhuvana Ramabhadran:
Dictionary-based pitch tracking with dynamic programming. INTERSPEECH 2014: 1347-1351 - [c120]Xiaodong Cui, Brian Kingsbury, Jia Cui, Bhuvana Ramabhadran, Andrew Rosenberg, Mohammad Sadegh Rasooli, Owen Rambow, Nizar Habash, Vaibhava Goel:
Improving deep neural network acoustic modeling for audio corpus indexing under the IARPA babel program. INTERSPEECH 2014: 2103-2107 - [c119]Raul Fernandez, Asaf Rendel, Bhuvana Ramabhadran, Ron Hoory:
Prosody contour prediction with long short-term memory, bi-directional, deep recurrent neural networks. INTERSPEECH 2014: 2268-2272 - [c118]I-Hsin Chung, Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Vernon Austel, Upendra V. Chaudhari, Brian Kingsbury:
Parallel Deep Neural Network Training for Big Data on Blue Gene/Q. SC 2014: 745-753 - 2013
- [j9]Tara N. Sainath, Brian Kingsbury, Hagen Soltau, Bhuvana Ramabhadran:
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks. IEEE Trans. Speech Audio Process. 21(11): 2267-2276 (2013) - [c117]Abhinav Sethy, Stanley F. Chen, Ebru Arisoy, Bhuvana Ramabhadran, Kartik Audhkhasi, Shrikanth S. Narayanan, Paul Vozila:
Joint training of interpolated exponential n-gram models. ASRU 2013: 25-30 - [c116]Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, Bhuvana Ramabhadran:
Learning filter banks within a deep neural network framework. ASRU 2013: 297-302 - [c115]Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Accelerating Hessian-free optimization for Deep Neural Networks by implicit preconditioning and sampling. ASRU 2013: 303-308 - [c114]Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to Deep Convolutional Neural Networks for LVCSR. ASRU 2013: 315-320 - [c113]Murat Saraclar, Abhinav Sethy, Bhuvana Ramabhadran, Lidia Mangu, Jia Cui, Xiaodong Cui, Brian Kingsbury, Jonathan Mamou:
An empirical study of confusion modeling in keyword search for low resource languages. ASRU 2013: 464-469 - [c112]Tara N. Sainath, Brian Kingsbury, Vikas Sindhwani, Ebru Arisoy, Bhuvana Ramabhadran:
Low-rank matrix factorization for Deep Neural Network training with high-dimensional output targets. ICASSP 2013: 6655-6659 - [c111]Jia Cui, Xiaodong Cui, Bhuvana Ramabhadran, Janice Kim, Brian Kingsbury, Jonathan Mamou, Lidia Mangu, Michael Picheny, Tara N. Sainath, Abhinav Sethy:
Developing speech recognition systems for corpus indexing under the IARPA Babel program. ICASSP 2013: 6753-6757 - [c110]Raul Fernandez, Asaf Rendel, Bhuvana Ramabhadran, Ron Hoory:
F0 contour prediction with a deep belief network-Gaussian process hybrid model. ICASSP 2013: 6885-6889 - [c109]Rohit Prabhavalkar, Tara N. Sainath, David Nahamoo, Bhuvana Ramabhadran, Dimitri Kanevsky:
An evaluation of posterior modeling techniques for phonetic recognition. ICASSP 2013: 7165-7169 - [c108]Ebru Arisoy, Stanley F. Chen, Bhuvana Ramabhadran, Abhinav Sethy:
Converting Neural Network Language Models into back-off language models for efficient decoding in automatic speech recognition. ICASSP 2013: 8242-8246 - [c107]Jonathan Mamou, Jia Cui, Xiaodong Cui, Mark J. F. Gales, Brian Kingsbury, Kate M. Knill, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, Philip C. Woodland:
System combination and score normalization for spoken term detection. ICASSP 2013: 8272-8276 - [c106]Brian Kingsbury, Jia Cui, Xiaodong Cui, Mark J. F. Gales, Kate M. Knill, Jonathan Mamou, Lidia Mangu, David Nolden, Michael Picheny, Bhuvana Ramabhadran, Ralf Schlüter, Abhinav Sethy, Philip C. Woodland:
A high-performance Cantonese keyword search system. ICASSP 2013: 8277-8281 - [c105]Tara N. Sainath, Abdel-rahman Mohamed, Brian Kingsbury, Bhuvana Ramabhadran:
Deep convolutional neural networks for LVCSR. ICASSP 2013: 8614-8618 - [i3]Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to deep convolutional neural networks for LVCSR. CoRR abs/1309.1501 (2013) - [i2]Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improving training time of Hessian-free optimization for deep neural networks using preconditioning and sampling. CoRR abs/1309.1508 (2013) - [i1]Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Shrikanth S. Narayanan:
Generalized Ambiguity Decomposition for Understanding Ensemble Diversity. CoRR abs/1312.7463 (2013) - 2012
- [j8]Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Ariya Rastrow, Nobuyasu Itoh, Masafumi Nishimura:
Acoustically discriminative language model training with pseudo-hypothesis. Speech Commun. 54(2): 219-228 (2012) - [j7]Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran:
Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech. Speech Commun. 54(3): 491-502 (2012) - [j6]Junlan Feng, Bhuvana Ramabhadran, John H. L. Hansen, Jason D. Williams:
Trends in Speech and Language Processing [In the Spotlight]. IEEE Signal Process. Mag. 29(1): 177-179 (2012) - [j5]Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky, Dirk Van Compernolle, Kris Demuynck, Jort F. Gemmeke, Jerome R. Bellegarda, Shiva Sundaram:
Exemplar-Based Processing for Speech Recognition: An Overview. IEEE Signal Process. Mag. 29(6): 98-113 (2012) - [c104]Nobuyasu Itoh, Tara N. Sainath, Dan-Ning Jiang, Jie Zhou, Bhuvana Ramabhadran:
N-best entropy based data selection for acoustic modeling. ICASSP 2012: 4133-4136 - [c103]Takashi Fukuda, Ryuki Tachibana, Upendra V. Chaudhari, Bhuvana Ramabhadran, Puming Zhan:
Constructing ensembles of dissimilar acoustic models using hidden attributes of training data. ICASSP 2012: 4141-4144 - [c102]Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran:
Auto-encoder bottleneck features using deep belief networks. ICASSP 2012: 4153-4156 - [c101]Christian Plahl, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo:
Improved pre-training of Deep Belief Networks using Sparse Encoding Symmetric Machines. ICASSP 2012: 4165-4168 - [c100]Raul Fernandez, Steve Minnis, Bhuvana Ramabhadran:
Prediction of F0 contours from symbolic and numerical variables using continuous conditional random fields. ICASSP 2012: 4621-4624 - [c99]Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Shrikanth S. Narayanan:
Creating ensemble of diverse maximum entropy models. ICASSP 2012: 4845-4848 - [c98]Tara N. Sainath, David Nahamoo, Dimitri Kanevsky, Bhuvana Ramabhadran:
Enhancing Exemplar-Based Posteriors for Speech Recognition Tasks. INTERSPEECH 2012: 2130-2133 - [c97]Andrew Rosenberg, Raul Fernandez, Bhuvana Ramabhadran:
Phrase Boundary Assignment from Text in Multiple Domains. INTERSPEECH 2012: 2558-2561 - [c96]Ebru Arisoy, Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran:
Deep Neural Network Language Models. WLM@NAACL-HLT 2012: 20-28 - [e1]Bhuvana Ramabhadran, Sanjeev Khudanpur, Ebru Arisoy:
Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, WLM@NAACL-HLT 2012, Montrèal, Canada, June 8, 2012. Association for Computational Linguistics 2012, ISBN 978-1-937284-20-6 [contents] - 2011
- [j4]Michael Picheny, David Nahamoo, Vaibhava Goel, Brian Kingsbury, Bhuvana Ramabhadran, Steven J. Rennie, George Saon:
Trends and advances in speech recognition. IBM J. Res. Dev. 55(5): 2 (2011) - [j3]Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, David Nahamoo, Dimitri Kanevsky:
Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2598-2613 (2011) - [c95]Ryuki Tachibana, Takashi Fukuda, Upendra V. Chaudhari, Bhuvana Ramabhadran, Puming Zhan:
Frame-level AnyBoost for LVCSR with the MMI Criterion. ASRU 2011: 12-17 - [c94]Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran, Petr Fousek, Petr Novák, Abdel-rahman Mohamed:
Making Deep Belief Networks effective for large vocabulary continuous speech recognition. ASRU 2011: 30-35 - [c93]Tara N. Sainath, David Nahamoo, Dimitri Kanevsky, Bhuvana Ramabhadran, Parikshit M. Shah:
A convex hull approach to sparse representations for exemplar-based speech recognition. ASRU 2011: 59-64 - [c92]Stanley F. Chen, Abhinav Sethy, Bhuvana Ramabhadran:
Pruning exponential language models. ASRU 2011: 237-242 - [c91]Alexander Sorin, Hagai Aronowitz, Jonathan Mamou, Orith Toledo-Ronen, Ron Hoory, Michael Kuritzky, Yael Erez, Bhuvana Ramabhadran, Abhinav Sethy:
Speech processing and retrieval in a personal memory aid system for the elderly. ICASSP 2011: 1749-1752 - [c90]Raul Fernandez, Bhuvana Ramabhadran:
Exploiting active-learning strategies for annotating prosodic events with limited labeled data. ICASSP 2011: 2208-2211 - [c89]Tara N. Sainath, David Nahamoo, Bhuvana Ramabhadran, Dimitri Kanevsky, Vaibhava Goel, Parikshit M. Shah:
Exemplar-based Sparse Representation phone identification features. ICASSP 2011: 4492-4495 - [c88]Bin Zhang, Abhinav Sethy, Tara N. Sainath, Bhuvana Ramabhadran:
Application specific loss minimization using gradient boosting. ICASSP 2011: 4880-4883 - [c87]Ariya Rastrow, Markus Dreyer, Abhinav Sethy, Sanjeev Khudanpur, Bhuvana Ramabhadran, Mark Dredze:
Hill climbing on speech lattices: A new rescoring framework. ICASSP 2011: 5032-5035 - [c86]Abdel-rahman Mohamed, Tara N. Sainath, George E. Dahl, Bhuvana Ramabhadran, Geoffrey E. Hinton, Michael A. Picheny:
Deep Belief Networks using discriminative features for phone recognition. ICASSP 2011: 5060-5063 - [c85]Dimitri Kanevsky, David Nahamoo, Tara N. Sainath, Bhuvana Ramabhadran, Peder A. Olsen:
A-Functions: A generalization of Extended Baum-Welch transformations to convex optimization. ICASSP 2011: 5164-5167 - [c84]Abhinav Sethy, Stanley F. Chen, Bhuvana Ramabhadran:
Distributed training of large scale exponential language models. ICASSP 2011: 5520-5523 - [c83]Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran:
Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition. ICASSP 2011: 5572-5575 - [c82]Ruhi Sarikaya, Geoffrey E. Hinton, Bhuvana Ramabhadran:
Deep belief nets for natural language call-routing. ICASSP 2011: 5680-5683 - [c81]Ebru Arisoy, Bhuvana Ramabhadran, Hong-Kwang Jeff Kuo:
Feature Combination Approaches for Discriminative Language Models. INTERSPEECH 2011: 617-620 - [c80]Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky:
Reducing Computational Complexities of Exemplar-Based Sparse Representations with Applications to Large Vocabulary Speech Recognition. INTERSPEECH 2011: 785-788 - [c79]Dimitri Kanevsky, David Nahamoo, Tara N. Sainath, Bhuvana Ramabhadran:
Convergence of Line Search A-Function Methods. INTERSPEECH 2011: 997-1000 - [c78]Ruhi Sarikaya, Stanley F. Chen, Bhuvana Ramabhadran:
Shrinkage-Based Features for Natural Language Call Routing. INTERSPEECH 2011: 1309-1312 - [c77]Leonid Rachevsky, Dimitri Kanevsky, Ruhi Sarikaya, Bhuvana Ramabhadran:
Clustering with Modified Cosine Distance Learned from Constraints. INTERSPEECH 2011: 1313-1316 - [c76]Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Vozila:
Improved Spoken Query Transcription Using Co-Occurrence Information. INTERSPEECH 2011: 1473-1476 - [c75]Andrew Rosenberg, Raul Fernandez, Bhuvana Ramabhadran:
"What is... Dengue Fever?" - Modeling and Predicting Pronunciation Errors in a Text-to-Speech System. INTERSPEECH 2011: 2189-2192 - [c74]Stanley F. Chen, Stephen M. Chu, Ahmad Emami, Lidia Mangu, Bhuvana Ramabhadran, Ruhi Sarikaya, Abhinav Sethy:
Performance prediction and shrinking language models. MLSLP 2011 - 2010
- [c73]Dimitri Kanevsky, Avishy Carmi, Lior Horesh, Pini Gurfil, Bhuvana Ramabhadran, Tara N. Sainath:
Kalman filtering for compressed sensing. FUSION 2010: 1-8 - [c72]Avishy Carmi, Tara N. Sainath, Pini Gurfil, Dimitri Kanevsky, David Nahamoo, Bhuvana Ramabhadran:
The Use of isometric transformations and bayesian estimation in compressive sensing for fMRI classification. ICASSP 2010: 493-496 - [c71]Tara N. Sainath, Avishy Carmi, Dimitri Kanevsky, Bhuvana Ramabhadran:
Bayesian compressive sensing for phonetic classification. ICASSP 2010: 4370-4373 - [c70]Srikanth Vishnubhotla, Raul Fernandez, Bhuvana Ramabhadran:
An autoencoder neural-network based low-dimensionality approach to excitation modeling for HMM-based text-to-speech. ICASSP 2010: 4614-4617 - [c69]Ruhi Sarikaya, Ahmad Emami, Mohamed Afify, Bhuvana Ramabhadran:
Continuous space language modeling techniques. ICASSP 2010: 5186-5189 - [c68]Carolina Parada, Abhinav Sethy, Bhuvana Ramabhadran:
Balancing false alarms and hits in Spoken Term Detection. ICASSP 2010: 5286-5289 - [c67]Mark E. Epstein, Bhuvana Ramabhadran, Rajesh Balchandran:
Improved language modeling for conversational applications using sentence quality. ICASSP 2010: 5378-5381 - [c66]Rajesh Balchandran, Leonid Rachevsky, Bhuvana Ramabhadran, Miroslav Novak:
Techniques for topic detection based processing in spoken dialog systems. INTERSPEECH 2010: 82-85 - [c65]Vaibhava Goel, Tara N. Sainath, Bhuvana Ramabhadran, Peder A. Olsen, David Nahamoo, Dimitri Kanevsky:
Incorporating sparse representation phone identification features in automatic speech recognition using exponential families. INTERSPEECH 2010: 1345-1348 - [c64]Raul Fernandez, Bhuvana Ramabhadran:
Discriminative training and unsupervised adaptation for labeling prosodic events with limited training data. INTERSPEECH 2010: 1429-1432 - [c63]Ruhi Sarikaya, Stanley F. Chen, Abhinav Sethy, Bhuvana Ramabhadran:
Impact of word classing on shrinkage-based language models. INTERSPEECH 2010: 1804-1807 - [c62]Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky, Abhinav Sethy:
Sparse representation features for speech recognition. INTERSPEECH 2010: 2254-2257 - [c61]Abhinav Sethy, Tara N. Sainath, Bhuvana Ramabhadran, Dimitri Kanevsky:
Data selection for language modeling using sparse representations. INTERSPEECH 2010: 2258-2261 - [c60]Tara N. Sainath, Sameer Maskey, Dimitri Kanevsky, Bhuvana Ramabhadran, David Nahamoo, Julia Hirschberg:
Sparse representations for text categorization. INTERSPEECH 2010: 2266-2269 - [c59]Dimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo:
An analysis of sparseness and regularization in exemplar-based methods for speech classification. INTERSPEECH 2010: 2842-2845 - [c58]Ariya Rastrow, Frederick Jelinek, Abhinav Sethy, Bhuvana Ramabhadran:
Unsupervised Model Adaptation using Information-Theoretic Criterion. HLT-NAACL 2010: 190-197
2000 – 2009
- 2009
- [j2]Abhinav Sethy, Panayiotis G. Georgiou, Bhuvana Ramabhadran, Shrikanth S. Narayanan:
An Iterative Relative Entropy Minimization-Based Data Selection Approach for n-Gram Model Adaptation. IEEE Trans. Speech Audio Process. 17(1): 13-23 (2009) - [c57]Stanley F. Chen, Lidia Mangu, Bhuvana Ramabhadran, Ruhi Sarikaya, Abhinav Sethy:
Scaling shrinkage-based language models. ASRU 2009: 299-304 - [c56]Ariya Rastrow, Abhinav Sethy, Bhuvana Ramabhadran:
Constrained discriminative training of N-gram language models. ASRU 2009: 311-316 - [c55]Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny:
An exploration of large vocabulary tools for small vocabulary phonetic recognition. ASRU 2009: 359-364 - [c54]Carolina Parada, Abhinav Sethy, Bhuvana Ramabhadran:
Query-by-example Spoken Term Detection For OOV terms. ASRU 2009: 404-409 - [c53]Narges Bani Asadi, Irina Rish, Katya Scheinberg, Dimitri Kanevsky, Bhuvana Ramabhadran:
Map approach to learning sparse Gaussian Markov networks. ICASSP 2009: 1721-1724 - [c52]Dimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran:
A generalized family of parameter estimation techniques. ICASSP 2009: 1725-1728 - [c51]Ariya Rastrow, Abhinav Sethy, Bhuvana Ramabhadran:
A new method for OOV detection using hybrid word/fragment system. ICASSP 2009: 3953-3956 - [c50]Dogan Can, Erica Cooper, Abhinav Sethy, Christopher M. White, Bhuvana Ramabhadran, Murat Saraclar:
Effect of pronounciations on OOV queries in spoken term detection. ICASSP 2009: 3957-3960 - [c49]Christopher M. White, Abhinav Sethy, Bhuvana Ramabhadran, Patrick J. Wolfe, Erica Cooper, Murat Saraclar, James K. Baker:
Unsupervised pronunciation validation. ICASSP 2009: 4301-4304 - [c48]Ruhi Sarikaya, Sameer Maskey, R. Zhang, Ea-Ee Jan, D. Wang, Bhuvana Ramabhadran, Salim Roukos:
Iterative sentence-pair extraction from quasi-parallel corpora for machine translation. INTERSPEECH 2009: 432-435 - [c47]Ariya Rastrow, Abhinav Sethy, Bhuvana Ramabhadran, Frederick Jelinek:
Towards using hybrid word and fragment units for vocabulary independent LVCSR systems. INTERSPEECH 2009: 1931-1934 - [c46]Vit Libal, Bhuvana Ramabhadran, Nadia Mana, Fabio Pianesi, Paul Chippendale, Oswald Lanz, Gerasimos Potamianos:
Multimodal Classification of Activities of Daily Living Inside Smart Homes. IWANN (2) 2009: 687-694 - [c45]Osamuyimen Stewart, Michael Picheny, David M. Lubensky, Bhuvana Ramabhadran:
Cultural voice markers in speech-to-speech machine translation systems. IWIC 2009: 313-316 - [c44]Bhuvana Ramabhadran, Abhinav Sethy, Jonathan Mamou, Brian Kingsbury, Upendra V. Chaudhari:
Fast decoding for open vocabulary spoken term detection. HLT-NAACL (Short Papers) 2009: 277-280 - [c43]Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jansche, Sanjeev Khudanpur, Bhuvana Ramabhadran, Michael Riley, Murat Saraclar, Abhinav Sethy, Morgan Ulinski, Christopher M. White:
Web derived pronunciations for spoken term detection. SIGIR 2009: 83-90 - 2008
- [c42]Raul Fernandez, Zvi Kons, Slava Shechtman, Zhiwei Shuang, Ron Hoory, Bhuvana Ramabhadran, Yong Qin:
The IBM Submission to the 2008 Text-to-Speech Blizzard Challenge. Blizzard Challenge 2008 - [c41]Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Karthik Visweswariah:
Boosted MMI for model and feature-space discriminative training. ICASSP 2008: 4057-4060 - [c40]Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabhadran:
Gradient steepness metrics using extended Baum-Welch transformations for universal pattern recognition tasks. ICASSP 2008: 4533-4536 - [c39]Dimitri Kanevsky, Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo:
Generalization of extended baum-welch parameter estimation for discriminative training and decoding. INTERSPEECH 2008: 277-280 - [c38]Abhinav Sethy, Bhuvana Ramabhadran:
Bag-of-word normalized n-gram models. INTERSPEECH 2008: 1594-1597 - [c37]Sangyun Hahn, Abhinav Sethy, Hong-Kwang Jeff Kuo, Bhuvana Ramabhadran:
A study of unsupervised clustering techniques for language modeling. INTERSPEECH 2008: 1598-1601 - [c36]Jonathan Mamou, Bhuvana Ramabhadran:
Phonetic query expansion for spoken document retrieval. INTERSPEECH 2008: 2106-2109 - 2007
- [c35]Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabhadran:
Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations. ASRU 2007: 306-311 - [c34]Bhuvana Ramabhadran, Olivier Siohan, Abhinav Sethy:
The IBM 2007 speech transcription system for European parliamentary speeches. ASRU 2007: 472-477 - [c33]Brett Matthews, Upendra V. Chaudhari, Bhuvana Ramabhadran:
Fast audio search using vector space modelling. ASRU 2007: 641-646 - [c32]Abhinav Sethy, Shrikanth S. Narayanan, Bhuvana Ramabhadran:
Data Driven Approach for Language Model Adaptation using Stepwise Relative Entropy Minimization. ICASSP (4) 2007: 177-180 - [c31]Jonathan Mamou, Bhuvana Ramabhadran, Olivier Siohan:
Vocabulary independent spoken term detection. SIGIR 2007: 615-622 - [c30]Raul Fernandez, Bhuvana Ramabhadran:
Automatic exploration of corpus-specific properties for expressive text-to-speech: a case study in emphasis. SSW 2007: 34-39 - 2006
- [c29]Geoffrey Zweig, Olivier Siohan, George Saon, Bhuvana Ramabhadran, Daniel Povey, Lidia Mangu, Brian Kingsbury:
Automated Quality Monitoring in the Call Center with ASR and Maximum Entropy. ICASSP (1) 2006: 589-592 - [c28]Bhuvana Ramabhadran, Olivier Siohan, Lidia Mangu, Geoffrey Zweig, Martin Westphal, Henrik Schulz, Alvaro Soneiro:
The IBM 2006 speech transcription system for european parliamentary speeches. INTERSPEECH 2006 - [c27]Geoffrey Zweig, Olivier Siohan, George Saon, Bhuvana Ramabhadran, Daniel Povey, Lidia Mangu, Brian Kingsbury:
Automated Quality Monitoring for Call Centers using Speech and NLP Technologies. HLT-NAACL 2006 - [c26]George Saon, Bhuvana Ramabhadran, Geoffrey Zweig:
On the Effect Ofword Error Rate on Automated Quality Monitoring. SLT 2006: 106-109 - 2005
- [c25]Olivier Siohan, Bhuvana Ramabhadran, Brian Kingsbury:
Contructing Ensembles of ASR Systems Using Randomized Decision Trees. ICASSP (1) 2005: 197-200 - [c24]Bhuvana Ramabhadran:
Exploiting large quantities of spontaneous speech for unsupervised training of acoustic models. INTERSPEECH 2005: 1617-1620 - 2004
- [j1]William Byrne, David S. Doermann, Martin Franz, Samuel Gustman, Jan Hajic, Douglas W. Oard, Michael Picheny, Josef Psutka, Bhuvana Ramabhadran, Dagobert Soergel, Todd Ward, Wei-Jing Zhu:
Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE Trans. Speech Audio Process. 12(4): 420-435 (2004) - [c23]Bhuvana Ramabhadran, Olivier Siohan, Geoffrey Zweig:
Use of metadata to improve recognition of spontaneous speech and named entities. INTERSPEECH 2004: 381-384 - [c22]Olivier Siohan, Bhuvana Ramabhadran, Geoffrey Zweig:
Speech recognition error analysis on the English MALACH corpus. INTERSPEECH 2004: 413-416 - [c21]Abhinav Sethy, Shrikanth S. Narayanan, Bhuvana Ramabhadran:
Measuring convergence in language model estimation using relative entropy. INTERSPEECH 2004: 1057-1060 - [c20]Douglas W. Oard, Dagobert Soergel, David S. Doermann, Xiaoli Huang, G. Craig Murray, Jianqiang Wang, Bhuvana Ramabhadran, Martin Franz, Samuel Gustman, James Mayfield, Liliya Kharevych, Stephanie M. Strassel:
Building an information retrieval test collection for spontaneous conversational speech. SIGIR 2004: 41-48 - 2003
- [c19]Bhuvana Ramabhadran, Jing Huang, Michael Picheny:
Towards automatic transcription of large spoken archives - English ASR for the MALACH project. ICASSP (1) 2003: 216-219 - [c18]Martin Franz, Bhuvana Ramabhadran, Todd Ward, Michael Picheny:
Automated transcription and topic segmentation of large spoken archives. INTERSPEECH 2003: 953-956 - [c17]Bhuvana Ramabhadran, Jing Huang, Upendra V. Chaudhari, Giridharan Iyengar, Harriet J. Nock:
Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives. INTERSPEECH 2003: 2589-2592 - 2002
- [c16]Dagobert Soergel, Samuel Gustman, Mark Kornbluh, Bhuvana Ramabhadran, Jerry Goldman:
Access to large spoken archives: Uses and technology. Sponsored by SIG VIS. ASIST 2002: 469-470 - [c15]Samuel Gustman, Dagobert Soergel, Douglas W. Oard, William J. Byrne, Michael Picheny, Bhuvana Ramabhadran, Douglas Greenberg:
Supporting access to large digital oral history archives. JCDL 2002: 18-27 - [c14]Douglas W. Oard, Dina Demner-Fushman, Jan Hajic, Bhuvana Ramabhadran, Samuel Gustman, William J. Byrne, Dagobert Soergel, Bonnie J. Dorr, Philip Resnik, Michael Picheny:
Cross-Language Access to Recorded Speech in the MALACH Project. TSD 2002: 57-64 - [c13]Josef Psutka, Pavel Ircing, Josef V. Psutka, Vlasta Radová, William J. Byrne, Jan Hajic, Samuel Gustman, Bhuvana Ramabhadran:
Automatic Transcription of Czech Language Oral History in the MALACH Project: Resources and Initial Experiments. TSD 2002: 253-260 - 2001
- [c12]Yuqing Gao, Bhuvana Ramabhadran, C. Julian Chen, Hakan Erdogan, Michael Picheny:
Innovative approaches for large vocabulary name recognition. ICASSP 2001: 53-56 - [c11]Andrew Aaron, Scott Saobing Chen, Paul S. Cohen, Satya Dharanipragada, Ellen Eide, Martin Franz, Jean-Michel LeRoux, X. Luo, Benoît Maison, Lidia Mangu, T. Mathes, Miroslav Novak, Peder A. Olsen, Michael Picheny, Harry Printz, Bhuvana Ramabhadran, Andrej Sakrajda, George Saon, Borivoj Tydlitát, Karthik Visweswariah, D. Yuk:
Speech recognition for DARPA Communicator. ICASSP 2001: 489-492 - [c10]Robert E. Donovan, Abraham Ittycheriah, Martin Franz, Bhuvana Ramabhadran, Ellen Eide, Mahesh Viswanathan, Raimo Bakis, Wael Hamza, Michael A. Picheny, P. Gleason, T. Rutherfoord, P. Cox, D. Green, Eric Janke, S. Revelin, Claire Waast, B. Zeller, C. Guenther, J. Kunzmann:
Current status of the IBM Trainable Speech Synthesis System. SSW 2001: 207 - 2000
- [c9]Bhuvana Ramabhadran, Yuqing Gao:
Decision tree based rate of speech modeling for speech recognition. INTERSPEECH 2000: 600-603 - [c8]Bhuvana Ramabhadran, Yuqing Gao, Michael Picheny:
Dynamic selection of feature spaces for robust speech recognition. INTERSPEECH 2000: 913-916
1990 – 1999
- 1999
- [c7]Bhuvana Ramabhadran, Sabine Deligne, Abraham Ittycheriah:
Acoustics-based baseform generation with pronunciation and/or phonotactic models. EUROSPEECH 1999: 507-510 - [c6]Peter V. de Souza, Bhuvana Ramabhadran, Yuqing Gao, Michael Picheny:
Enhanced likelihood computation using regression. EUROSPEECH 1999: 1699-1702 - 1998
- [c5]Bhuvana Ramabhadran, Lalit R. Bahl, Peter DeSouza, Mukund Padmanabhan:
Acoustics-only based automatic phonetic baseform generation. ICASSP 1998: 309-312 - [c4]Mukund Padmanabhan, Ellen Eide, Bhuvana Ramabhadran, Ganesh N. Ramaswamy, Lalit R. Bahl:
Speech recognition performance on a voicemail transcription task. ICASSP 1998: 913-916 - [c3]Ramesh A. Gopinath, Bhuvana Ramabhadran, Satya Dharanipragada:
Factor analysis invariant to linear transformations of data. ICSLP 1998 - [c2]Mukund Padmanabhan, Bhuvana Ramabhadran, Sankar Basu:
Speech recognition performance on a new voicemail transcription task. ICSLP 1998 - [c1]Bhuvana Ramabhadran, Abraham Ittycheriah:
Phonological rules for enhancing acoustic enrollment of unknown words. ICSLP 1998
Coauthor Index
aka: Abdelrahman Mohamed
aka: Pedro Moreno Mengibar
aka: Michael A. Picheny
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-11 21:38 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint