Ehsan Variani
2020 – today
- 2023
- [c29] Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. ICASSP 2023: 1-5
- [c28] Ehsan Variani, Ke Wu, David Rybach, Cyril Allauzen, Michael Riley:
Alignment Entropy Regularization. ICASSP 2023: 1-5
- [c27] Ke Wu, Ehsan Variani, Tom Bagby, Michael Riley:
Last: Scalable Lattice-Based Speech Modelling in Jax. ICASSP 2023: 1-5
- [i13] Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. CoRR abs/2302.08583 (2023)
- [i12] Ke Wu, Ehsan Variani, Tom Bagby, Michael Riley:
LAST: Scalable Lattice-Based Speech Modelling in JAX. CoRR abs/2304.13134 (2023)
- 2022
- [c26] Neeraj Gaur, Tongzhou Chen, Ehsan Variani, Parisa Haghani, Bhuvana Ramabhadran, Pedro J. Moreno:
Multilingual Second-Pass Rescoring for Automatic Speech Recognition Systems. ICASSP 2022: 6407-6411
- [c25] Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey:
UserLibri: A Dataset for ASR Personalization Using Only Text. INTERSPEECH 2022: 694-698
- [c24] Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. INTERSPEECH 2022: 1031-1035
- [c23] Ehsan Variani, Michael Riley, David Rybach, Cyril Allauzen, Tongzhou Chen, Bhuvana Ramabhadran:
On Adaptive Weight Interpolation of the Hybrid Autoregressive Transducer. INTERSPEECH 2022: 1646-1650
- [c22] Ehsan Variani, Ke Wu, Michael D. Riley, David Rybach, Matt Shannon, Cyril Allauzen:
Global Normalization for Streaming Speech Recognition in a Modular Framework. NeurIPS 2022
- [c21] Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. SLT 2022: 197-204
- [i11] Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. CoRR abs/2204.07553 (2022)
- [i10] Ehsan Variani, Ke Wu, Michael Riley, David Rybach, Matt Shannon, Cyril Allauzen:
Global Normalization for Streaming Speech Recognition in a Modular Framework. CoRR abs/2205.13674 (2022)
- [i9] Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey:
UserLibri: A Dataset for ASR Personalization Using Only Text. CoRR abs/2207.00706 (2022)
- [i8] Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. CoRR abs/2210.17049 (2022)
- [i7] Ehsan Variani, Ke Wu, David Rybach, Cyril Allauzen, Michael Riley:
Alignment Entropy Regularization. CoRR abs/2212.12442 (2022)
- 2021
- [c20] Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman:
Cascaded Encoders for Unifying Streaming and Non-Streaming ASR. ICASSP 2021: 5629-5633
- [c19] Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Ruoming Pang, David Rybach, Cyril Allauzen, Ehsan Variani, James Qin, Quoc-Nam Le-The, Shuo-Yiin Chang, Bo Li, Anmol Gulati, Jiahui Yu, Chung-Cheng Chiu, Diamantino Caseiro, Wei Li, Qiao Liang, Pat Rondon:
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling. Interspeech 2021: 1777-1781
- [c18] Cyril Allauzen, Ehsan Variani, Michael Riley, David Rybach, Hao Zhang:
A Hybrid Seq-2-Seq ASR Design for On-Device and Server Applications. Interspeech 2021: 4044-4048
- 2020
- [c17] Ehsan Variani, David Rybach, Cyril Allauzen, Michael Riley:
Hybrid Autoregressive Transducer (HAT). ICASSP 2020: 6139-6143
- [c16] Ehsan Variani, Tongzhou Chen, James Apfel, Bhuvana Ramabhadran, Seungji Lee, Pedro J. Moreno:
Neural Oracle Search on N-BEST Hypotheses. ICASSP 2020: 7824-7828
- [i6] Erik McDermott, Hasim Sak, Ehsan Variani:
A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition. CoRR abs/2002.11268 (2020)
- [i5] Ehsan Variani, David Rybach, Cyril Allauzen, Michael Riley:
Hybrid Autoregressive Transducer (hat). CoRR abs/2003.07705 (2020)
- [i4] Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman:
Cascaded encoders for unifying streaming and non-streaming ASR. CoRR abs/2010.14606 (2020)
2010 – 2019
- 2019
- [c15] Erik McDermott, Hasim Sak, Ehsan Variani:
A Density Ratio Approach to Language Model Fusion in End-to-End Automatic Speech Recognition. ASRU 2019: 434-441
- [c14] Ehsan Variani, Ananda Theertha Suresh, Mitchel Weintraub:
West: Word Encoded Sequence Transducers. ICASSP 2019: 7340-7344
- 2018
- [c13] Ehsan Variani, Tom Bagby, Kamel Lahouel, Erik McDermott, Michiel Bacchiani:
Sampled Connectionist Temporal Classification. ICASSP 2018: 4959-4963
- [c12] Chanwoo Kim, Ehsan Variani, Arun Narayanan, Michiel Bacchiani:
Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models. INTERSPEECH 2018: 3028-3032
- [i3] Ehsan Variani, Ananda Theertha Suresh, Mitchel Weintraub:
WEST: Word Encoded Sequence Transducers. CoRR abs/1811.08417 (2018)
- 2017
- [j2] Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Bo Li, Arun Narayanan, Ehsan Variani, Michiel Bacchiani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 965-979 (2017)
- [c11] Bo Li, Tara N. Sainath, Arun Narayanan, Joe Caroselli, Michiel Bacchiani, Ananya Misra, Izhak Shafran, Hasim Sak, Golan Pundak, Kean K. Chin, Khe Chai Sim, Ron J. Weiss, Kevin W. Wilson, Ehsan Variani, Chanwoo Kim, Olivier Siohan, Mitchel Weintraub, Erik McDermott, Richard Rose, Matt Shannon:
Acoustic Modeling for Google Home. INTERSPEECH 2017: 399-403
- [c10] Ehsan Variani, Tom Bagby, Erik McDermott, Michiel Bacchiani:
End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow. INTERSPEECH 2017: 1641-1645
- [p1] Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Bo Li, Ehsan Variani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Raw Multichannel Processing Using Deep Neural Networks. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 105-133
- [i2] Chanwoo Kim, Ehsan Variani, Arun Narayanan, Michiel Bacchiani:
Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models. CoRR abs/1712.03439 (2017)
- 2016
- [j1] Hesam Sagha, Feipeng Li, Ehsan Variani, José del R. Millán, Ricardo Chavarriaga, Björn W. Schuller:
Stream fusion for multi-stream automatic speech recognition. Int. J. Speech Technol. 19(4): 669-675 (2016)
- [c9] Ehsan Variani, Tara N. Sainath, Izhak Shafran, Michiel Bacchiani:
Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling. INTERSPEECH 2016: 808-812
- [c8] Tara N. Sainath, Arun Narayanan, Ron J. Weiss, Ehsan Variani, Kevin W. Wilson, Michiel Bacchiani, Izhak Shafran:
Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction. INTERSPEECH 2016: 1971-1975
- 2015
- [c7] Ehsan Variani, Erik McDermott, Georg Heigold:
A Gaussian Mixture Model layer jointly optimized with discriminative features within a Deep Neural Network architecture. ICASSP 2015: 4270-4274
- [c6] Ehsan Variani, Kamel Lahouel, Avner Bar-Hen, Bruno Jedynak:
NON-adaptive policies for 20 questions target localization. ISIT 2015: 775-778
- [i1] Ehsan Variani, Kamel Lahouel, Avner Bar-Hen, Bruno Jedynak:
Non-Adaptative Policies for 20 Questions Target Localization. CoRR abs/1504.05996 (2015)
- 2014
- [c5] Ehsan Variani, Xin Lei, Erik McDermott, Ignacio López-Moreno, Javier Gonzalez-Dominguez:
Deep neural networks for small footprint text-dependent speaker verification. ICASSP 2014: 4052-4056
- 2013
- [c4] Hynek Hermansky, Ehsan Variani, Vijayaditya Peddinti:
Mean temporal distance: Predicting ASR error from temporal properties of speech signal. ICASSP 2013: 7423-7426
- [c3] Ehsan Variani, Feipeng Li, Hynek Hermansky:
Multi-stream recognition of noisy speech with performance monitoring. INTERSPEECH 2013: 2978-2981
- 2012
- [c2] Ehsan Variani, Hynek Hermansky:
Estimating Classifier Performance in Unknown Noise. INTERSPEECH 2012: 1800-1803
- 2011
- [c1] Ehsan Variani, Thomas Schaaf:
VTLN in the MFCC Domain: Band-Limited versus Local Interpolation. INTERSPEECH 2011: 1273-1276
last updated on 2024-08-05 21:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license