default search action

combined dblp search
author search
venue search
publication search

ask others

Ehsan Variani

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2023
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengWPSCVZLRR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengWPSCVZLRR23
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. ICASSP 2023: 1-5
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiWRAR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiWRAR23
Ehsan Variani, Ke Wu, David Rybach, Cyril Allauzen, Michael Riley:
Alignment Entropy Regularization. ICASSP 2023: 1-5
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuVBR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuVBR23
Ke Wu, Ehsan Variani, Tom Bagby, Michael Riley:
Last: Scalable Lattice-Based Speech Modelling in Jax. ICASSP 2023: 1-5
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08583
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. CoRR abs/2302.08583 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-13134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-13134
Ke Wu, Ehsan Variani, Tom Bagby, Michael Riley:
LAST: Scalable Lattice-Based Speech Modelling in JAX. CoRR abs/2304.13134 (2023)
2022
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GaurCVHRM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GaurCVHRM22
Neeraj Gaur, Tongzhou Chen, Ehsan Variani, Parisa Haghani, Bhuvana Ramabhadran, Pedro J. Moreno:
Multilingual Second-Pass Rescoring for Automatic Speech Recognition Systems. ICASSP 2022: 6407-6411
[c25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BreinerRVGMSGCM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BreinerRVGMSGCM22
Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey:
UserLibri: A Dataset for ASR Personalization Using Only Text. INTERSPEECH 2022: 694-698
[c24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangCSVPHRGMPSH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangCSVPHRGMPSH22
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. INTERSPEECH 2022: 1031-1035
[c23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Variani0RACR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Variani0RACR22
Ehsan Variani, Michael Riley, David Rybach, Cyril Allauzen, Tongzhou Chen, Bhuvana Ramabhadran:
On Adaptive Weight Interpolation of the Hybrid Autoregressive Transducer. INTERSPEECH 2022: 1646-1650
[c22]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/VarianiWRRSA22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/VarianiWRRSA22
Ehsan Variani, Ke Wu, Michael D. Riley, David Rybach, Matt Shannon, Cyril Allauzen:
Global Normalization for Streaming Speech Recognition in a Modular Framework. NeurIPS 2022
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MengCPZWAESRHVHM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MengCPZWAESRHVHM22
Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. SLT 2022: 197-204
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-07553
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-07553
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, W. Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach:
Improving Rare Word Recognition with LM-aware MWER Training. CoRR abs/2204.07553 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-13674
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-13674
Ehsan Variani, Ke Wu, Michael Riley, David Rybach, Matt Shannon, Cyril Allauzen:
Global Normalization for Streaming Speech Recognition in a Modular Framework. CoRR abs/2205.13674 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-00706
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-00706
Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey:
UserLibri: A Dataset for ASR Personalization Using Only Text. CoRR abs/2207.00706 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-17049
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-17049
Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. CoRR abs/2210.17049 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-12442
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-12442
Ehsan Variani, Ke Wu, David Rybach, Cyril Allauzen, Michael Riley:
Alignment Entropy Regularization. CoRR abs/2212.12442 (2022)
2021
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NarayananSPYCPV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NarayananSPYCPV21
Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman:
Cascaded Encoders for Unifying Streaming and Non-Streaming ASR. ICASSP 2021: 5629-5633
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SainathHNBPRAVQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathHNBPRAVQ21
Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Ruoming Pang, David Rybach, Cyril Allauzen, Ehsan Variani, James Qin, Quoc-Nam Le-The, Shuo-Yiin Chang, Bo Li, Anmol Gulati, Jiahui Yu, Chung-Cheng Chiu, Diamantino Caseiro, Wei Li, Qiao Liang, Pat Rondon:
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling. Interspeech 2021: 1777-1781
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AllauzenV0RZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AllauzenV0RZ21
Cyril Allauzen, Ehsan Variani, Michael Riley, David Rybach, Hao Zhang:
A Hybrid Seq-2-Seq ASR Design for On-Device and Server Applications. Interspeech 2021: 4044-4048
2020
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiRA020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiRA020
Ehsan Variani, David Rybach, Cyril Allauzen, Michael Riley:
Hybrid Autoregressive Transducer (HAT). ICASSP 2020: 6139-6143
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiCARLM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiCARLM20
Ehsan Variani, Tongzhou Chen, James Apfel, Bhuvana Ramabhadran, Seungji Lee, Pedro J. Moreno:
Neural Oracle Search on N-BEST Hypotheses. ICASSP 2020: 7824-7828
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-11268
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-11268
Erik McDermott, Hasim Sak, Ehsan Variani:
A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition. CoRR abs/2002.11268 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-07705
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-07705
Ehsan Variani, David Rybach, Cyril Allauzen, Michael Riley:
Hybrid Autoregressive Transducer (hat). CoRR abs/2003.07705 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-14606
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-14606
Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman:
Cascaded encoders for unifying streaming and non-streaming ASR. CoRR abs/2010.14606 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/McDermottSV19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/McDermottSV19
Erik McDermott, Hasim Sak, Ehsan Variani:
A Density Ratio Approach to Language Model Fusion in End-to-End Automatic Speech Recognition. ASRU 2019: 434-441
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiSW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiSW19
Ehsan Variani, Ananda Theertha Suresh, Mitchel Weintraub:
West: Word Encoded Sequence Transducers. ICASSP 2019: 7340-7344
2018
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiBLMB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiBLMB18
Ehsan Variani, Tom Bagby, Kamel Lahouel, Erik McDermott, Michiel Bacchiani:
Sampled Connectionist Temporal Classification. ICASSP 2018: 4959-4963
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimVNB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimVNB18
Chanwoo Kim, Ehsan Variani, Arun Narayanan, Michiel Bacchiani:
Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models. INTERSPEECH 2018: 3028-3032
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-08417
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-08417
Ehsan Variani, Ananda Theertha Suresh, Mitchel Weintraub:
WEST: Word Encoded Sequence Transducers. CoRR abs/1811.08417 (2018)
2017
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SainathWWLNVBSS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SainathWWLNVBSS17
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Bo Li, Arun Narayanan, Ehsan Variani, Michiel Bacchiani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 965-979 (2017)
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSNCBMSSPCSWWV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSNCBMSSPCSWWV17
Bo Li, Tara N. Sainath, Arun Narayanan, Joe Caroselli, Michiel Bacchiani, Ananya Misra, Izhak Shafran, Hasim Sak, Golan Pundak, Kean K. Chin, Khe Chai Sim, Ron J. Weiss, Kevin W. Wilson, Ehsan Variani, Chanwoo Kim, Olivier Siohan, Mitchel Weintraub, Erik McDermott, Richard Rose, Matt Shannon:
Acoustic Modeling for Google Home. INTERSPEECH 2017: 399-403
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VarianiBMB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VarianiBMB17
Ehsan Variani, Tom Bagby, Erik McDermott, Michiel Bacchiani:
End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow. INTERSPEECH 2017: 1641-1645
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/sp/17/SainathWWNBLVSSCMK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/17/SainathWWNBLVSSCMK17
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Bo Li, Ehsan Variani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Raw Multichannel Processing Using Deep Neural Networks. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 105-133
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1712-03439
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-03439
Chanwoo Kim, Ehsan Variani, Arun Narayanan, Michiel Bacchiani:
Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models. CoRR abs/1712.03439 (2017)
2016
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijst/SaghaLVMCS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijst/SaghaLVMCS16
Hesam Sagha, Feipeng Li, Ehsan Variani, José del R. Millán, Ricardo Chavarriaga, Björn W. Schuller:
Stream fusion for multi-stream automatic speech recognition. Int. J. Speech Technol. 19(4): 669-675 (2016)
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VarianiSSB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VarianiSSB16
Ehsan Variani, Tara N. Sainath, Izhak Shafran, Michiel Bacchiani:
Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling. INTERSPEECH 2016: 808-812
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SainathNWVWBS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathNWVWBS16
Tara N. Sainath, Arun Narayanan, Ron J. Weiss, Ehsan Variani, Kevin W. Wilson, Michiel Bacchiani, Izhak Shafran:
Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction. INTERSPEECH 2016: 1971-1975
2015
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiMH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiMH15
Ehsan Variani, Erik McDermott, Georg Heigold:
A Gaussian Mixture Model layer jointly optimized with discriminative features within a Deep Neural Network architecture. ICASSP 2015: 4270-4274
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/isit/VarianiLBJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isit/VarianiLBJ15
Ehsan Variani, Kamel Lahouel, Avner Bar-Hen, Bruno Jedynak:
NON-adaptive policies for 20 questions target localization. ISIT 2015: 775-778
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/VarianiLBJ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/VarianiLBJ15
Ehsan Variani, Kamel Lahouel, Avner Bar-Hen, Bruno Jedynak:
Non-Adaptative Policies for 20 Questions Target Localization. CoRR abs/1504.05996 (2015)
2014
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiLMMG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiLMMG14
Ehsan Variani, Xin Lei, Erik McDermott, Ignacio López-Moreno, Javier Gonzalez-Dominguez:
Deep neural networks for small footprint text-dependent speaker verification. ICASSP 2014: 4052-4056
2013
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HermanskyVP13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HermanskyVP13
Hynek Hermansky, Ehsan Variani, Vijayaditya Peddinti:
Mean temporal distance: Predicting ASR error from temporal properties of speech signal. ICASSP 2013: 7423-7426
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VarianiLH13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VarianiLH13
Ehsan Variani, Feipeng Li, Hynek Hermansky:
Multi-stream recognition of noisy speech with performance monitoring. INTERSPEECH 2013: 2978-2981
2012
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VarianiH12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VarianiH12
Ehsan Variani, Hynek Hermansky:
Estimating Classifier Performance in Unknown Noise. INTERSPEECH 2012: 1800-1803
2011
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VarianiS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VarianiS11
Ehsan Variani, Thomas Schaaf:
VTLN in the MFCC Domain: Band-Limited versus Local Interpolation. INTERSPEECH 2011: 1273-1276

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.