Yatharth Saraf
2020 – today
- 2022
- [c19] Chunxi Liu, Michael Picheny, Leda Sari, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf:
  Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions. ICASSP 2022: 6162-6166
- [c18] Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexis Conneau, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli:
  Improved Language Identification Through Cross-Lingual Self-Supervised Learning. ICASSP 2022: 6877-6881
- [c17] Sangeeta Srivastava, Yun Wang, Andros Tjandra, Anurag Kumar, Chunxi Liu, Kritika Singh, Yatharth Saraf:
  Conformer-Based Self-Supervised Learning For Non-Speech Audio Tasks. ICASSP 2022: 8862-8866
- [c16] Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
  XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale. INTERSPEECH 2022: 2278-2282
- [c15] Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
  Scaling ASR Improves Zero and Few Shot Learning. INTERSPEECH 2022: 5135-5139
- [i20] Laxmi Pandey, Debjyoti Paul, Pooja Chitkara, Yutong Pang, Xuedong Zhang, Kjell Schubert, Mark Chou, Shu Liu, Yatharth Saraf:
  Improving Data Driven Inverse Text Normalization using Data Augmentation. CoRR abs/2207.09674 (2022)
- [i19] Pooja Chitkara, Morgane Rivière, Jade Copet, Frank Zhang, Yatharth Saraf:
  Pushing the performances of ASR models on English and Spanish accents. CoRR abs/2212.12048 (2022)
- 2021
- [c14] Vimal Manohar, Tatiana Likhomanenko, Qiantong Xu, Wei-Ning Hsu, Ronan Collobert, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
  Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition. ASRU 2021: 518-525
- [c13] Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer:
  On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models. ASRU 2021: 1026-1033
- [c12] Leda Sari, Kritika Singh, Jiatong Zhou, Lorenzo Torresani, Nayan Singhal, Yatharth Saraf:
  A Multi-View Approach to Audio-Visual Speaker Verification. ICASSP 2021: 6194-6198
- [c11] Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
  Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776
- [c10] Xiaohui Zhang, Frank Zhang, Chunxi Liu, Kjell Schubert, Julian Chan, Pradyot Prakash, Jun Liu, Ching-Feng Yeh, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig:
  Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR. SLT 2021: 46-51
- [c9] Chunxi Liu, Frank Zhang, Duc Le, Suyoun Kim, Yatharth Saraf, Geoffrey Zweig:
  Improving RNN Transducer Based ASR with Auxiliary Tasks. SLT 2021: 172-179
- [c8] Ashutosh Pandey, Chunxi Liu, Yun Wang, Yatharth Saraf:
  Dual Application of Speech Enhancement for Automatic Speech Recognition. SLT 2021: 223-228
- [i18] Leda Sari, Kritika Singh, Jiatong Zhou, Lorenzo Torresani, Nayan Singhal, Yatharth Saraf:
  A Multi-View Approach To Audio-Visual Speaker Verification. CoRR abs/2102.06291 (2021)
- [i17] Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
  Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021)
- [i16] Vimal Manohar, Tatiana Likhomanenko, Qiantong Xu, Wei-Ning Hsu, Ronan Collobert, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
  Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition. CoRR abs/2106.07759 (2021)
- [i15] Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli:
  Improved Language Identification Through Cross-Lingual Self-Supervised Learning. CoRR abs/2107.04082 (2021)
- [i14] Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer:
  On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models. CoRR abs/2107.04154 (2021)
- [i13] Sangeeta Srivastava, Yun Wang, Andros Tjandra, Anurag Kumar, Chunxi Liu, Kritika Singh, Yatharth Saraf:
  Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks. CoRR abs/2110.07313 (2021)
- [i12] Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
  Scaling ASR Improves Zero and Few Shot Learning. CoRR abs/2111.05948 (2021)
- [i11] Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
  XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale. CoRR abs/2111.09296 (2021)
- [i10] Chunxi Liu, Michael Picheny, Leda Sari, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf:
  Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions. CoRR abs/2111.09983 (2021)
- 2020
- [c7] Yan Xia, Yu Sun, Tian Wang, Juan Caicedo Carvajal, Jinliang Fan, Bhargav Mangipudi, Lisa Huang, Yatharth Saraf:
  Relevance Ranking for Real-Time Tweet Search. CIKM 2020: 2829-2836
- [c6] Kritika Singh, Dmytro Okhonko, Jun Liu, Yongqiang Wang, Frank Zhang, Ross B. Girshick, Sergey Edunov, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
  Training ASR Models By Generation of Contextual Information. ICASSP 2020: 7864-7868
- [c5] Mahaveer Jain, Gil Keren, Jay Mahadeokar, Geoffrey Zweig, Florian Metze, Yatharth Saraf:
  Contextual RNN-T for Open Domain ASR. INTERSPEECH 2020: 11-15
- [c4] Frank Zhang, Yongqiang Wang, Xiaohui Zhang, Chunxi Liu, Yatharth Saraf, Geoffrey Zweig:
  Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces. INTERSPEECH 2020: 976-980
- [c3] Da-Rong Liu, Chunxi Liu, Frank Zhang, Gabriel Synnaeve, Yatharth Saraf, Geoffrey Zweig:
  Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model. INTERSPEECH 2020: 3650-3654
- [c2] Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
  Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR. INTERSPEECH 2020: 3770-3774
- [c1] Chunxi Liu, Qiaochu Zhang, Xiaohui Zhang, Kritika Singh, Yatharth Saraf, Geoffrey Zweig:
  Multilingual Graphemic Hybrid ASR with Massive Data Augmentation. SLTU-CCURL@LREC 2020: 46-52
- [i9] Da-Rong Liu, Chunxi Liu, Frank Zhang, Gabriel Synnaeve, Yatharth Saraf, Geoffrey Zweig:
  Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model. CoRR abs/2005.07394 (2020)
- [i8] Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
  Large scale weakly and semi-supervised learning for low-resource video ASR. CoRR abs/2005.07850 (2020)
- [i7] Frank Zhang, Yongqiang Wang, Xiaohui Zhang, Chunxi Liu, Yatharth Saraf, Geoffrey Zweig:
  Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces. CoRR abs/2005.09150 (2020)
- [i6] Mahaveer Jain, Gil Keren, Jay Mahadeokar, Yatharth Saraf:
  Contextual RNN-T For Open Domain ASR. CoRR abs/2006.03411 (2020)
- [i5] Chunxi Liu, Frank Zhang, Duc Le, Suyoun Kim, Yatharth Saraf, Geoffrey Zweig:
  Improving RNN Transducer Based ASR with Auxiliary Tasks. CoRR abs/2011.03109 (2020)
- [i4] Ashutosh Pandey, Chunxi Liu, Yun Wang, Yatharth Saraf:
  Dual Application of Speech Enhancement for Automatic Speech Recognition. CoRR abs/2011.03840 (2020)
- [i3] Xiaohui Zhang, Frank Zhang, Chunxi Liu, Kjell Schubert, Julian Chan, Pradyot Prakash, Jun Liu, Ching-Feng Yeh, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig:
  Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR. CoRR abs/2011.04785 (2020)
2010 – 2019
- 2019
- [i2] Chunxi Liu, Qiaochu Zhang, Xiaohui Zhang, Kritika Singh, Yatharth Saraf, Geoffrey Zweig:
  Multilingual ASR with Massive Data Augmentation. CoRR abs/1909.06522 (2019)
- [i1] Kritika Singh, Dmytro Okhonko, Jun Liu, Yongqiang Wang, Frank Zhang, Ross B. Girshick, Sergey Edunov, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
  Training ASR models by Generation of Contextual Information. CoRR abs/1910.12367 (2019)
2000 – 2009
- 2008
- [j2] Yatharth Saraf, Raman Balasubramanian, Krishnan Swaminathan:
  Computing the curve-skeletons of images. Int. J. Comput. Math. 85(2): 253-270 (2008)
- 2005
- [j1] Yatharth Saraf, Raman Balasubramanian, Krishnan Swaminathan:
  A classical approach for thinning of binary images using divergence of the potential field. Int. J. Comput. Math. 82(6): 673-684 (2005)
last updated on 2024-07-21 22:02 CEST by the dblp team
all metadata released as open data under CC0 1.0 license