Yu-An Chung
2020 – today
- 2024
- [c24] Heng-Jui Chang, Ning Dong, Ruslan Mavlyutov, Sravya Popuri, Yu-An Chung:
COLLD: Contrastive Layer-to-Layer Distillation for Compressing Multilingual Pre-Trained Speech Encoders. ICASSP 2024: 10801-10805
- [i33] Heng-Jui Chang, Hongyu Gong, Changhan Wang, James R. Glass, Yu-An Chung:
DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models. CoRR abs/2410.24177 (2024)
- 2023
- [c23] Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee:
Speech-to-Speech Translation for a Real-world Unwritten Language. ACL (Findings) 2023: 4969-4983
- [c22] Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. ACL (1) 2023: 15655-15680
- [i32] Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, Prangthip Hansanti, Russ Howes, Bernie Huang, Min-Jae Hwang, Hirofumi Inaguma, Somya Jain, Elahe Kalbassi, Amanda Kallet, Ilia Kulikov, Janice Lam, Daniel Li, Xutai Ma, Ruslan Mavlyutov, Benjamin N. Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, Anna Y. Sun, Kevin Tran, Tuan Tran, Igor Tufanov, Vish Vogeti, Carleigh Wood, Yilin Yang, Bokai Yu, Pierre Andrews, Can Balioglu, Marta R. Costa-jussà, Onur Celebi, Maha Elbayad, Cynthia Gao, Francisco Guzmán, Justine Kao, Ann Lee, Alexandre Mourachko, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang:
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation. CoRR abs/2308.11596 (2023)
- [i31] Heng-Jui Chang, Ning Dong, Ruslan Mavlyutov, Sravya Popuri, Yu-An Chung:
CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders. CoRR abs/2309.07707 (2023)
- [i30] Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alexandre Mourachko, Benjamin N. Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Y. Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson:
Seamless: Multilingual Expressive and Streaming Speech Translation. CoRR abs/2312.05187 (2023)
- 2022
- [b1] Yu-An Chung:
Self-Supervised Learning for Speech Processing. MIT, USA, 2022
- [j2] Gene-Ping Yang, Sung-Lin Yeh, Yu-An Chung, James R. Glass, Hao Tang:
Autoregressive Predictive Coding: A Comprehensive Study. IEEE J. Sel. Top. Signal Process. 16(6): 1380-1390 (2022)
- [c21] Yuan Gong, Cheng-I Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. AAAI 2022: 10699-10709
- [i29] Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Miguel Pino, Wei-Ning Hsu, Ann Lee:
Speech-to-Speech Translation For A Real-world Unwritten Language. CoRR abs/2211.06474 (2022)
- [i28] Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. CoRR abs/2212.08055 (2022)
- 2021
- [j1] Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3292-3306 (2021)
- [c20] Yu-An Chung, Yu Zhang, Wei Han, Chung-Cheng Chiu, James Qin, Ruoming Pang, Yonghui Wu:
w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training. ASRU 2021: 244-250
- [c19] Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. ICASSP 2021: 3040-3044
- [c18] Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. Interspeech 2021: 571-575
- [c17] Alexander H. Liu, Yu-An Chung, James R. Glass:
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies. Interspeech 2021: 3730-3734
- [c16] Yu-An Chung, Chenguang Zhu, Michael Zeng:
SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding. NAACL-HLT 2021: 1897-1907
- [i27] Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation. CoRR abs/2102.01243 (2021)
- [i26] Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. CoRR abs/2104.01778 (2021)
- [i25] Yu-An Chung, Yu Zhang, Wei Han, Chung-Cheng Chiu, James Qin, Ruoming Pang, Yonghui Wu:
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training. CoRR abs/2108.06209 (2021)
- [i24] Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. CoRR abs/2110.09784 (2021)
- [i23] Ankur Bapna, Yu-An Chung, Nan Wu, Anmol Gulati, Ye Jia, Jonathan H. Clark, Melvin Johnson, Jason Riesa, Alexis Conneau, Yu Zhang:
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training. CoRR abs/2110.10329 (2021)
- 2020
- [c15] Yu-An Chung, James R. Glass:
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding. ACL 2020: 2353-2358
- [c14] Yu-An Chung, James R. Glass:
Generative Pre-Training for Speech with Autoregressive Predictive Coding. ICASSP 2020: 3497-3501
- [c13] Yu-An Chung, Hao Tang, James R. Glass:
Vector-Quantized Autoregressive Predictive Coding. INTERSPEECH 2020: 3760-3764
- [c12] Yu-An Chung, Shao-Wen Yang, Hsuan-Tien Lin:
Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation. TAAI 2020: 108-113
- [i22] Wei-Hung Weng, Yu-An Chung, Schrasing Tong:
Clinical Text Summarization with Syntax-Based Negation and Semantic Concept Identification. CoRR abs/2003.00353 (2020)
- [i21] Yu-An Chung, James R. Glass:
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding. CoRR abs/2004.05274 (2020)
- [i20] Yu-An Chung, Hao Tang, James R. Glass:
Vector-Quantized Autoregressive Predictive Coding. CoRR abs/2005.08392 (2020)
- [i19] Yu-An Chung, Chenguang Zhu, Michael Zeng:
Semi-Supervised Speech-Language Joint Pre-Training for Spoken Language Understanding. CoRR abs/2010.02295 (2020)
- [i18] Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. CoRR abs/2010.11481 (2020)
- [i17] Alexander H. Liu, Yu-An Chung, James R. Glass:
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies. CoRR abs/2011.00406 (2020)
2010 – 2019
- 2019
- [c11] Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Yu-An Chung, Yuxuan Wang, Yonghui Wu, James R. Glass:
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization. ICASSP 2019: 5901-5905
- [c10] Yu-An Chung, Yuxuan Wang, Wei-Ning Hsu, Yu Zhang, R. J. Skerry-Ryan:
Semi-supervised Training for Improving Data Efficiency in End-to-end Speech Synthesis. ICASSP 2019: 6940-6944
- [c9] Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Towards Unsupervised Speech-to-text Translation. ICASSP 2019: 7170-7174
- [c8] Yu-An Chung, Wei-Ning Hsu, Hao Tang, James R. Glass:
An Unsupervised Autoregressive Model for Speech Representation Learning. INTERSPEECH 2019: 146-150
- [c7] Wei-Hung Weng, Yu-An Chung, Peter Szolovits:
Unsupervised Clinical Language Translation. KDD 2019: 3121-3131
- [i16] Wei-Hung Weng, Yu-An Chung, Peter Szolovits:
Unsupervised Clinical Language Translation. CoRR abs/1902.01177 (2019)
- [i15] Yu-An Chung, Wei-Ning Hsu, Hao Tang, James R. Glass:
An Unsupervised Autoregressive Model for Speech Representation Learning. CoRR abs/1904.03240 (2019)
- [i14] Wei Fang, Yu-An Chung, James R. Glass:
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models. CoRR abs/1906.07307 (2019)
- [i13] Peter J. Liu, Yu-An Chung, Jie Ren:
SummAE: Zero-Shot Abstractive Text Summarization using Length-Agnostic Auto-Encoders. CoRR abs/1910.00998 (2019)
- [i12] Yu-An Chung, James R. Glass:
Generative Pre-Training for Speech with Autoregressive Predictive Coding. CoRR abs/1910.12607 (2019)
- 2018
- [c6] Yu-An Chung, James R. Glass:
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech. INTERSPEECH 2018: 811-815
- [c5] Yu-An Chung, Hung-yi Lee, James R. Glass:
Supervised and Unsupervised Transfer Learning for Question Answering. NAACL-HLT 2018: 1585-1594
- [c4] Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces. NeurIPS 2018: 7365-7375
- [i11] Yu-An Chung, James R. Glass:
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech. CoRR abs/1803.08976 (2018)
- [i10] Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces. CoRR abs/1805.07467 (2018)
- [i9] Yu-An Chung, Yuxuan Wang, Wei-Ning Hsu, Yu Zhang, R. J. Skerry-Ryan:
Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis. CoRR abs/1808.10128 (2018)
- [i8] Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Towards Unsupervised Speech-to-Text Translation. CoRR abs/1811.01307 (2018)
- 2017
- [i7] Yao-Yuan Yang, Shao-Chuan Lee, Yu-An Chung, Tung-En Wu, Si-An Chen, Hsuan-Tien Lin:
libact: Pool-based Active Learning in Python. CoRR abs/1710.00379 (2017)
- [i6] Yu-An Chung, James R. Glass:
Learning Word Embeddings from Speech. CoRR abs/1711.01515 (2017)
- [i5] Yu-An Chung, Hung-yi Lee, James R. Glass:
Supervised and Unsupervised Transfer Learning for Question Answering. CoRR abs/1711.05345 (2017)
- [i4] Yu-An Chung, Wei-Hung Weng:
Learning Deep Representations of Medical Images using Siamese CNNs with Application to Content-Based Image Retrieval. CoRR abs/1711.08490 (2017)
- 2016
- [c3] Yu-An Chung, Hsuan-Tien Lin, Shao-Wen Yang:
Cost-Aware Pre-Training for Multiclass Cost-Sensitive Deep Learning. IJCAI 2016: 1411-1417
- [c2] Yu-An Chung, Chao-Chung Wu, Chia-Hao Shen, Hung-yi Lee, Lin-Shan Lee:
Audio Word2Vec: Unsupervised Learning of Audio Segment Representations Using Sequence-to-Sequence Autoencoder. INTERSPEECH 2016: 765-769
- [i3] Yu-An Chung, Chao-Chung Wu, Chia-Hao Shen, Hung-yi Lee, Lin-Shan Lee:
Audio Word2Vec: Unsupervised Learning of Audio Segment Representations using Sequence-to-sequence Autoencoder. CoRR abs/1603.00982 (2016)
- [i2] Yu-An Chung, Hsuan-Tien Lin:
Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation. CoRR abs/1611.05134 (2016)
- 2015
- [c1] Chen-Wei Huang, Yu-An Chung, Pei-Shu Huang, Shiao-Li Tsao:
High-level energy consumption model of embedded graphic processors. DSP 2015: 105-109
- [i1] Yu-An Chung, Hsuan-Tien Lin, Shao-Wen Yang:
Cost-aware Pre-training for Multiclass Cost-sensitive Deep Learning. CoRR abs/1511.09337 (2015)
last updated on 2024-12-01 01:10 CET by the dblp team
all metadata released as open data under CC0 1.0 license