default search action

combined dblp search
author search
venue search
publication search

ask others

Ann Lee 0001

> Home > Persons

Person information

affiliation: Facebook, USA
affiliation (PhD 2016): Massachusetts Institute of Technology, Cambridge, USA

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HwangKPGC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HwangKPGC024
Min-Jae Hwang, Ilia Kulikov, Benjamin N. Peloquin, Hongyu Gong, Peng-Jen Chen, Ann Lee:
Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation. ACL (Findings) 2024: 15524-15541
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenSN0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenSN0H24
Peng-Jen Chen, Bowen Shi, Kelvin Niu, Ann Lee, Wei-Ning Hsu:
M2BART: Multilingual and Multimodal Encoder-Decoder Pre-Training for Any-to-Any Machine Translation. ICASSP 2024: 11896-11900
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02733
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02733
Min-Jae Hwang, Ilia Kulikov, Benjamin N. Peloquin, Hongyu Gong, Peng-Jen Chen, Ann Lee:
Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation. CoRR abs/2406.02733 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-13720
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-13720
Adam Polyak, Amit Zohar, Andrew Brown, Andros Tjandra, Animesh Sinha, Ann Lee, Apoorv Vyas, Bowen Shi, Chih-Yao Ma, Ching-Yao Chuang, David Yan, Dhruv Choudhary, Dingkang Wang, Geet Sethi, Guan Pang, Haoyu Ma, Ishan Misra, Ji Hou, Jialiang Wang, Kiran Jagadeesh, Kunpeng Li, Luxin Zhang, Mannat Singh, Mary Williamson, Matt Le, Matthew Yu, Mitesh Kumar Singh, Peizhao Zhang, Peter Vajda, Quentin Duval, Rohit Girdhar, Roshan Sumbaly, Sai Saketh Rambhatla, Sam S. Tsai, Samaneh Azadi, Samyak Datta, Sanyuan Chen, Sean Bell, Sharadh Ramaswamy, Shelly Sheynin, Siddharth Bhattacharya, Simran Motwani, Tao Xu, Tianhe Li, Tingbo Hou, Wei-Ning Hsu, Xi Yin, Xiaoliang Dai, Yaniv Taigman, Yaqiao Luo, Yen-Cheng Liu, Yi-Chiao Wu, Yue Zhao, Yuval Kirstain, Zecheng He, Zijian He, Albert Pumarola, Ali K. Thabet, Artsiom Sanakoyeu, Arun Mallya, Baishan Guo, Boris Araya, Breena Kerr, Carleigh Wood, Ce Liu, Cen Peng, Dmitry Vengertsev, Edgar Schönfeld, Elliot Blanchard, Felix Juefei-Xu, Fraylie Nord, Jeff Liang, John Hoffman, Jonas Kohler, Kaolin Fire, Karthik Sivakumar, Lawrence Chen, Licheng Yu, Luya Gao, Markos Georgopoulos, Rashel Moritz, Sara K. Sampson, Shikai Li, Simone Parmeggiani, Steve Fine, Tara Fowler, Vladan Petrovic, Yuming Du:
Movie Gen: A Cast of Media Foundation Models. CoRR abs/2410.13720 (2024)
2023
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ChenTYDKCTDSGIP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChenTYDKCTDSGIP23
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee:
Speech-to-Speech Translation for a Real-world Unwritten Language. ACL (Findings) 2023: 4969-4983
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/InagumaPKCWC00023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/InagumaPKCWC00023
Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. ACL (1) 2023: 15655-15680
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/DuquenneGDD0GW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/DuquenneGDD0GW023
Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswami, Changhan Wang, Juan Pino, Benoît Sagot, Holger Schwenk:
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations. ACL (1) 2023: 16251-16269
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangPKWGSALC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangPKWGSALC23
Wen-Chin Huang, Benjamin N. Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen:
A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation. ICASSP 2023: 1-5
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiHCGGWLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiHCGGWLL23
Jiatong Shi, Chan-Jan Hsu, Ho-Lam Chung, Dongji Gao, Paola García, Shinji Watanabe, Ann Lee, Hung-Yi Lee:
Bridging Speech and Textual Pre-Trained Models With Unsupervised ASR. ICASSP 2023: 1-5
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiTLIWPW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiTLIWPW23
Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe:
Enhancing Speech-To-Speech Translation with Multiple TTS Targets. ICASSP 2023: 1-5
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/GatKN0CSDA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/GatKN0CSDA23
Itai Gat, Felix Kreuk, Tu Anh Nguyen, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi:
Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling. IWSLT@ACL 2023: 465-477
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-10606
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-10606
Wen-Chin Huang, Benjamin N. Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen:
A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation. CoRR abs/2301.10606 (2023)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-04618
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-04618
Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe:
Enhancing Speech-to-Speech Translation with Multiple TTS Targets. CoRR abs/2304.04618 (2023)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-08655
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-08655
Hongyu Gong, Ning Dong, Sravya Popuri, Vedanuj Goswami, Ann Lee, Juan Pino:
Multilingual Speech-to-Speech Translation into Multiple Target Languages. CoRR abs/2307.08655 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-11596
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-11596
Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, Prangthip Hansanti, Russ Howes, Bernie Huang, Min-Jae Hwang, Hirofumi Inaguma, Somya Jain, Elahe Kalbassi, Amanda Kallet, Ilia Kulikov, Janice Lam, Daniel Li, Xutai Ma, Ruslan Mavlyutov, Benjamin N. Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, Anna Y. Sun, Kevin Tran, Tuan Tran, Igor Tufanov, Vish Vogeti, Carleigh Wood, Yilin Yang, Bokai Yu, Pierre Andrews, Can Balioglu, Marta R. Costa-jussà, Onur Celebi, Maha Elbayad, Cynthia Gao, Francisco Guzmán, Justine Kao, Ann Lee, Alexandre Mourachko, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang:
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation. CoRR abs/2308.11596 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-05187
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-05187
Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alexandre Mourachko, Benjamin N. Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Y. Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson:
Seamless: Multilingual Expressive and Streaming Speech Translation. CoRR abs/2312.05187 (2023)
2022
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LeeCWGPMPAHTPH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LeeCWGPMPAHTPH22
Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Sravya Popuri, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu:
Direct Speech-to-Speech Translation With Discrete Units. ACL (1) 2022: 3327-3339
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KharitonovLPACL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KharitonovLPACL22
Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. ACL (1) 2022: 8666-8681
[c19]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/KahnPLXHCT0GASL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/KahnPLXHCT0GASL22
Jacob D. Kahn, Vineel Pratap, Tatiana Likhomanenko, Qiantong Xu, Awni Y. Hannun, Jeff Cai, Paden Tomasello, Ann Lee, Edouard Grave, Gilad Avidov, Benoit Steiner, Vitaliy Liptchinsky, Gabriel Synnaeve, Ronan Collobert:
Flashlight: Enabling Innovation in Tools for Machine Learning. ICML 2022: 10557-10574
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PopuriCWPAGHL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PopuriCWPAGHL22
Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee:
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. INTERSPEECH 2022: 5195-5199
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/LeeGDSCWPAPGH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/LeeGDSCWPAPGH22
Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Miguel Pino, Jiatao Gu, Wei-Ning Hsu:
Textless Speech-to-Speech Translation on Real Data. NAACL-HLT 2022: 860-872
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-12465
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-12465
Jacob Kahn, Vineel Pratap, Tatiana Likhomanenko, Qiantong Xu, Awni Y. Hannun, Jeff Cai, Paden Tomasello, Ann Lee, Edouard Grave, Gilad Avidov, Benoit Steiner, Vitaliy Liptchinsky, Gabriel Synnaeve, Ronan Collobert:
Flashlight: Enabling Innovation in Tools for Machine Learning. CoRR abs/2201.12465 (2022)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-07359
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-07359
Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
textless-lib: a Library for Textless Spoken Language Processing. CoRR abs/2202.07359 (2022)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-02967
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-02967
Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee:
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. CoRR abs/2204.02967 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-15483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-15483
Itai Gat, Felix Kreuk, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi:
On The Robustness of Self-Supervised Representations for Spoken Language Modeling. CoRR abs/2209.15483 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-03025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-03025
Jiatong Shi, Chan-Jan Hsu, Ho-Lam Chung, Dongji Gao, Paola García, Shinji Watanabe, Ann Lee, Hung-yi Lee:
Bridging Speech and Textual Pre-trained Models with Unsupervised ASR. CoRR abs/2211.03025 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-04508
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-04508
Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswami, Changhan Wang, Juan Miguel Pino, Benoît Sagot, Holger Schwenk:
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations. CoRR abs/2211.04508 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-06474
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-06474
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Miguel Pino, Wei-Ning Hsu, Ann Lee:
Speech-to-Speech Translation For A Real-world Unwritten Language. CoRR abs/2211.06474 (2022)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-08055
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08055
Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. CoRR abs/2212.08055 (2022)
2021
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/WangRLWTHWPD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangRLWTHWPD20
Changhan Wang, Morgane Rivière, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Miguel Pino, Emmanuel Dupoux:
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation. ACL/IJCNLP (1) 2021: 993-1003
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/0001AR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/0001AR20
Ann Lee, Michael Auli, Marc'Aurelio Ranzato:
Discriminative Reranking for Neural Machine Translation. ACL/IJCNLP (1) 2021: 7250-7264
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/WangHAPLCGP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangHAPLCGP21
Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Pino:
fairseq S\^2: A Scalable and Integrable Speech Synthesis Toolkit. EMNLP (Demos) 2021: 143-152
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HsuSBLXPK0CSA21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HsuSBLXPK0CSA21
Wei-Ning Hsu, Anuroop Sriram, Alexei Baevski, Tatiana Likhomanenko, Qiantong Xu, Vineel Pratap, Jacob Kahn, Ann Lee, Ronan Collobert, Gabriel Synnaeve, Michael Auli:
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training. Interspeech 2021: 721-725
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/Hsu0SH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/Hsu0SH21
Wei-Ning Hsu, Ann Lee, Gabriel Synnaeve, Awni Y. Hannun:
Semi-Supervised end-to-end Speech Recognition via Local Prior Matching. SLT 2021: 125-132
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2101-00390
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-00390
Changhan Wang, Morgane Rivière, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Miguel Pino, Emmanuel Dupoux:
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation. CoRR abs/2101.00390 (2021)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-01027
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-01027
Wei-Ning Hsu, Anuroop Sriram, Alexei Baevski, Tatiana Likhomanenko, Qiantong Xu, Vineel Pratap, Jacob Kahn, Ann Lee, Ronan Collobert, Gabriel Synnaeve, Michael Auli:
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training. CoRR abs/2104.01027 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-05604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-05604
Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Miguel Pino, Wei-Ning Hsu:
Direct speech-to-speech translation with discrete units. CoRR abs/2107.05604 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-03264
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-03264
Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. CoRR abs/2109.03264 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-06912
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-06912
Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Miguel Pino:
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit. CoRR abs/2109.06912 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-08250
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-08250
Xutai Ma, Hongyu Gong, Danni Liu, Ann Lee, Yun Tang, Peng-Jen Chen, Wei-Ning Hsu, Kenneth Heafield, Phillip Koehn, Juan Miguel Pino:
Direct simultaneous speech to speech translation. CoRR abs/2110.08250 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-08352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-08352
Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Juan Miguel Pino, Jiatao Gu, Wei-Ning Hsu:
Textless Speech-to-Speech Translation on Real Data. CoRR abs/2112.08352 (2021)
2020
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Kahn0H20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Kahn0H20
Jacob Kahn, Ann Lee, Awni Y. Hannun:
Self-Training for End-to-End Speech Recognition. ICASSP 2020: 7084-7088
[c10]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/wmt/ChenLWGFWG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wmt/ChenLWGFWG20
Peng-Jen Chen, Ann Lee, Changhan Wang, Naman Goyal, Angela Fan, Mary Williamson, Jiatao Gu:
Facebook AI's WMT20 News Translation Task Submission. WMT@EMNLP 2020: 113-125
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-10336
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-10336
Wei-Ning Hsu, Ann Lee, Gabriel Synnaeve, Awni Y. Hannun:
Semi-Supervised Speech Recognition via Local Prior Matching. CoRR abs/2002.10336 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-08298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-08298
Peng-Jen Chen, Ann Lee, Changhan Wang, Naman Goyal, Angela Fan, Mary Williamson, Jiatao Gu:
Facebook AI's WMT20 News Translation Task Submission. CoRR abs/2011.08298 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-09543
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-09543
Lajanugen Logeswaran, Ann Lee, Myle Ott, Honglak Lee, Marc'Aurelio Ranzato, Arthur Szlam:
Few-shot Sequence Learning with Transformers. CoRR abs/2012.09543 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Hannun0XC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hannun0XC19
Awni Y. Hannun, Ann Lee, Qiantong Xu, Ronan Collobert:
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions. INTERSPEECH 2019: 3785-3789
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-02619
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-02619
Awni Y. Hannun, Ann Lee, Qiantong Xu, Ronan Collobert:
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions. CoRR abs/1904.02619 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-09116
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-09116
Jacob Kahn, Ann Lee, Awni Y. Hannun:
Self-Training for End-to-End Speech Recognition. CoRR abs/1909.09116 (2019)
2016
[b1]
- view
  - electronic edition via handle.net
  - details & citations
- export record
  dblp key:
  - phd/ndltd/Lee16a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/ndltd/Lee16a
Ann Lee:
Language-independent methods for computer-assisted pronunciation training. Massachusetts Institute of Technology, Cambridge, USA, 2016
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeeCG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeeCG16
Ann Lee, Nancy F. Chen, James R. Glass:
Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery. ICASSP 2016: 6145-6149
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HsuZLG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HsuZLG16
Wei-Ning Hsu, Yu Zhang, Ann Lee, James R. Glass:
Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition. INTERSPEECH 2016: 395-399
2015
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeG15
Ann Lee, James R. Glass:
Mispronunciation detection without nonnative training data. INTERSPEECH 2015: 643-647
2014
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeG14
Ann Lee, James R. Glass:
Context-dependent pronunciation error pattern discovery with limited annotations. INTERSPEECH 2014: 2877-2881
2013
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeeZG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeeZG13
Ann Lee, Yaodong Zhang, James R. Glass:
Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams. ICASSP 2013: 8227-8231
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/slte/LeeG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slte/LeeG13
Ann Lee, James R. Glass:
Pronunciation assessment via a comparison-based system. SLaTE 2013: 122-126
2012
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeG12
Ann Lee, James R. Glass:
Sentence Detection Using Multiple Annotations. INTERSPEECH 2012: 1848-1851
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LeeG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LeeG12
Ann Lee, James R. Glass:
A comparison-based approach to mispronunciation detection. SLT 2012: 382-387

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.