default search action

combined dblp search
author search
venue search
publication search

ask others

Yinghao Aaron Li

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JiangHLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JiangHLM24
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani:
Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation. ICASSP 2024: 1281-1285
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-17671
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-17671
Gavin Mischler, Yinghao Aaron Li, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani:
Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain. CoRR abs/2401.17671 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-03710
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-03710
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani:
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience. CoRR abs/2402.03710 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-09732
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-09732
Xilin Jiang, Yinghao Aaron Li, Adrian Nicolas Florea, Cong Han, Nima Mesgarani:
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis. CoRR abs/2407.09732 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-11849
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-11849
Yinghao Aaron Li, Xilin Jiang, Jordan Darefsky, Ge Zhu, Nima Mesgarani:
Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation. CoRR abs/2408.11849 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-10058
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-10058
Yinghao Aaron Li, Xilin Jiang, Cong Han, Nima Mesgarani:
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion. CoRR abs/2409.10058 (2024)
2023
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/embc/HanCLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/embc/HanCLM23
Cong Han, Vishal Choudhari, Yinghao Aaron Li, Nima Mesgarani:
Improved Decoding of Attentional Selection in Multi-Talker Environments with Self-Supervised Learned Speech Representation. EMBC 2023: 1-5
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiHJM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiHJM23
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani:
Phoneme-Level Bert for Enhanced Prosody of Text-To-Speech with Grapheme Predictions. ICASSP 2023: 1-5
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiangLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiangLM23
Xilin Jiang, Yinghao Aaron Li, Nima Mesgarani:
DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes. INTERSPEECH 2023: 2818-2822
[c4]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiHRMM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiHRMM23
Yinghao Aaron Li, Cong Han, Vinay S. Raghavan, Gavin Mischler, Nima Mesgarani:
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models. NeurIPS 2023
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/LiHM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/LiHM23
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs. WASPAA 2023: 1-5
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-08810
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-08810
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani:
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions. CoRR abs/2301.08810 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-05756
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-05756
Cong Han, Vishal Choudhari, Yinghao Aaron Li, Nima Mesgarani:
Improved Decoding of Attentional Selection in Multi-Talker Environments with Self-Supervised Learned Speech Representation. CoRR abs/2302.05756 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18441
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18441
Xilin Jiang, Yinghao Aaron Li, Nima Mesgarani:
DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes. CoRR abs/2305.18441 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-07691
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-07691
Yinghao Aaron Li, Cong Han, Vinay S. Raghavan, Gavin Mischler, Nima Mesgarani:
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models. CoRR abs/2306.07691 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-09435
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-09435
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs. CoRR abs/2307.09435 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-09493
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09493
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani:
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform. CoRR abs/2309.09493 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-15938
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15938
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani:
Exploring Self-Supervised Contrastive Learning of Spatial Sound Event Representation. CoRR abs/2309.15938 (2023)
2022
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiHM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiHM22
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
Styletts-VC: One-Shot Voice Conversion by Knowledge Transfer From Style-Based TTS Models. SLT 2022: 920-927
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-15439
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-15439
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis. CoRR abs/2205.15439 (2022)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-14227
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-14227
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models. CoRR abs/2212.14227 (2022)
2021
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiZM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZM21
Yinghao Aaron Li, Ali Zare, Nima Mesgarani:
StarGANv2-VC: A Diverse, Unsupervised, Non-Parallel Framework for Natural-Sounding Voice Conversion. Interspeech 2021: 1349-1353
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-10394
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-10394
Yinghao Aaron Li, Ali Zare, Nima Mesgarani:
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion. CoRR abs/2107.10394 (2021)

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.