default search action

combined dblp search
author search
venue search
publication search

ask others

Rongjie Huang

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ijicte/HuangSZWMC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijicte/HuangSZWMC24
Rongjie Huang, Yusheng Sun, Zhifeng Zhang, Bo Wang, Junxia Ma, Yangyang Chu:
Capability Assessment of Cultivating Innovative Talents for Higher Schools Based on Machine Learning. Int. J. Inf. Commun. Technol. Educ. 20(1): 1-16 (2024)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YangLHWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YangLHWM24
Dongchao Yang, Songxiang Liu, Rongjie Huang, Chao Weng, Helen Meng:
InstructTTS: Modelling Expressive TTS in Discrete Latent Space With Natural Language Style Prompt. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2913-2925 (2024)
[c39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangHLHXCDHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangHLHXCDHZ24
Yu Zhang, Rongjie Huang, Ruiqi Li, Jinzheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis. AAAI 2024: 19597-19605
[c38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HuangLYSCYWHHLR24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HuangLYSCYWHHLR24
Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Yuexian Zou, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. AAAI 2024: 23802-23804
[c37]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/WangBHLHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangBHLHZ24
Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao:
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer. ACL (Student Research Workshop) 2024: 42-49
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LiuHHSSCZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiuHHSSCZ24
Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao:
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. ACL (Findings) 2024: 4230-4242
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HongHCWLYZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HongHCWLYZZ24
Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang:
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment. ACL (1) 2024: 6248-6261
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/Li0WHHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Li0WHHZ24
Ruiqi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao:
Robust Singing Voice Transcription Serves Synthesis. ACL (1) 2024: 9751-9766
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LiHWHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiHWHZ24
Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao:
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion. ACL (Findings) 2024: 9819-9831
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ChengHLWJYCDHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChengHLWJYCDHZ24
Xize Cheng, Rongjie Huang, Linjun Li, Zehan Wang, Tao Jin, Aoxiong Yin, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. ACL (Findings) 2024: 9973-9986
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HuangZWYTYLW0CS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangZWYTYLW0CS24
Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Jinchuan Tian, Zhenhui Ye, Luping Liu, Zehan Wang, Ziyue Jiang, Xuankai Chang, Jiatong Shi, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners. ACL (1) 2024: 10929-10942
[c30]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/01260LPHHWZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/01260LPHHWZ24
Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. EMNLP 2024: 1960-1975
[c29]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YeZ0YLH0HHL00MZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YeZ0YLH0HHL00MZ24
Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. ICLR 2024
[c28]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/0001ZCHLYHZ0GZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0001ZCHLYHZ0GZ24
Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. ICML 2024
[c27]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuangHW0C0YYLGZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuangHW0C0YYLGZ24
Rongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang, Xize Cheng, Ziyue Jiang, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao:
InstructSpeech: Following Speech Editing Instructions via Large Language Models. ICML 2024
[c26]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangT0HLGCSZ0ZW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangT0HLGCSZ0ZW24
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Haohan Guo, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Zhou Zhao, Xixin Wu, Helen M. Meng:
UniAudio: Towards Universal Audio Generation with Large Language Models. ICML 2024
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiuHLCWCZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiuHLCWCZZ24
Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Efficient and High-Quality Text-to-Audio Generation with Minimal Inference Steps. ACM Multimedia 2024: 7008-7017
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangWHXHYC00YL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangWHXHYC00YL24
Rongjie Huang, Yongqi Wang, Ruofan Hu, Xiaoshan Xu, Zhiqing Hong, Dongchao Yang, Xize Cheng, Zehan Wang, Ziyue Jiang, Zhenhui Ye, Luping Liu, Siqi Zheng, Zhou Zhao:
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation. ACM Multimedia 2024: 10630-10639
[c23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/WangHHHLLYJZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WangHHHLLYJZ24
Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao:
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt. NAACL-HLT 2024: 4780-4794
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-08503
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-08503
Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. CoRR abs/2401.08503 (2024)
[i54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-12208
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-12208
Shengpeng Ji, Minghui Fang, Ziyue Jiang, Rongjie Huang, Jialong Zuo, Shulei Wang, Zhou Zhao:
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models. CoRR abs/2402.12208 (2024)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-11780
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-11780
Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao:
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt. CoRR abs/2403.11780 (2024)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-09313
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-09313
Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang:
Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment. CoRR abs/2404.09313 (2024)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-04883
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-04883
Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. CoRR abs/2405.04883 (2024)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-05945
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-05945
Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li:
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. CoRR abs/2405.05945 (2024)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-09940
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-09940
Ruiqi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao:
Robust Singing Voice Transcription Serves Synthesis. CoRR abs/2405.09940 (2024)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-00320
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-00320
Yongqi Wang, Wenxiang Guo, Rongjie Huang, Jiawei Huang, Zehan Wang, Fuming You, Ruiqi Li, Zhou Zhao:
Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching. CoRR abs/2406.00320 (2024)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-00356
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-00356
Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Text-to-Audio Generation with Latent Consistency Models. CoRR abs/2406.00356 (2024)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-01205
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-01205
Shengpeng Ji, Jialong Zuo, Minghui Fang, Siqi Zheng, Qian Chen, Wen Wang, Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao:
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec. CoRR abs/2406.01205 (2024)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02429
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02429
Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao:
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion. CoRR abs/2406.02429 (2024)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10056
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10056
Dongchao Yang, Haohan Guo, Yuanyuan Wang, Rongjie Huang, Xiang Li, Xu Tan, Xixin Wu, Helen Meng:
UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner. CoRR abs/2406.10056 (2024)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-18583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18583
Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT. CoRR abs/2406.18583 (2024)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-02049
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-02049
Ruiqi Li, Zhiqing Hong, Yongqi Wang, Lichao Zhang, Rongjie Huang, Siqi Zheng, Zhou Zhao:
Accompanied Singing Voice Synthesis with Fully Text-controlled Melody. CoRR abs/2407.02049 (2024)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-11895
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-11895
Zehan Wang, Ziang Zhang, Hang Zhang, Luping Liu, Rongjie Huang, Xize Cheng, Hengshuang Zhao, Zhou Zhao:
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces. CoRR abs/2407.11895 (2024)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-13220
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-13220
Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Jiayang Xu, Zhou Zhao:
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control. CoRR abs/2407.13220 (2024)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-12102
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-12102
Luyao Cheng, Hui Wang, Siqi Zheng, Yafeng Chen, Rongjie Huang, Qinglin Zhang, Qian Chen, Xihao Li:
Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization. CoRR abs/2408.12102 (2024)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-13893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-13893
Dongchao Yang, Rongjie Huang, Yuanyuan Wang, Haohan Guo, Dading Chong, Songxiang Liu, Xixin Wu, Helen Meng:
SimpleSpeech 2: Towards Simple and Efficient Text-to-Speech with Flow-based Scalar Latent Transformer Diffusion Models. CoRR abs/2408.13893 (2024)
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-16532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-16532
Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Wen Wang, Zhou Zhao:
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. CoRR abs/2408.16532 (2024)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-15977
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-15977
Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. CoRR abs/2409.15977 (2024)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-06734
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-06734
Zhenhui Ye, Tianyun Zhong, Yi Ren, Ziyue Jiang, Jiawei Huang, Rongjie Huang, Jinglin Liu, Jinzheng He, Chen Zhang, Zehan Wang, Xize Chen, Xiang Yin, Zhou Zhao:
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes. CoRR abs/2410.06734 (2024)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-12266
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-12266
Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Heng Lu, Wei Xue, Zhou Zhao:
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation. CoRR abs/2410.12266 (2024)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-21269
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-21269
Xize Cheng, Siqi Zheng, Zehan Wang, Minghui Fang, Ziang Zhang, Rongjie Huang, Ziyang Ma, Shengpeng Ji, Jialong Zuo, Tao Jin, Zhou Zhao:
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup. CoRR abs/2410.21269 (2024)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-01805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-01805
Fuming You, Minghui Fang, Li Tang, Rongjie Huang, Yongqi Wang, Zhou Zhao:
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence. CoRR abs/2411.01805 (2024)
2023
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HeLYHCLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HeLYHCLZ23
Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao:
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis. ACL (Findings) 2023: 236-248
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/Huang0JCLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Huang0JCLZ23
Rongjie Huang, Yi Ren, Ziyue Jiang, Chenye Cui, Jinglin Liu, Zhou Zhao:
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis. ACL (Findings) 2023: 6994-7009
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LiHZLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiHZLZ23
Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao:
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment. ACL (Findings) 2023: 7074-7088
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HuangZRZ023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangZRZ023
Rongjie Huang, Chunlei Zhang, Yi Ren, Zhou Zhao, Dong Yu:
Prosody-TTS: Improving Prosody with Masked Autoencoder and Conditional Diffusion Model For Expressive Text-to-Speech. ACL (Findings) 2023: 8018-8034
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HuangLC0LYHZLYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangLC0LYHZLYZ23
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. ACL (1) 2023: 8590-8604
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/YeHRJLHYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/YeHRJLHYZ23
Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training. ACL (1) 2023: 9317-9331
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LiJCWLHZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LiJCWLHZ23
Linjun Li, Tao Jin, Xize Cheng, Ye Wang, Wang Lin, Rongjie Huang, Zhou Zhao:
Contrastive Token-Wise Meta-Learning for Unseen Performer Visual Temporal-Aligned Translation. ACL (Findings) 2023: 10993-11007
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/JiangYZYHRZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/JiangYZYHRZ23
Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. ACL (Findings) 2023: 11655-11671
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/LiuHLXZCHZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LiuHLXZCHZ23
Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao:
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. EMNLP 2023: 15957-15969
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CuiZRLHCWHW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CuiZRLHCWHW23
Chenye Cui, Zhou Zhao, Yi Ren, Jinglin Liu, Rongjie Huang, Feiyang Chen, Zhefeng Wang, Baoxing Huai, Fei Wu:
VarietySound: Timbre-Controllable Video to Sound Generation Via Unsupervised Information Disentanglement. ICASSP 2023: 1-5
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/ChengJHLLWWLYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ChengJHLLWWLYZ23
Xize Cheng, Tao Jin, Rongjie Huang, Linjun Li, Wang Lin, Zehan Wang, Ye Wang, Huadai Liu, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. ICCV 2023: 15689-15699
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HuangLL0ZHZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HuangLL0ZHZ23
Rongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He, Zhou Zhao:
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. ICLR 2023
[c10]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuangHY0LLYLYZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuangHY0LLYLYZ23
Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. ICML 2023: 13916-13932
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HongCHZLHZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HongCHZLHZ23
Zhiqing Hong, Chenye Cui, Rongjie Huang, Lichao Zhang, Jinglin Liu, Jinzheng He, Zhou Zhao:
UniSinger: Unified End-to-End Singing Voice Synthesis With Cross-Modality Information Matching. ACM Multimedia 2023: 7569-7579
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-12661
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-12661
Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. CoRR abs/2301.12661 (2023)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-13662
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-13662
Dongchao Yang, Songxiang Liu, Rongjie Huang, Guangzhi Lei, Chao Weng, Helen Meng, Dong Yu:
InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt. CoRR abs/2301.13662 (2023)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-05309
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-05309
Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. CoRR abs/2303.05309 (2023)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-12995
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-12995
Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. CoRR abs/2304.12995 (2023)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-00787
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-00787
Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao:
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation. CoRR abs/2305.00787 (2023)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-02765
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-02765
Dongchao Yang, Songxiang Liu, Rongjie Huang, Jinchuan Tian, Chao Weng, Yuexian Zou:
HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec. CoRR abs/2305.02765 (2023)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-04476
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-04476
Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao:
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment. CoRR abs/2305.04476 (2023)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-10686
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10686
Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao:
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis. CoRR abs/2305.10686 (2023)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-10763
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10763
Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training. CoRR abs/2305.10763 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12552
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12552
Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao:
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. CoRR abs/2305.12552 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12708
Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao:
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. CoRR abs/2305.12708 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13612
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13612
Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. CoRR abs/2305.13612 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15403
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15403
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. CoRR abs/2305.15403 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18474
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18474
Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao:
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation. CoRR abs/2305.18474 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-19269
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-19269
Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Luping Liu, Zhenhui Ye, Ziyue Jiang, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Unified Voice Synthesis With Discrete Representation. CoRR abs/2305.19269 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-02236
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02236
Luping Liu, Zijian Zhang, Yi Ren, Rongjie Huang, Xiang Yin, Zhou Zhao:
Detector Guidance for Multi-Object Text-to-Image Generation. CoRR abs/2306.02236 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03509
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03509
Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias. CoRR abs/2306.03509 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07566
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07566
Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao:
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer. CoRR abs/2309.07566 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00704
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00704
Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng:
UniAudio: An Audio Foundation Model Toward Universal Audio Generation. CoRR abs/2310.00704 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-08168
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-08168
Haifeng Huang, Zehan Wang, Rongjie Huang, Luping Liu, Xize Cheng, Yang Zhao, Tao Jin, Zhou Zhao:
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers. CoRR abs/2312.08168 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-10741
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10741
Yu Zhang, Rongjie Huang, Ruiqi Li, Jinzheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis. CoRR abs/2312.10741 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-15197
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-15197
Xize Cheng, Rongjie Huang, Linjun Li, Tao Jin, Zehan Wang, Aoxiong Yin, Minglei Li, Xinyu Duan, Changpeng Yang, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. CoRR abs/2312.15197 (2023)
2022
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/cma/HuangXZGLW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cma/HuangXZGLW22
Rongjie Huang, Guizhong Xie, Yudong Zhong, Hongrui Geng, Hao Li, Liangwen Wang:
Boundary element analysis of thin structures using a dual transformation method for weakly singular boundary integrals. Comput. Math. Appl. 113: 198-213 (2022)
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/HuangL0S00Z22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/HuangL0S00Z22
Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. IJCAI 2022: 4157-4163
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangCC0LZHW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangCC0LZHW22
Rongjie Huang, Chenye Cui, Feiyang Chen, Yi Ren, Jinglin Liu, Zhou Zhao, Baoxing Huai, Zhefeng Wang:
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation. ACM Multimedia 2022: 2525-2535
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangZLLC022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangZLLC022
Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech. ACM Multimedia 2022: 2595-2605
[c5]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Huang0LCZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Huang0LCZ22
Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech. NeurIPS 2022
[c4]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangLWDL0HHZCZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangLWDL0HHZCZ22
Lichao Zhang, Ruiqi Li, Shoutong Wang, Liqun Deng, Jinglin Liu, Yi Ren, Jinzheng He, Rongjie Huang, Jieming Zhu, Xiao Chen, Zhou Zhao:
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus. NeurIPS 2022
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-09934
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-09934
Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. CoRR abs/2204.09934 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-07211
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-07211
Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis. CoRR abs/2205.07211 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-12523
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-12523
Rongjie Huang, Zhou Zhao, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He:
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. CoRR abs/2205.12523 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-06389
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-06389
Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech. CoRR abs/2207.06389 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-10666
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-10666
Chenye Cui, Yi Ren, Jinglin Liu, Rongjie Huang, Zhou Zhao:
VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement. CoRR abs/2211.10666 (2022)
2021
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Cui0LCHLZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Cui0LCHLZ21
Chenye Cui, Yi Ren, Jinglin Liu, Feiyang Chen, Rongjie Huang, Ming Lei, Zhou Zhao:
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model. Interspeech 2021: 2766-2770
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuangC0LCZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuangC0LCZ21
Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus. ACM Multimedia 2021: 3945-3954
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-09317
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-09317
Chenye Cui, Yi Ren, Jinglin Liu, Feiyang Chen, Rongjie Huang, Ming Lei, Zhou Zhao:
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model. CoRR abs/2106.09317 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-11514
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-11514
Max W. Y. Lam, Jun Wang, Rongjie Huang, Dan Su, Dong Yu:
Bilateral Denoising Diffusion Models. CoRR abs/2108.11514 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-07468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-07468
Feiyang Chen, Rongjie Huang, Chenye Cui, Yi Ren, Jinglin Liu, Zhou Zhao, Nicholas Jing Yuan, Baoxing Huai:
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation. CoRR abs/2110.07468 (2021)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-10358
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-10358
Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus. CoRR abs/2112.10358 (2021)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2017
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/smartcom/CaiHYJMLS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/smartcom/CaiHYJMLS17
Shubin Cai, Rongjie Huang, Ningsheng Yang, Jinwen Jiang, Zhong Ming, Zhengping Liang, Zhiguang Shan:
Research on Dynamic Safe Loading Techniques in Android Application Protection System. SmartCom 2017: 134-143

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.