default search action

combined dblp search
author search
venue search
publication search

ask others

Li-Rong Dai 0001

Lirong Dai 0001

> Home > Persons

Person information

affiliation: University of Science and Technology of China, National Engineering Laboratory for Speech and Language Information Processing, Hefei, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j60]
- view
  authority control:
- export record
  dblp key:
  - journals/jpdc/DaiGAXD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jpdc/DaiGAXD24
Li-Rong Dai, Luqi Gong, Zhulin An, Yongjun Xu, Boyu Diao:
Sketch-fusion: A gradient compression method with multi-layer fusion for communication-efficient distributed training. J. Parallel Distributed Comput. 185: 104811 (2024)
[j59]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ZhuHLTD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhuHLTD24
Jian Zhu, Zhangmin Huang, Lei Liu, Chang Tang, Li-Rong Dai:
Boosted Curriculum Multi-View Hashing for Multimedia Retrieval. IEEE Signal Process. Lett. 31: 2065-2069 (2024)
[j58]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangCZWRLYGDLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangCZWRLYGDLW24
Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Li-Rong Dai, Jinyu Li, Furu Wei:
SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2177-2187 (2024)
[j57]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/ZhuZZLJZDJLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ZhuZZLJZDJLW24
Qiushi Zhu, Long Zhou, Ziqiang Zhang, Shujie Liu, Binxing Jiao, Jie Zhang, Li-Rong Dai, Daxin Jiang, Jinyu Li, Furu Wei:
VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning. IEEE Trans. Multim. 26: 1055-1064 (2024)
[j56]
- view
  authority control:
- export record
  dblp key:
  - journals/tsc/YuSHGWBDX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tsc/YuSHGWBDX24
Dianhai Yu, Liang Shen, Hongxiang Hao, Weibao Gong, HuaChao Wu, Jiang Bian, Lirong Dai, Haoyi Xiong:
MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services. IEEE Trans. Serv. Comput. 17(5): 2626-2639 (2024)
[c276]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhuZGH024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhuZGH024
Qiushi Zhu, Jie Zhang, Yu Gu, Yuchen Hu, Lirong Dai:
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation. AAAI 2024: 19768-19776
[c275]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Wang0CZYZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Wang0CZYZ024
Yichi Wang, Jie Zhang, Shihao Chen, Weitai Zhang, Zhongyi Ye, Xinyuan Zhou, Lirong Dai:
A Study of Multichannel Spatiotemporal Features and Knowledge Distillation on Robust Target Speaker Extraction. ICASSP 2024: 431-435
[c274]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CuiGWZC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CuiGWZC024
Jianwei Cui, Yu Gu, Chao Weng, Jie Zhang, Liping Chen, Lirong Dai:
Sifisinger: A High-Fidelity End-to-End Singing Voice Synthesizer Based on Source-Filter Model. ICASSP 2024: 11126-11130
[c273]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenCZLL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenCZLL024
Shihao Chen, Liping Chen, Jie Zhang, Kong-Aik Lee, Zhenhua Ling, Lirong Dai:
Adversarial Speech for Voice Privacy Protection from Personalized Speech Generation. ICASSP 2024: 11411-11415
[c272]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangZLYZL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangZLYZL024
Weitai Zhang, Hanyi Zhang, Chenxuan Liu, Zhongyi Ye, Xinyuan Zhou, Chao Lin, Lirong Dai:
Pre-Trained Acoustic-and-Textual Modeling for End-To-End Speech-To-Text Translation. ICASSP 2024: 11451-11455
[c271]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/XiongD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/XiongD24
Shifu Xiong, Li-Rong Dai:
Exploring Semi-Supervised, Subcategory Classification and Subwords Alignment for Visual Wake Word Spotting. ICME Workshops 2024: 1-6
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-03468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-03468
Qiushi Zhu, Jie Zhang, Yu Gu, Yuchen Hu, Lirong Dai:
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation. CoRR abs/2401.03468 (2024)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-11857
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-11857
Shihao Chen, Liping Chen, Jie Zhang, Kong-Aik Lee, Zhenhua Ling, Lirong Dai:
Adversarial speech for voice privacy protection from Personalized Speech generation. CoRR abs/2401.11857 (2024)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-05325
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05325
Shihao Chen, Yu Gu, Jie Zhang, Na Li, Rilin Chen, Liping Chen, Lirong Dai:
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance. CoRR abs/2406.05325 (2024)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-12354
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-12354
Shihao Chen, Yu Gu, Jianwei Cui, Jie Zhang, Rilin Chen, Li-Rong Dai:
LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation. CoRR abs/2408.12354 (2024)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-12536
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-12536
Jianwei Cui, Yu Gu, Chao Weng, Jie Zhang, Liping Chen, Li-Rong Dai:
SiFiSinger: A High-Fidelity End-to-End Singing Voice Synthesizer based on Source-filter Model. CoRR abs/2410.12536 (2024)
2023
[j55]
- view
  authority control:
- export record
  dblp key:
  - journals/jocnet/YangXSSLCJDP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jocnet/YangXSSLCJDP23
Hongyu Yang, Yiyuan Xie, Tingting Song, Ye Su, Bocheng Liu, Junxiong Chai, Xiao Jiang, Lirong Dai, Jing Pang:
Universal wavelength reuse mechanism for optical networks-on-chip based on a cooperative game. J. Opt. Commun. Netw. 15(6): 367-380 (2023)
[j54]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangTDD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangTDD23
Jie Zhang, Rui Tao, Jun Du, Li-Rong Dai:
Energy-Efficient Sparsity-Driven Speech Enhancement in Wireless Acoustic Sensor Networks. IEEE ACM Trans. Audio Speech Lang. Process. 31: 215-228 (2023)
[j53]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhuZZD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhuZZD23
Qiu-Shi Zhu, Jie Zhang, Ziqiang Zhang, Li-Rong Dai:
A Joint Speech Enhancement and Self-Supervised Representation Learning Framework for Noise-Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1927-1939 (2023)
[j52]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangTDD23a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangTDD23a
Jie Zhang, Rui Tao, Jun Du, Li-Rong Dai:
SDW-SWF: Speech Distortion Weighted Single-Channel Wiener Filter for Noise Reduction. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3176-3189 (2023)
[c270]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DengZZYZCD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DengZZYZCD23
Pan Deng, Jie Zhang, Xinyuan Zhou, Zhongyi Ye, Weitai Zhang, Jianwei Cui, Lirong Dai:
Learning Semantic Information from Machine Translation to Improve Speech-to-Text Translation. APSIPA ASC 2023: 954-959
[c269]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ShiZDYCZD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ShiZDYCZD23
Mohan Shi, Jie Zhang, Zhihao Du, Fan Yu, Qian Chen, Shiliang Zhang, Li-Rong Dai:
A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings. APSIPA ASC 2023: 1943-1948
[c268]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuSZDMZZLX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuSZDMZZLX23
Hang-Rui Hu, Yan Song, Jian-Tao Zhang, Li-Rong Dai, Ian McLoughlin, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue:
Stargan-vc Based Cross-Domain Data Augmentation for Speaker Verification. ICASSP 2023: 1-5
[c267]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiSDMFL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiSDMFL23
Kang Li, Yan Song, Li-Rong Dai, Ian McLoughlin, Xin Fang, Lin Liu:
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer. ICASSP 2023: 1-5
[c266]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuWZYWGFD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuWZYWGFD23
Haitao Xu, Liangfa Wei, Jie Zhang, Jianming Yang, Yannan Wang, Tian Gao, Xin Fang, Li-Rong Dai:
A Multi-Scale Feature Aggregation Based Lightweight Network for Audio-Visual Speech Enhancement. ICASSP 2023: 1-5
[c265]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZengSZZLXDM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZengSZZLXDM23
Xiao-Min Zeng, Yan Song, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue, Li-Rong Dai, Ian McLoughlin:
Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection. ICASSP 2023: 1-5
[c264]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhuZZLHD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhuZZLHD23
Qiu-Shi Zhu, Long Zhou, Jie Zhang, Shujie Liu, Yu-Chen Hu, Li-Rong Dai:
Robust Data2VEC: Noise-Robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning. ICASSP 2023: 1-5
[c263]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/HeHYWXYL0X23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/HeHYWXYL0X23
Shan He, Haonan He, Shuo Yang, Xiaoyan Wu, Pengcheng Xia, Bing Yin, Cong Liu, Lirong Dai, Chang Xu:
Speech4Mesh: Speech-Assisted Monocular 3D Facial Reconstruction for Speech-Driven 3D Facial Animation. ICCV 2023: 14146-14156
[c262]
- view
  authority control:
- export record
  dblp key:
  - conf/icig/WuLZYYLD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icig/WuLZYYLD23
Jiajia Wu, Anni Li, Kun Zhao, Zhengyan Yang, Bing Yin, Cong Liu, Li-Rong Dai:
A Multimodal Text Block Segmentation Framework for Photo Translation. ICIG (3) 2023: 116-127
[c261]
- view
  authority control:
- export record
  dblp key:
  - conf/icig/WuZYYLD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icig/WuZYYLD23
Jiajia Wu, Kun Zhao, Zhengyan Yang, Bing Yin, Cong Liu, Li-Rong Dai:
End-to-End Multilingual Text Recognition Based on Byte Modeling. ICIG (3) 2023: 128-137
[c260]
- view
  authority control:
- export record
  dblp key:
  - conf/icig/HuLYZWDD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icig/HuLYZWDD23
Jinshui Hu, Chenyu Liu, Qiandong Yan, Xuyang Zhu, Jiajia Wu, Jun Du, Li-Rong Dai:
Vision-Language Adaptive Mutual Decoder for OOV-STR. ICIG (2) 2023: 175-186
[c259]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Zeng000023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Zeng000023
Xiao-Min Zeng, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai:
Robust Prototype Learning for Anomalous Sound Detection. INTERSPEECH 2023: 261-265
[c258]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Li000L023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Li000L023
Kang Li, Yan Song, Ian McLoughlin, Lin Liu, Jin Li, Li-Rong Dai:
Fine-tuning Audio Spectrogram Transformer with Task-aware Adapters for Sound Event Detection. INTERSPEECH 2023: 291-295
[c257]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShiD0YLZ0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiD0YLZ0023
Mohan Shi, Zhihao Du, Qian Chen, Fan Yu, Yangze Li, Shiliang Zhang, Jie Zhang, Li-Rong Dai:
CASA-ASR: Context-Aware Speaker-Attributed ASR. INTERSPEECH 2023: 411-415
[c256]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShiSZ0Z0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiSZ0Z0023
Mohan Shi, Yuchun Shu, Lingyun Zuo, Qian Chen, Shiliang Zhang, Jie Zhang, Li-Rong Dai:
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction. INTERSPEECH 2023: 5047-5051
[c255]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Wang0023a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Wang0023a
Jingyuan Wang, Jie Zhang, Li-Rong Dai:
Real-Time Causal Spectro-Temporal Voice Activity Detection Based on Convolutional Encoding and Residual Decoding. INTERSPEECH 2023: 5062-5066
[c254]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/DengCZZD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/DengCZZD23
Pan Deng, Shihao Chen, Weitai Zhang, Jie Zhang, Lirong Dai:
The USTC's Dialect Speech Translation System for IWSLT 2023. IWSLT@ACL 2023: 102-112
[c253]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/ZhouCYWXZZD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/ZhouCYWXZZD23
Xinyuan Zhou, Jianwei Cui, Zhongyi Ye, Yichi Wang, Luzhen Xu, Hanyi Zhang, Weitai Zhang, Lirong Dai:
Submission of USTC's System for the IWSLT 2023 - Offline Speech Translation Track. IWSLT@ACL 2023: 194-201
[c252]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuWCLWYYYLD023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuWCLWYYYLD023
Jinshui Hu, Hao Wu, Mingjun Chen, Chenyu Liu, Jiajia Wu, Shi Yin, Baocai Yin, Bing Yin, Cong Liu, Jun Du, Lirong Dai:
Handwritten Chemical Structure Image to Structure-Specific Markup Using Random Conditional Guided Decoder. ACM Multimedia 2023: 8114-8124
[c251]
- view
  authority control:
- export record
  dblp key:
  - conf/ssp/ZhangTD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssp/ZhangTD23
Jie Zhang, Rui Tao, Li-Rong Dai:
A Speech Distortion Weighted Single-Channel Wiener Filter Based STFT-Domain Noise Reduction. SSP 2023: 527-531
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-03689
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-03689
Kang Li, Yan Song, Li-Rong Dai, Ian McLoughlin, Xin Fang, Lin Liu:
AST-SED: An Effective Sound Event Detection Method Based on Audio Spectrogram Transformer. CoRR abs/2303.03689 (2023)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12111
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12111
Xiao-Min Zeng, Yan Song, Zhu Zhuo, Yu Zhou, Yu-Hong Li, Hui Xue, Li-Rong Dai, Ian McLoughlin:
Joint Generative-Contrastive Representation Learning for Anomalous Sound Detection. CoRR abs/2305.12111 (2023)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12450
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12450
Mohan Shi, Yuchun Shu, Lingyun Zuo, Qian Chen, Shiliang Zhang, Jie Zhang, Li-Rong Dai:
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction. CoRR abs/2305.12450 (2023)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12459
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12459
Mohan Shi, Zhihao Du, Qian Chen, Fan Yu, Yangze Li, Shiliang Zhang, Jie Zhang, Li-Rong Dai:
CASA-ASR: Context-Aware Speaker-Attributed ASR. CoRR abs/2305.12459 (2023)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-14553
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14553
Qiushi Zhu, Yu Gu, Chao Weng, Yuchen Hu, Lirong Dai, Jie Zhang:
Rep2wav: Noise Robust text-to-speech Using self-supervised representations. CoRR abs/2308.14553 (2023)
2022
[j51]
- view
  authority control:
- export record
  dblp key:
  - journals/cssp/ZhangSWFMD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cssp/ZhangSWFMD22
Zi-qiang Zhang, Yan Song, Ming-Hui Wu, Xin Fang, Ian McLoughlin, Li-Rong Dai:
Cross-Lingual Self-training to Learn Multilingual Representation for Low-Resource Speech Recognition. Circuits Syst. Signal Process. 41(12): 6827-6843 (2022)
[j50]
- view
  authority control:
- export record
  dblp key:
  - journals/pr/WuDWYJHYZD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pr/WuDWYJHYZD22
Jiajia Wu, Jun Du, Fengren Wang, Chen Yang, Xinzhe Jiang, Jinshui Hu, Bing Yin, Jianshu Zhang, Lirong Dai:
A multimodal attention fusion network with a dynamic vocabulary for TextVQA. Pattern Recognit. 122: 108214 (2022)
[j49]
- view
  authority control:
- export record
  dblp key:
  - journals/twc/ZhangZD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/twc/ZhangZD22
Jie Zhang, Guanghui Zhang, Li-Rong Dai:
Frequency-Invariant Sensor Selection for MVDR Beamforming in Wireless Acoustic Sensor Networks. IEEE Trans. Wirel. Commun. 21(12): 10648-10661 (2022)
[c250]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhangZA0D0W22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangZA0D0W22
Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu, Lirong Dai, Jinyu Li, Furu Wei:
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training. EMNLP 2022: 1663-1676
[c249]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenSDML22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenSDML22
Han Chen, Yan Song, Li-Rong Dai, Ian McLoughlin, Lin Liu:
Self-Supervised Representation Learning for Unsupervised Anomalous Sound Detection Under Domain Shift. ICASSP 2022: 471-475
[c248]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenZZD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenZZD22
Xing-Yu Chen, Qiu-Shi Zhu, Jie Zhang, Li-Rong Dai:
Supervised and Self-Supervised Pretraining Based Covid-19 Detection Using Acoustic Breathing/Cough/Speech Signals. ICASSP 2022: 561-565
[c247]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhuZZWFD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhuZZWFD22
Qiu-Shi Zhu, Jie Zhang, Zi-qiang Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
A Noise-Robust Self-Supervised Pre-Training Model Based Speech Representation Learning for Automatic Speech Recognition. ICASSP 2022: 3174-3178
[c246]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenZD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenZD22
Xing-Yu Chen, Jie Zhang, Li-Rong Dai:
Reference Microphone Selection and Low-Rank Approximation Based Multichannel Wiener Filter with Application to Speech Recognition. ICASSP 2022: 4963-4967
[c245]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuSLDML22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuSLDML22
Hang-Rui Hu, Yan Song, Ying Liu, Li-Rong Dai, Ian McLoughlin, Lin Liu:
Domain Robust Deep Embedding Learning for Speaker Recognition. ICASSP 2022: 7182-7186
[c244]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiSDML22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiSDML22
Yuxuan Xi, Yan Song, Li-Rong Dai, Ian McLoughlin, Lin Liu:
Frontend Attributes Disentanglement for Speech Emotion Recognition. ICASSP 2022: 7712-7716
[c243]
- view
  authority control:
- export record
  dblp key:
  - conf/icip/Zhang0ZWF022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/Zhang0ZWF022
Ziqiang Zhang, Jie Zhang, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Lirong Dai:
Learning Contextually Fused Audio-Visual Representations For Audio-Visual Speech Recognition. ICIP 2022: 1346-1350
[c242]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/GanZWF022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/GanZWF022
Ao-Ran Gan, Jie Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
An Experimental Comparison between Low-Resource Semi-Supervised and High-Resource Supervised Automatic Speech Recognition Models. ICME 2022: 1-6
[c241]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/WuHC0NW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/WuHC0NW22
Jiajia Wu, Jinshui Hu, Mingjun Chen, Lirong Dai, Xuejing Niu, Ning Wang:
Structural String Decoder for Handwritten Mathematical Expression Recognition. ICPR 2022: 3246-3251
[c240]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuZ022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuZ022
Hai-tao Xu, Jie Zhang, Li-Rong Dai:
Differential Time-frequency Log-mel Spectrogram Features for Vision Transformer Based Infant Cry Recognition. INTERSPEECH 2022: 1963-1967
[c239]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DuZZ0WFY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DuZZ0WFY22
Ye-Qian Du, Jie Zhang, Qiu-Shi Zhu, Lirong Dai, Ming-Hui Wu, Xin Fang, Zhou-Wang Yang:
A Complementary Joint Training Approach Using Unpaired Speech and Text A Complementary Joint Training Approach Using Unpaired Speech and Text. INTERSPEECH 2022: 2613-2617
[c238]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AoZZ00K00QW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AoZZ00K00QW22
Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. INTERSPEECH 2022: 2658-2662
[c237]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Hu000L22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hu000L22
Hang-Rui Hu, Yan Song, Li-Rong Dai, Ian McLoughlin, Lin Liu:
Class-Aware Distribution Alignment based Unsupervised Domain Adaptation for Speaker Verification. INTERSPEECH 2022: 3689-3693
[c236]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhongSWSLPFDZD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhongSWSLPFDZD22
Guolong Zhong, Hongyu Song, Ruoyu Wang, Lei Sun, Diyuan Liu, Jia Pan, Xin Fang, Jun Du, Jie Zhang, Lirong Dai:
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge. INTERSPEECH 2022: 4860-4864
[c235]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/ZhangYTLZYCLLD22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/ZhangYTLZYCLLD22
Weitai Zhang, Zhongyi Ye, Haitao Tang, Xiaoxi Li, Xinyuan Zhou, Jing Yang, Jianwei Cui, Dan Liu, Junhua Liu, Lirong Dai:
The USTC-NELSLIP Offline Speech Translation Systems for IWSLT 2022. IWSLT@ACL 2022: 198-207
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-08930
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-08930
Qiu-Shi Zhu, Jie Zhang, Zi-qiang Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition. CoRR abs/2201.08930 (2022)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-08934
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-08934
Xing-Yu Chen, Qiu-Shi Zhu, Jie Zhang, Li-Rong Dai:
Supervised and Self-supervised Pretraining Based COVID-19 Detection Using Acoustic Breathing/Cough/Speech Signals. CoRR abs/2201.08934 (2022)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-07428
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-07428
Zi-qiang Zhang, Jie Zhang, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition. CoRR abs/2202.07428 (2022)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-17113
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-17113
Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. CoRR abs/2203.17113 (2022)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-02023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-02023
Ye-Qian Du, Jie Zhang, Qiu-Shi Zhu, Li-Rong Dai, Ming-Hui Wu, Xin Fang, Zhou-Wang Yang:
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition. CoRR abs/2204.02023 (2022)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-13293
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-13293
Qiu-Shi Zhu, Jie Zhang, Zi-qiang Zhang, Li-Rong Dai:
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR. CoRR abs/2205.13293 (2022)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-03730
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-03730
Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu, Lirong Dai, Jinyu Li, Furu Wei:
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training. CoRR abs/2210.03730 (2022)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15324
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15324
Qiu-Shi Zhu, Long Zhou, Jie Zhang, Shujie Liu, Yu-Chen Hu, Lirong Dai:
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning. CoRR abs/2210.15324 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-11275
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-11275
Qiu-Shi Zhu, Long Zhou, Ziqiang Zhang, Shujie Liu, Binxing Jiao, Jie Zhang, Lirong Dai, Daxin Jiang, Jinyu Li, Furu Wei:
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning. CoRR abs/2211.11275 (2022)
2021
[j48]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/ChenDHDYL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/ChenDHDYL21
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement. Neural Networks 143: 171-182 (2021)
[j47]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangCDH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangCDH21
Jie Zhang, Huawei Chen, Li-Rong Dai, Richard Christian Hendriks:
A Study on Reference Microphone Selection for Multi-Microphone Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 29: 671-683 (2021)
[j46]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangDD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangDD21
Jie Zhang, Jun Du, Li-Rong Dai:
Sensor Selection for Relative Acoustic Transfer Function Steered Linearly-Constrained Beamformers. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1220-1232 (2021)
[j45]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhouLD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhouLD21
Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
UnitNet: A Sequence-to-Sequence Acoustic Model for Concatenative Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2643-2655 (2021)
[j44]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TangZSMD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TangZSMD21
Jian Tang, Jie Zhang, Yan Song, Ian McLoughlin, Li-Rong Dai:
Multi-Granularity Sequence Alignment Mapping for Encoder-Decoder Based End-to-End ASR. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2816-2828 (2021)
[j43]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/ZhangDYSD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ZhangDYSD21
Jianshu Zhang, Jun Du, Yongxin Yang, Yi-Zhe Song, Lirong Dai:
SRD: A Tree Structure Based Decoder for Online Handwritten Mathematical Expression Recognition. IEEE Trans. Multim. 23: 2471-2480 (2021)
[c234]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangRL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangRL021
Jing-Xuan Zhang, Korin Richmond, Zhen-Hua Ling, Lirong Dai:
TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis. AAAI 2021: 14402-14410
[c233]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhengSML021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhengSML021
Xu Zheng, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Improved Mean Teacher Based Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection. ICASSP 2021: 356-360
[c232]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuS0L021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuS0L021
Ying Liu, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Effective Deep Embedding Learning Method Based on Dense-Residual Networks for Speaker Verification. ICASSP 2021: 6683-6687
[c231]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Zheng00ML21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Zheng00ML21
Xu Zheng, Yan Song, Li-Rong Dai, Ian McLoughlin, Lin Liu:
An Effective Mutual Mean Teaching Based Domain Adaptation Method for Sound Event Detection. Interspeech 2021: 556-560
[c230]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLSF0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLSF0021
Hui Wang, Lin Liu, Yan Song, Lei Fang, Ian McLoughlin, Li-Rong Dai:
A Weight Moving Average Based Alternate Decoupled Learning Algorithm for Long-Tailed Language Identification. Interspeech 2021: 1499-1503
[c229]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenD00YL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenD00YL21
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries. Interspeech 2021: 3001-3005
[c228]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouL021
Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
UnitNet-Based Hybrid Speech Synthesis. Interspeech 2021: 4119-4123
[c227]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuZWF021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuZWF021
Qiu-Shi Zhu, Jie Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition. Interspeech 2021: 4334-4338
[c226]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/LiuDLHD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/LiuDLHD21
Dan Liu, Mengge Du, Xiaoxi Li, Yuchen Hu, Lirong Dai:
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021. IWSLT 2021: 30-38
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-08207
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-08207
Zi-qiang Zhang, Yan Song, Ming-Hui Wu, Xin Fang, Li-Rong Dai:
XLST: Cross-lingual Self-training to Learn Multilingual Representation for Low Resource Speech Recognition. CoRR abs/2103.08207 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-00279
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-00279
Dan Liu, Mengge Du, Xiaoxi Li, Yuchen Hu, Lirong Dai:
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021. CoRR abs/2107.00279 (2021)
2020
[j42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/TangHSDM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/TangHSDM20
Jian Tang, Junfeng Hou, Yan Song, Li-Rong Dai, Ian McLoughlin:
Effective Exploitation of Posterior Information for Attention-Based Speech Recognition. IEEE Access 8: 108988-108999 (2020)
[j41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/HouGSD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/HouGSD20
Junfeng Hou, Wu Guo, Yan Song, Li-Rong Dai:
Segment boundary detection directed attention for online end-to-end speech recognition. EURASIP J. Audio Speech Music. Process. 2020(1): 3 (2020)
[j40]
- view
  authority control:
- export record
  dblp key:
  - journals/pr/ZhangDD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pr/ZhangDD20
Jianshu Zhang, Jun Du, Lirong Dai:
Radical analysis network for learning hierarchies of Chinese characters. Pattern Recognit. 103: 107305 (2020)
[j39]
- view
  authority control:
- export record
  dblp key:
  - journals/talip/ZhouLD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/talip/ZhouLD20
Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
Learning and Modeling Unit Embeddings Using Deep Neural Networks for Unit-Selection-Based Mandarin Speech Synthesis. ACM Trans. Asian Low Resour. Lang. Inf. Process. 19(3): 38:1-38:14 (2020)
[j38]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangLD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangLD20
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Non-Parallel Sequence-to-Sequence Voice Conversion With Disentangled Linguistic and Speaker Representations. IEEE ACM Trans. Audio Speech Lang. Process. 28: 540-552 (2020)
[c225]
- view
  - electronic edition @ ieee.org
  - details & citations
- export record
  dblp key:
  - conf/apsipa/WeiZHD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WeiZHD20
Liangfa Wei, Jie Zhang, Junfeng Hou, Lirong Dai:
Attentive Fusion Enhanced Audio-Visual Encoding for Transformer Based Robust Speech Recognition. APSIPA 2020: 638-643
[c224]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/LiuCZ0HL020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/LiuCZ0HL020
Li-Juan Liu, Yan-Nian Chen, Jing-Xuan Zhang, Yuan Jiang, Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
Non-Parallel Voice Conversion with Autoregressive Conversion Model and Duration Adjustment. Blizzard Challenge / Voice Conversion Challenge 2020
[c223]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/ZhangLCH0L020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/ZhangLCH0L020
Jing-Xuan Zhang, Li-Juan Liu, Yan-Nian Chen, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling, Li-Rong Dai:
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer. Blizzard Challenge / Voice Conversion Challenge 2020
[c222]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YanSDM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YanSDM20
Jie Yan, Yan Song, Li-Rong Dai, Ian McLoughlin:
Task-Aware Mean Teacher Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection. ICASSP 2020: 326-330
[c221]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangSLMD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangSLMD20
Hui Wang, Yan Song, Zengxi Li, Ian McLoughlin, Li-Rong Dai:
An Online Speaker-aware Speech Separation Approach Based on Time-domain Representation. ICASSP 2020: 6379-6383
[c220]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuGDD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuGDD20
Bin Gu, Wu Guo, Lirong Dai, Jun Du:
An Improved Deep Neural Network for Modeling Speaker Characteristics at Different Temporal Scales. ICASSP 2020: 6814-6818
[c219]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DingGDD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DingGDD20
Fenglin Ding, Wu Guo, Lirong Dai, Jun Du:
Attention-Based Gated Scaling Adaptive Acoustic Model for CTC-Based Speech Recognition. ICASSP 2020: 7404-7408
[c218]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouLD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouLD20
Xiao Zhou, Zhen-Hua Ling, Li-Rong Dai:
Extracting Unit Embeddings Using Sequence-To-Sequence Acoustic Models for Unit Selection Speech Synthesis. ICASSP 2020: 7659-7663
[c217]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/ZhangDYSW020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhangDYSW020
Jianshu Zhang, Jun Du, Yongxin Yang, Yi-Zhe Song, Si Wei, Lirong Dai:
A Tree-Structured Decoder for Image-to-Markup Generation. ICML 2020: 11076-11085
[c216]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangL020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangL020
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning. INTERSPEECH 2020: 771-775
[c215]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Zheng0Y0ML20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Zheng0Y0ML20
Xu Zheng, Yan Song, Jie Yan, Li-Rong Dai, Ian McLoughlin, Lin Liu:
An Effective Perturbation Based Semi-Supervised Learning Method for Sound Event Detection. INTERSPEECH 2020: 841-845
[c214]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Liu0JML020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Liu0JML020
Ying Liu, Yan Song, Yiheng Jiang, Ian McLoughlin, Lin Liu, Li-Rong Dai:
An Effective Speaker Recognition Method Based on Joint Identification and Verification Supervisions. INTERSPEECH 2020: 3007-3011
[c213]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangSZM020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangSZM020
Zi-qiang Zhang, Yan Song, Jian-Shu Zhang, Ian McLoughlin, Li-Rong Dai:
Semi-Supervised End-to-End ASR via Teacher-Student Learning with Conditional Posterior Distribution. INTERSPEECH 2020: 3580-3584
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-00129
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-00129
Fenglin Ding, Wu Guo, Lirong Dai, Jun Du:
Attentive batch normalization for lstm-based acoustic modeling of speech recognition. CoRR abs/2001.00129 (2020)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-02371
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-02371
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning. CoRR abs/2008.02371 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-02686
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-02686
Liangfa Wei, Jie Zhang, Junfeng Hou, Lirong Dai:
Attentive Fusion Enhanced Audio-Visual Encoding for Transformer Based Robust Speech Recognition. CoRR abs/2008.02686 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-01475
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-01475
Jing-Xuan Zhang, Li-Juan Liu, Yan-Nian Chen, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling, Li-Rong Dai:
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer. CoRR abs/2009.01475 (2020)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-09561
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-09561
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Bao-Cai Yin, Chin-Hui Lee:
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement. CoRR abs/2009.09561 (2020)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2012-14360
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-14360
Hang Chen, Jun Du, Yu Hu, Li-Rong Dai, Chin-Hui Lee, Bao-Cai Yin:
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention. CoRR abs/2012.14360 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j37]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangLLJD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangLLJD19
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Li-Rong Dai:
Sequence-to-Sequence Acoustic Modeling for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 631-644 (2019)
[j36]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiSDM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiSDM19
Zengxi Li, Yan Song, Li-Rong Dai, Ian McLoughlin:
Listening and Grouping: An Online Autoregressive Approach for Monaural Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 692-703 (2019)
[j35]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/ZhangDD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ZhangDD19
Jianshu Zhang, Jun Du, Lirong Dai:
Track, Attend, and Parse (TAP): An End-to-End Framework for Online Handwritten Mathematical Expression Recognition. IEEE Trans. Multim. 21(1): 221-233 (2019)
[c212]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XiLSJD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XiLSJD19
Yuxuan Xi, Pengcheng Li, Yan Song, Yiheng Jiang, Lirong Dai:
Speaker to Emotion: Domain Adaptation for Speech Emotion Recognition with Residual Adapters. APSIPA 2019: 513-518
[c211]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WuLLJWD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuLLJWD19
Peng-Fei Wu, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Hong-Chuan Wu, Lirong Dai:
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training. APSIPA 2019: 623-627
[c210]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XuHSGD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XuHSGD19
Jingyi Xu, Junfeng Hou, Yan Song, Wu Guo, Lirong Dai:
Knowledge Distillation from Multilingual and Monolingual Teachers for End-to-End Multilingual Speech Recognition. APSIPA 2019: 844-849
[c209]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/NaHGSD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/NaHGSD19
Rui Na, Junfeng Hou, Wu Guo, Yan Song, Lirong Dai:
Learning Adaptive Downsampling Encoding for Online End-to-End Speech Recognition. APSIPA 2019: 850-854
[c208]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/JiangSYDM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/JiangSYDM19
Yiheng Jiang, Yan Song, Jie Yan, Lirong Dai, Ian McLoughlin:
Triplet-Center Loss Based Deep Embedding Learning Method for Speaker Verification. APSIPA 2019: 1625-1629
[c207]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0006HLWWAL019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0006HLWWAL019
Yuan Jiang, Ya-Jun Hu, Li-Juan Liu, Hong-Chuan Wu, Zhi-Kun Wang, Yang Ai, Zhen-Hua Ling, Li-Rong Dai:
The USTC System for Blizzard Challenge 2019. Blizzard Challenge 2019
[c206]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YanSGDMC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YanSGDMC19
Jie Yan, Yan Song, Wu Guo, Li-Rong Dai, Ian McLoughlin, Liang Chen:
A Region Based Attention Method for Weakly Supervised Sound Event Detection and Classification. ICASSP 2019: 755-759
[c205]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangLJLLD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangLJLLD19
Jing-Xuan Zhang, Zhen-Hua Ling, Yuan Jiang, Li-Juan Liu, Chen Liang, Li-Rong Dai:
Improving Sequence-to-sequence Voice Conversion by Adding Text-supervision. ICASSP 2019: 6785-6789
[c204]
- view
  authority control:
- export record
  dblp key:
  - conf/icmssp/LeiCHC019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmssp/LeiCHC019
Qinhui Lei, Hang Chen, Junfeng Hou, Liang Chen, Lirong Dai:
Deep Neural Network Based Regression Approach for Acoustic Echo Cancellation. ICMSSP 2019: 94-98
[c203]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoSMLJD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoSMLJD19
Zhifu Gao, Yan Song, Ian McLoughlin, Pengcheng Li, Yiheng Jiang, Li-Rong Dai:
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System. INTERSPEECH 2019: 361-365
[c202]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YouGDD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YouGDD19
Lanhua You, Wu Guo, Li-Rong Dai, Jun Du:
Multi-Task Learning with High-Order Statistics for x-Vector Based Text-Independent Speaker Verification. INTERSPEECH 2019: 1158-1162
[c201]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YouGDD19a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YouGDD19a
Lanhua You, Wu Guo, Li-Rong Dai, Jun Du:
Deep Neural Network Embeddings with Gating Mechanisms for Text-Independent Speaker Verification. INTERSPEECH 2019: 1168-1172
[c200]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLD19
Jia-Xiang Chen, Zhen-Hua Ling, Li-Rong Dai:
A Chinese Dataset for Identifying Speakers in Novels. INTERSPEECH 2019: 1561-1565
[c199]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YiALD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YiALD19
Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling, Li-Rong Dai:
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling. INTERSPEECH 2019: 2593-2597
[c198]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiangSMGD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiangSMGD19
Yiheng Jiang, Yan Song, Ian McLoughlin, Zhifu Gao, Li-Rong Dai:
An Effective Deep Embedding Learning Architecture for Speaker Verification. INTERSPEECH 2019: 4040-4044
[c197]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenGDLD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenGDLD19
Zhi Chen, Wu Guo, Li-Rong Dai, Zhen-Hua Ling, Jun Du:
Neural Text Clustering with Document-Level Attention Based on Dynamic Soft Labels. INTERSPEECH 2019: 4225-4229
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-12058
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-12058
Lanhua You, Wu Guo, Lirong Dai, Jun Du:
Deep Neural Network Embedding Learning with High-Order Statistics for Text-Independent Speaker Verification. CoRR abs/1903.12058 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-12092
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-12092
Lanhua You, Wu Guo, Lirong Dai, Jun Du:
Deep Neural Network Embeddings with Gating Mechanisms for Text-Independent Speaker Verification. CoRR abs/1903.12092 (2019)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-08977
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-08977
Yuan-Hao Yi, Yang Ai, Zhen-Hua Ling, Li-Rong Dai:
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling. CoRR abs/1906.08977 (2019)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-10508
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-10508
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations. CoRR abs/1906.10508 (2019)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-10859
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-10859
Peng-Fei Wu, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Hong-Chuan Wu, Li-Rong Dai:
End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training. CoRR abs/1906.10859 (2019)
2018
[j34]
- view
  authority control:
- export record
  dblp key:
  - journals/cssp/LiDSM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cssp/LiDSM18
Zengxi Li, Li-Rong Dai, Yan Song, Ian McLoughlin:
A Conditional Generative Model for Speech Enhancement. Circuits Syst. Signal Process. 37(11): 5005-5022 (2018)
[j33]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/LiuLD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LiuLD18
Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
Articulatory-to-acoustic conversion using BLSTM-RNNs with augmented input representation. Speech Commun. 99: 161-172 (2018)
[j32]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/LiuLD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/LiuLD18
Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
Statistical Parametric Speech Synthesis Using Generalized Distillation Framework. IEEE Signal Process. Lett. 25(5): 695-699 (2018)
[j31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/JinSMD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/JinSMD18
Ma Jin, Yan Song, Ian McLoughlin, Li-Rong Dai:
LID-Senones and Their Statistics for Language Identification. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 171-183 (2018)
[j30]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LingAGD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LingAGD18
Zhen-Hua Ling, Yang Ai, Yu Gu, Li-Rong Dai:
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension. IEEE ACM Trans. Audio Speech Lang. Process. 26(5): 883-894 (2018)
[j29]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangDDL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangDDL18
Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1181-1193 (2018)
[j28]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/LiuLWHD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/LiuLWHD18
Junhua Liu, Zhen-Hua Ling, Si Wei, Guoping Hu, Li-Rong Dai:
Improving the Decoding Efficiency of Deep Neural Network Acoustic Models by Cluster-Based Senone Selection. J. Signal Process. Syst. 90(7): 999-1011 (2018)
[c196]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/LiuTSD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/LiuTSD18
Yaming Liu, Jian Tang, Yan Song, Lirong Dai:
A Capsule based Approach for Polyphonic Sound Event Detection. APSIPA 2018: 1853-1857
[c195]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0006ZDHL018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0006ZDHL018
Yuan Jiang, Xiao Zhou, Chuang Ding, Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
The USTC System for Blizzard Challenge 2018. Blizzard Challenge 2018
[c194]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiSDM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiSDM18
Zengxi Li, Yan Song, Li-Rong Dai, Ian McLoughlin:
Source-Aware Context Network for Single-Channel Multi-Speaker Speech Separation. ICASSP 2018: 681-685
[c193]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangLD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangLD18
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Forward Attention in Sequence- To-Sequence Acoustic Modeling for Speech Synthesis. ICASSP 2018: 4789-4793
[c192]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GaoDDL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GaoDDL18
Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Densely Connected Progressive Learning for LSTM-Based Speech Enhancement. ICASSP 2018: 5054-5058
[c191]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangLYD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangLYD18
Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai:
Deep-FSMN for Large Vocabulary Continuous Speech Recognition. ICASSP 2018: 5869-5873
[c190]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenGDL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenGDL18
Peixin Chen, Wu Guo, Lirong Dai, Zhenhua Ling:
Pseudo-Supervised Approach for Text Clustering Based on Consensus Analysis. ICASSP 2018: 6184-6188
[c189]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/ZhangZDD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/ZhangZDD18
Jianshu Zhang, Yixing Zhu, Jun Du, Lirong Dai:
Radical Analysis Network for Zero-Shot Learning in Printed Chinese Character Recognition. ICME 2018: 1-6
[c188]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/ZhangDD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/ZhangDD18
Jianshu Zhang, Jun Du, Lirong Dai:
Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition. ICPR 2018: 2245-2250
[c187]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/ZhangZDD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/ZhangZDD18
Jianshu Zhang, Yixing Zhu, Jun Du, Lirong Dai:
Trajectory-based Radical Analysis Network for Online Handwritten Chinese Character Recognition. ICPR 2018: 3681-3686
[c186]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TangSDM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TangSDM18
Jian Tang, Yan Song, Lirong Dai, Ian McLoughlin:
Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition. INTERSPEECH 2018: 1783-1787
[c185]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuLJZD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuLJZD18
Li-Juan Liu, Zhen-Hua Ling, Yuan Jiang, Ming Zhou, Li-Rong Dai:
WaveNet Vocoder with Limited Training Data for Voice Conversion. INTERSPEECH 2018: 1983-1987
[c184]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouLZD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouLZD18
Xiao Zhou, Zhen-Hua Ling, Zhi-Ping Zhou, Li-Rong Dai:
Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis. INTERSPEECH 2018: 2509-2513
[c183]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSMGD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSMGD18
Pengcheng Li, Yan Song, Ian McLoughlin, Wu Guo, Lirong Dai:
An Attention Pooling Based Representation Learning Method for Speech Emotion Recognition. INTERSPEECH 2018: 3087-3091
[c182]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoSMGD18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoSMGD18
Zhifu Gao, Yan Song, Ian McLoughlin, Wu Guo, Lirong Dai:
An Improved Deep Embedding Learning Method for Short Duration Speaker Verification. INTERSPEECH 2018: 3578-3582
[c181]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangDCDL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangDCDL18
Qing Wang, Jun Du, Li Chai, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network. ISCSLP 2018: 295-299
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1801-03530
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-03530
Jianshu Zhang, Jun Du, Lirong Dai:
Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition. CoRR abs/1801.03530 (2018)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1801-07910
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-07910
Zhen-Hua Ling, Yang Ai, Yu Gu, Li-Rong Dai:
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension. CoRR abs/1801.07910 (2018)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1801-10109
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-10109
Jianshu Zhang, Yixing Zhu, Jun Du, Lirong Dai:
Trajectory-based Radical Analysis Network for Online Handwritten Chinese Character Recognition. CoRR abs/1801.10109 (2018)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-05030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-05030
Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai:
Deep-FSMN for Large Vocabulary Continuous Speech Recognition. CoRR abs/1803.05030 (2018)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-06736
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-06736
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai:
Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis. CoRR abs/1807.06736 (2018)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-07436
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-07436
Yaming Liu, Jian Tang, Yan Song, Lirong Dai:
A Capsule based Approach for Polyphonic Sound Event Detection. CoRR abs/1807.07436 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1810-06865
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-06865
Jing-Xuan Zhang, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Li-Rong Dai:
Sequence-to-Sequence Acoustic Modeling for Voice Conversion. CoRR abs/1810.06865 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-08111
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-08111
Jing-Xuan Zhang, Zhen-Hua Ling, Yuan Jiang, Li-Juan Liu, Chen Liang, Li-Rong Dai:
Improving Sequence-to-Sequence Acoustic Modeling by Adding Text-Supervision. CoRR abs/1811.08111 (2018)
2017
[j27]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/TuDWBDL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/TuDWBDL17
Yanhui Tu, Jun Du, Qing Wang, Xiao Bao, Li-Rong Dai, Chin-Hui Lee:
An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech. Comput. Speech Lang. 46: 517-534 (2017)
[j26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jzusc/TianCXLDCXCWHHH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/TianCXLDCXCWHHH17
Yonghong Tian, Xilin Chen, Hongkai Xiong, Hong-Liang Li, Li-Rong Dai, Jing Chen, Junliang Xing, Jing Chen, Xihong Wu, Weiming Hu, Yu Hu, Tiejun Huang, Wen Gao:
Towards human-like and transhuman perception in AI 2.0: a review. Frontiers Inf. Technol. Electron. Eng. 18(1): 58-67 (2017)
[j25]
- view
  authority control:
- export record
  dblp key:
  - journals/pr/ZhangDZLHHWD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pr/ZhangDZLHHWD17
Jianshu Zhang, Jun Du, Shiliang Zhang, Dan Liu, Yulong Hu, Jin-Shui Hu, Si Wei, Li-Rong Dai:
Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition. Pattern Recognit. 71: 196-206 (2017)
[j24]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/GaoDDL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/GaoDDL17
Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A unified DNN approach to speaker-dependent simultaneous speech enhancement and speech separation in low SNR environments. Speech Commun. 95: 28-39 (2017)
[j23]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangLJWDH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangLJWDH17
Shiliang Zhang, Cong Liu, Hui Jiang, Si Wei, Li-Rong Dai, Yu Hu:
Nonrecurrent Neural Structure for Long-Term Dependence. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 871-884 (2017)
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangDDL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangDDL17
Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Gender Mixture Detection Approach to Unsupervised Single-Channel Speech Separation Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(7): 1535-1546 (2017)
[c180]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/HouZD017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/HouZD017
Junfeng Hou, Shiliang Zhang, Li-Rong Dai, Hui Jiang:
Feedforward sequential memory networks based encoder-decoder model for machine translation. APSIPA 2017: 622-625
[c179]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ChenZHD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ChenZHD17
Huang Chen, Shiliang Zhang, Junfeng Hou, Lirong Dai:
Learning the number of nodes in DNNs with activation mask. APSIPA 2017: 1218-1221
[c178]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/AnLD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/AnLD17
Shumin An, Zhenhua Ling, Lirong Dai:
Emotional statistical parametric speech synthesis using LSTM-RNNs. APSIPA 2017: 1613-1616
[c177]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HuLDLD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HuLDLD17
Ya-Jun Hu, Li-Juan Liu, Chuang Ding, Zhen-Hua Ling, Li-Rong Dai:
The USTC system for blizzard machine learning challenge 2017-ES2. ASRU 2017: 650-656
[c176]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/HuDLL017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/HuDLL017
Ya-Jun Hu, Chuang Ding, Li-Juan Liu, Zhen-Hua Ling, Li-Rong Dai:
The USTC System for Blizzard Challenge 2017. Blizzard Challenge 2017
[c175]
- view
  authority control:
- export record
  dblp key:
  - conf/hscma/WangDDL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hscma/WangDDL17
Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Joint noise and mask aware training for DNN-based speech enhancement with SUB-band features. HSCMA 2017: 101-105
[c174]
- view
  authority control:
- export record
  dblp key:
  - conf/hscma/SunDDL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hscma/SunDDL17
Lei Sun, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Multiple-target deep learning for LSTM-RNN based speech enhancement. HSCMA 2017: 136-140
[c173]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuLD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuLD17
Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
Extracting structural spectral features using what-where auto-encoders for statistical parametric speech synthesis. ICASSP 2017: 4915-4919
[c172]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLMMLD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLMMLD17
Liping Chen, Kong-Aik Lee, Bin Ma, Long Ma, Haizhou Li, Li-Rong Dai:
Adaptation of PLDA for multi-source text-independent speaker verification. ICASSP 2017: 5380-5384
[c171]
- view
  authority control:
- export record
  dblp key:
  - conf/icdar/ZhangDD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icdar/ZhangDD17
Jianshu Zhang, Jun Du, Lirong Dai:
A GRU-Based Encoder-Decoder Approach with Attention for Online Handwritten Mathematical Expression Recognition. ICDAR 2017: 902-907
[c170]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/BaoGDD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/BaoGDD17
Xiao Bao, Tian Gao, Jun Du, Li-Rong Dai:
An investigation of high-resolution modeling units of deep neural networks for acoustic scene classification. IJCNN 2017: 3028-3035
[c169]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangDDL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangDDL17
Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation. INTERSPEECH 2017: 1178-1182
[c168]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JinSMGD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JinSMGD17
Ma Jin, Yan Song, Ian Vince McLoughlin, Wu Guo, Li-Rong Dai:
End-to-End Language Identification Using High-Order Utterance Representation with Bilinear Pooling. INTERSPEECH 2017: 2571-2575
[c167]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HouZD17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HouZD17
Junfeng Hou, Shiliang Zhang, Li-Rong Dai:
Gaussian Prediction Based Attention for Online End-to-End Speech Recognition. INTERSPEECH 2017: 3692-3696
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ZhangZCDWJ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangZCDWJ17
Junbei Zhang, Xiaodan Zhu, Qian Chen, Li-Rong Dai, Si Wei, Hui Jiang:
Exploring Question Understanding and Adaptation in Neural-Network-Based Question Answering. CoRR abs/1703.04617 (2017)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/XuDHDL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/XuDHDL17
Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee:
Multi-Objective Learning and Mask-Based Post-Processing for Deep Neural Network Based Speech Enhancement. CoRR abs/1703.07172 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-01889
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-01889
Jianshu Zhang, Yixing Zhu, Jun Du, Li-Rong Dai:
RAN: Radical analysis networks for zero-shot learning of Chinese characters. CoRR abs/1711.01889 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1712-03991
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-03991
Jianshu Zhang, Jun Du, Li-Rong Dai:
A GRU-based Encoder-Decoder Approach with Attention for Online Handwritten Mathematical Expression Recognition. CoRR abs/1712.03991 (2017)
2016
[j21]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/WangLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/WangLD16
Xin Wang, Zhen-Hua Ling, Li-Rong Dai:
Concept-to-Speech generation with knowledge sharing for acoustic modelling and utterance filtering. Comput. Speech Lang. 38: 46-67 (2016)
[j20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ejasp/GaoDXLDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasp/GaoDXLDL16
Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition. EURASIP J. Adv. Signal Process. 2016: 86 (2016)
[j19]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/ZhangJD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/ZhangJD16
Shiliang Zhang, Hui Jiang, Li-Rong Dai:
Hybrid Orthogonal Projection and Estimation (HOPE): A New Framework to Learn Neural Networks. J. Mach. Learn. Res. 17: 37:1-37:33 (2016)
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/YinLQSHLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/YinLQSHLD16
Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai:
Modeling F0 trajectories in hierarchically structured deep neural networks. Speech Commun. 76: 82-92 (2016)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/DuTDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DuTDL16
Jun Du, Yanhui Tu, Li-Rong Dai, Chin-Hui Lee:
A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 24(8): 1424-1437 (2016)
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/Xue0DL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/Xue0DL16
Shaofei Xue, Hui Jiang, Li-Rong Dai, Qingfeng Liu:
Speaker Adaptation of Hybrid NN/HMM Model for Speech Recognition Based on Singular Value Decomposition. J. Signal Process. Syst. 82(2): 175-185 (2016)
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/ChenLMGLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/ChenLMGLD16
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Exploration of Local Variability in Text-Independent Speaker Verification. J. Signal Process. Syst. 82(2): 217-228 (2016)
[c166]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WangDD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WangDD16
Qing Wang, Jun Du, Li-Rong Dai:
Boosting DNN-based speech enhancement via explicit transformations. APSIPA 2016: 1-4
[c165]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WangDDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WangDDL16
Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Unsupervised single-channel speech separation via deep neural network for different gender mixtures. APSIPA 2016: 1-4
[c164]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/Chen0ZL016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/Chen0ZL016
Ling-Hui Chen, Yuan Jiang, Ming Zhou, Zhen-Hua Ling, Li-Rong Dai:
The USTC System for Blizzard Challenge 2016. Blizzard Challenge 2016
[c163]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiSMD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiSMD16
Zengxi Li, Yan Song, Ian McLoughlin, Li-Rong Dai:
Compact convolutional neural network transfer learning for small-scale image classification. ICASSP 2016: 2737-2741
[c162]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YinLHD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YinLHD16
Xiang Yin, Zhen-Hua Ling, Ya-Jun Hu, Li-Rong Dai:
Modeling spectral envelopes using deep conditional restricted Boltzmann machines for statistical parametric speech synthesis. ICASSP 2016: 5125-5129
[c161]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangTXD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangTXD16
Zhiying Huang, Jian Tang, Shaofei Xue, Li-Rong Dai:
Speaker adaptation OF RNN-BLSTM for speech recognition based on speaker code. ICASSP 2016: 5305-5309
[c160]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLCMLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLCMLD16
Liping Chen, Kong-Aik Lee, Eng Siong Chng, Bin Ma, Haizhou Li, Li-Rong Dai:
Content-aware local variability vector for speaker verification with short utterance. ICASSP 2016: 5485-5489
[c159]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuLD16
Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai:
Deep belief network-based post-filtering for statistical parametric speech synthesis. ICASSP 2016: 5510-5514
[c158]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LingSDH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LingSDH16
Zhen-Hua Ling, Xiao-Hui Sun, Li-Rong Dai, Yu Hu:
Modulation spectrum compensation for HMM-based speech synthesis using line spectral pairs. ICASSP 2016: 5595-5599
[c157]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuLD16
Yu Gu, Zhen-Hua Ling, Li-Rong Dai:
Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks. INTERSPEECH 2016: 297-301
[c156]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuLD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuLD16
Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks. INTERSPEECH 2016: 1502-1506
[c155]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLLJD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLLJD16
Ling-Hui Chen, Li-Juan Liu, Zhen-Hua Ling, Yuan Jiang, Li-Rong Dai:
The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F₀ Conversion. INTERSPEECH 2016: 1642-1646
[c154]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangTD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangTD16
Jianshu Zhang, Jian Tang, Li-Rong Dai:
RNN-BLSTM Based Multi-Pitch Estimation. INTERSPEECH 2016: 1785-1789
[c153]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangJXWD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangJXWD16
Shiliang Zhang, Hui Jiang, Shifu Xiong, Si Wei, Li-Rong Dai:
Compact Feedforward Sequential Memory Networks for Large Vocabulary Continuous Speech Recognition. INTERSPEECH 2016: 3389-3393
[c152]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TangZWD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TangZWD16
Jian Tang, Shiliang Zhang, Si Wei, Li-Rong Dai:
Future Context Attention for Unidirectional LSTM Based Acoustic Model. INTERSPEECH 2016: 3394-3398
[c151]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoDDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoDDL16
Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement. INTERSPEECH 2016: 3713-3717
[c150]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/FanDD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/FanDD16
Nana Fan, Jun Du, Li-Rong Dai:
A regression approach to binaural speech segregation via deep neural network. ISCSLP 2016: 1-5
[c149]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/HouZD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/HouZD16
Junfeng Hou, Shiliang Zhang, Li-Rong Dai:
Learning FOFE based FNN-LMs with noise contrastive estimation and part-of-speech features. ISCSLP 2016: 1-5
[c148]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/HuangXYD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/HuangXYD16
Zhiying Huang, Shaofei Xue, Zhijie Yan, Li-Rong Dai:
Unsupervised speaker adaptation of BLSTM-RNN for LVCSR based on speaker code. ISCSLP 2016: 1-5
[c147]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LiuLWHD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LiuLWHD16
Junhua Liu, Zhen-Hua Ling, Si Wei, Guoping Hu, Li-Rong Dai:
Cluster-based senone selection for the efficient calculation of deep neural network acoustic models. ISCSLP 2016: 1-5
[c146]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/QianMQD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/QianMQD16
Mengjie Qian, Ian McLoughlin, Wu Quo, Li-Rong Dai:
Mismatched training data enhancement for automatic recognition of children's speech using DNN-HMM. ISCSLP 2016: 1-5
[c145]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/TuDDL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/TuDDL16
Yanhui Tu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition. ISCSLP 2016: 1-5
[c144]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XueYHD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XueYHD16
Shaofei Xue, Zhijie Yan, Zhiying Huang, Li-Rong Dai:
Rapid speaker adaptation based on D-code extracted from BLSTM-RNN in LVCSR. ISCSLP 2016: 1-5
[c143]
- view
  - electronic edition @ nii.ac.jp (open access)
  - details & citations
- export record
  dblp key:
  - conf/ntcir/ZhangHZD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ntcir/ZhangHZD16
Junbei Zhang, Junfeng Hou, Shiliang Zhang, Li-Rong Dai:
USTC at NTCIR-12 STC Task. NTCIR 2016
[c142]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/SongCMD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/SongCMD16
Yan Song, Ruilian Cui, Ian McLoughlin, Li-Rong Dai:
Improvements on Deep Bottleneck Network based I-Vector Representation for Spoken Language Identification. Odyssey 2016: 140-145
[c141]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/JinSMDY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/JinSMDY16
Ma Jin, Yan Song, Ian McLoughlin, Li-Rong Dai, Zhongfu Ye:
LID-senone Extraction via Deep Neural Networks for End-to-End Language Identification. Odyssey 2016: 210-216
[c140]
- view
  authority control:
- export record
  dblp key:
  - conf/vcip/SongHMD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/vcip/SongHMD16
Yan Song, Xinhai Hong, Ian McLoughlin, Li-Rong Dai:
Image classification with CNN-based Fisher vector coding. VCIP 2016: 1-4
2015
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/CaiLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/CaiLD15
Ming-Qi Cai, Zhen-Hua Ling, Li-Rong Dai:
Statistical parametric speech synthesis using a hidden trajectory model. Speech Commun. 72: 149-159 (2015)
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ChenLDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ChenLDL15
Liping Chen, Kong-Aik Lee, Li-Rong Dai, Haizhou Li:
Quasi-Factorial Prior for i-vector Extraction. IEEE Signal Process. Lett. 22(12): 2484-2488 (2015)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XuDDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XuDDL15
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Regression Approach to Speech Enhancement Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 23(1): 7-19 (2015)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Zhou0DHL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Zhou0DHL15
Pan Zhou, Hui Jiang, Li-Rong Dai, Yu Hu, Qingfeng Liu:
State-Clustering Based Multiple Deep Neural Networks Modeling Approach for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 23(4): 631-642 (2015)
[c139]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ZhangJXHD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhangJXHD15
Shiliang Zhang, Hui Jiang, Mingbin Xu, Junfeng Hou, Li-Rong Dai:
The Fixed-Size Ordinally-Forgetting Encoding Method for Neural Network Language Models. ACL (2) 2015: 495-500
[c138]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/DuWTBDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/DuWTBDL15
Jun Du, Qing Wang, Yanhui Tu, Xiao Bao, Li-Rong Dai, Chin-Hui Lee:
An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework. ASRU 2015: 430-435
[c137]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/LiuLD15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/LiuLD15a
Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai:
LIP movement generation using restricted Boltzmann machines for visual speech synthesis. ChinaSIP 2015: 606-610
[c136]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/GaoDXLDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/GaoDXLDL15
Tian Gao, Jun Du, Li Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
A unified speaker-dependent speech separation and enhancement system based on deep neural networks. ChinaSIP 2015: 687-691
[c135]
- view
  authority control:
- export record
  dblp key:
  - conf/ica/GaoDXLDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ica/GaoDXLDL15
Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments. LVA/ICA 2015: 75-82
[c134]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuDDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuDDL15
Yanhui Tu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Speech Separation based on signal-noise-dependent deep neural networks for robust speech recognition. ICASSP 2015: 61-65
[c133]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SongCHMSD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SongCHMSD15
Yan Song, Ruilian Cui, Xinhai Hong, Ian McLoughlin, Jiong Shi, Li-Rong Dai:
Improved language identification using deep bottleneck network. ICASSP 2015: 4200-4204
[c132]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GaoDDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GaoDDL15
Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Joint training of front-end and back-end deep neural networks for robust speech recognition. ICASSP 2015: 4375-4379
[c131]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Xue0DL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Xue0DL15
Shaofei Xue, Hui Jiang, Li-Rong Dai, Qingfeng Liu:
Unsupervised speaker adaptation of deep neural network based on the combination of speaker codes and singular value decomposition for speech recognition. ICASSP 2015: 4555-4559
[c130]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuCLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuCLD15
Li-Juan Liu, Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai:
Spectral conversion using deep neural networks trained with multi-source speakers. ICASSP 2015: 4849-4853
[c129]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLMGLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLMGLD15
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Channel adaptation of plda for text-independent speaker verification. ICASSP 2015: 5251-5255
[c128]
- view
  authority control:
- export record
  dblp key:
  - conf/icdar/DuZHZWD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icdar/DuZHZWD15
Jun Du, Jian-Fang Zhai, Jin-Shui Hu, Bo Zhu, Si Wei, Li-Rong Dai:
Writer adaptive feature extraction based on convolutional neural networks for online handwritten Chinese character recognition. ICDAR 2015: 841-845
[c127]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLMGLD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLMGLD15
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Phone-centric local variability vector for text-constrained speaker verification. INTERSPEECH 2015: 229-233
[c126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SongHJCMD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SongHJCMD15
Yan Song, Xinhai Hong, Bing Jiang, Ruilian Cui, Ian McLoughlin, Li-Rong Dai:
Deep bottleneck network based i-vector representation for language identification. INTERSPEECH 2015: 398-402
[c125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangDDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangDDL15
Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
High-resolution acoustic modeling and compact language modeling of language-universal speech attributes for spoken language identification. INTERSPEECH 2015: 992-996
[c124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuDHDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuDHDL15
Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee:
Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement. INTERSPEECH 2015: 1508-1512
[c123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLYD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLYD15
Qian Chen, Zhen-Hua Ling, Chen-Yu Yang, Li-Rong Dai:
Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions. INTERSPEECH 2015: 1581-1585
[c122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangDBWDL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangDBWDL15
Qing Wang, Jun Du, Xiao Bao, Zi-Rui Wang, Li-Rong Dai, Chin-Hui Lee:
A universal VAD based on jointly trained deep neural networks. INTERSPEECH 2015: 2282-2286
[c121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangJWD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangJWD15
Shiliang Zhang, Hui Jiang, Si Wei, Li-Rong Dai:
Rectified linear neural networks with tied-scalar regularization for LVCSR. INTERSPEECH 2015: 2635-2639
[c120]
- view
  authority control:
- export record
  dblp key:
  - conf/mir/SongMD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mir/SongMD15
Yan Song, Ian McLoughlin, Li-Rong Dai:
Deep Bottleneck Feature for Image Classification. ICMR 2015: 491-494
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ZhangJXHD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangJXHD15
Shiliang Zhang, Hui Jiang, Mingbin Xu, Junfeng Hou, Li-Rong Dai:
A Fixed-Size Encoding Method for Variable-Length Sequences with its Application to Neural Network Language Models. CoRR abs/1505.01504 (2015)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ZhangJWD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangJWD15
Shiliang Zhang, Hui Jiang, Si Wei, Li-Rong Dai:
Feedforward Sequential Memory Neural Networks without Recurrent Feedback. CoRR abs/1510.02693 (2015)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ZhangLJWDH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangLJWDH15
Shiliang Zhang, Cong Liu, Hui Jiang, Si Wei, Li-Rong Dai, Yu Hu:
Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency. CoRR abs/1512.08301 (2015)
2014
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/YangLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/YangLD14
Chen-Yu Yang, Zhen-Hua Ling, Li-Rong Dai:
Unsupervised Prosodic Labeling of Speech Synthesis Databases Using Context-Dependent HMMs. IEICE Trans. Inf. Syst. 97-D(6): 1449-1460 (2014)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/XiaLJD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/XiaLJD14
Xian-Jun Xia, Zhen-Hua Ling, Yuan Jiang, Li-Rong Dai:
HMM-based unit selection speech synthesis using log likelihood ratios derived from perceptual data. Speech Commun. 63: 27-37 (2014)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/XuDDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/XuDDL14
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
An Experimental Study on Speech Enhancement Based on Deep Neural Networks. IEEE Signal Process. Lett. 21(1): 65-68 (2014)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XueA0DL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XueA0DL14
Shaofei Xue, Ossama Abdel-Hamid, Hui Jiang, Li-Rong Dai, Qingfeng Liu:
Fast adaptation of deep neural network based on discriminant codes for speech recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(12): 1713-1725 (2014)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChenLLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChenLLD14
Ling-Hui Chen, Zhen-Hua Ling, Li-Juan Liu, Li-Rong Dai:
Voice conversion using deep neural networks with layer-wise generative training. IEEE ACM Trans. Audio Speech Lang. Process. 22(12): 1859-1872 (2014)
[c119]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/XuDDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/XuDDL14
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Global variance equalization for improving deep neural network based speech enhancement. ChinaSIP 2014: 71-75
[c118]
- view
  authority control:
- export record
  dblp key:
  - conf/icalip/SongG0M14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icalip/SongG0M14
Yan Song, Wu Guo, Li-Rong Dai, Ian Vince McLoughlin:
A spectral based visual matching method for image classification. ICAILP 2014: 666-670
[c117]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DuDH14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DuDH14
Jun Du, Li-Rong Dai, Qiang Huo:
Synthesized stereo mapping via deep neural networks for noisy speech recognition. ICASSP 2014: 1764-1768
[c116]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YinLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YinLD14
Xiang Yin, Zhen-Hua Ling, Li-Rong Dai:
Spectral modeling using neural autoregressive distribution estimators for statistical parametric speech synthesis. ICASSP 2014: 3824-3828
[c115]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLMGLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLMGLD14
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Minimum divergence estimation of speaker prior in multi-session PLDA scoring. ICASSP 2014: 4007-4011
[c114]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuWGBXD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuWGBXD14
Diyuan Liu, Si Wei, Wu Guo, Yebo Bao, Shifu Xiong, Li-Rong Dai:
Lattice based optimization of bottleneck feature extractor with linear transformation. ICASSP 2014: 5617-5621
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouD014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouD014
Pan Zhou, Li-Rong Dai, Hui Jiang:
Sequence training of multiple deep neural networks for better performance and faster training speed. ICASSP 2014: 5627-5631
[c112]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XueA0D14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XueA0D14
Shaofei Xue, Ossama Abdel-Hamid, Hui Jiang, Li-Rong Dai:
Direct adaptation of hybrid DNN/HMM model for fast speaker adaptation in LVCSR based on speaker code. ICASSP 2014: 6339-6343
[c111]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangBZ0D14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangBZ0D14
Shiliang Zhang, Yebo Bao, Pan Zhou, Hui Jiang, Li-Rong Dai:
Improving deep neural networks for LVCSR using dropout and shrinking structure. ICASSP 2014: 6849-6853
[c110]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuCLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuCLD14
Li-Juan Liu, Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai:
Using bidirectional associative memories for joint spectral envelope modeling in voice conversion. ICASSP 2014: 7884-7888
[c109]
- view
  authority control:
- export record
  dblp key:
  - conf/icfhr/DuHZWD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icfhr/DuHZWD14
Jun Du, Jin-Shui Hu, Bo Zhu, Si Wei, Li-Rong Dai:
Writer Adaptation Using Bottleneck Features and Discriminative Linear Regression for Online Handwritten Chinese Character Recognition. ICFHR 2014: 311-316
[c108]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/DuHZWD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/DuHZWD14
Jun Du, Jin-Shui Hu, Bo Zhu, Si Wei, Li-Rong Dai:
A Study of Designing Compact Classifiers Using Deep Neural Networks for Online Handwritten Chinese Character Recognition. ICPR 2014: 2950-2955
[c107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DuWGXDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DuWGXDL14
Jun Du, Qing Wang, Tian Gao, Yong Xu, Li-Rong Dai, Chin-Hui Lee:
Robust speech recognition with speech enhanced deep neural networks. INTERSPEECH 2014: 616-620
[c106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CaiLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CaiLD14
Ming-Qi Cai, Zhen-Hua Ling, Li-Rong Dai:
Formant-controlled speech synthesis using hidden trajectory model. INTERSPEECH 2014: 1529-1533
[c105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YinLQSHLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YinLQSHLD14
Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, Lei He, Zhen-Hua Ling, Li-Rong Dai:
Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree. INTERSPEECH 2014: 2273-2277
[c104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLD14
Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai:
Voice conversion using generative trained deep neural networks with multiple frame spectral envelopes. INTERSPEECH 2014: 2313-2317
[c103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuDDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuDDL14
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Dynamic noise aware training for speech enhancement based on deep neural networks. INTERSPEECH 2014: 2670-2674
[c102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLD14
Xin Wang, Zhen-Hua Ling, Li-Rong Dai:
Concept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis. INTERSPEECH 2014: 2942-2946
[c101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiangSWMD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiangSWMD14
Bing Jiang, Yan Song, Si Wei, Ian Vince McLoughlin, Li-Rong Dai:
Task-aware deep bottleneck features for spoken language identification. INTERSPEECH 2014: 3012-3016
[c100]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/Xue0D14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Xue0D14
Shaofei Xue, Hui Jiang, Li-Rong Dai:
Speaker adaptation of hybrid NN/HMM model for speech recognition based on singular value decomposition. ISCSLP 2014: 1-5
[c99]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChenLMGLD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChenLMGLD14
Liping Chen, Kong-Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai:
Local variability vector for text-independent speaker verification. ISCSLP 2014: 54-58
[c98]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/KongXGGD014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/KongXGGD014
Changqing Kong, Shaofei Xue, Jianqing Gao, Wu Guo, Li-Rong Dai, Hui Jiang:
Speaker adaptive bottleneck features extraction for LVCSR based on discriminative learning of speaker codes. ISCSLP 2014: 83-87
[c97]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/JiangSWWMD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/JiangSWWMD14
Bing Jiang, Yan Song, Si Wei, Meng-Ge Wang, Ian McLoughlin, Li-Rong Dai:
Performance evaluation of deep bottleneck features for spoken language identification. ISCSLP 2014: 143-147
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangDDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangDDL14
Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A fusion approach to spoken language identification based on combining multiple phone recognizers and speech attribute detectors. ISCSLP 2014: 158-162
[c95]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/SunLYD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/SunLYD14
Yu-Sheng Sun, Zhen-Hua Ling, Xiang Yin, Li-Rong Dai:
Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis. ISCSLP 2014: 201-205
[c94]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/TuDXDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/TuDXDL14
Yanhui Tu, Jun Du, Yong Xu, Li-Rong Dai, Chin-Hui Lee:
Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers. ISCSLP 2014: 250-254
[c93]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/GaoLCD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/GaoLCD14
Li Gao, Zhen-Hua Ling, Ling-Hui Chen, Li-Rong Dai:
Improving F0 prediction using bidirectional associative memories and syllable-level F0 features for HMM-based Mandarin speech synthesis. ISCSLP 2014: 275-279
[c92]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XuDDL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XuDDL14
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Cross-language transfer learning for deep neural network based speech enhancement. ISCSLP 2014: 336-340
[c91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/LeeM0CGD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LeeM0CGD14
Kong Aik Lee, Bin Ma, Haizhou Li, Liping Chen, Wu Guo, Li-Rong Dai:
Local Variability Modeling for Text-Independent Speaker Verification. Odyssey 2014: 54-59
2013
[c90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/ChenL0SXZY013
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/ChenL0SXZY013
Ling-Hui Chen, Zhen-Hua Ling, Yuan Jiang, Yang Song, Xian-Jun Xia, Yi-Qing Zu, Run-Qiang Yan, Li-Rong Dai:
The USTC System for Blizzard Challenge 2013. Blizzard Challenge 2013
[c89]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouLLD013
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouLLD013
Pan Zhou, Cong Liu, Qingfeng Liu, Li-Rong Dai, Hui Jiang:
A cluster-based multiple deep neural networks method for large vocabulary continuous speech recognition. ICASSP 2013: 6650-6654
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangLD13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangLD13
Chen-Yu Yang, Zhen-Hua Ling, Li-Rong Dai:
Unsupervised prosodic phrase boundary labeling of Mandarin speech synthesis database using context-dependent HMM. ICASSP 2013: 6875-6879
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Bao0DL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Bao0DL13
Yebo Bao, Hui Jiang, Li-Rong Dai, Cong Liu:
Incoherent training of deep neural networks to de-correlate bottleneck features for speech recognition. ICASSP 2013: 6980-6984
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangSJDM13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangSJDM13
Meng-Ge Wang, Yan Song, Bing Jiang, Li-Rong Dai, Ian McLoughlin:
Exemplar based language recognition method for short-duration speech segments. ICASSP 2013: 7354-7358
[c85]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenGSD13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenGSD13
LianWu Chen, Wu Guo, Yan Song, Li-Rong Dai:
Phoneme variation based synthesized speech discrimination for speaker verification. ICASSP 2013: 7874-7877
[c84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLSD13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLSD13
Ling-Hui Chen, Zhen-Hua Ling, Yan Song, Li-Rong Dai:
Joint spectral distribution modeling using restricted boltzmann machines for voice conversion. INTERSPEECH 2013: 3052-3056
2012
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LingD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LingD12
Zhen-Hua Ling, Li-Rong Dai:
Minimum Kullback-Leibler Divergence Parameter Generation for HMM-Based Speech Synthesis. IEEE Trans. Speech Audio Process. 20(5): 1492-1502 (2012)
[c83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/LingXSYC012
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/LingXSYC012
Zhen-Hua Ling, Xian-Jun Xia, Yang Song, Chen-Yu Yang, Ling-Hui Chen, Li-Rong Dai:
The USTC System for Blizzard Challenge 2012. Blizzard Challenge 2012
[c82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YinLLD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YinLLD12
Xiang Yin, Zhen-Hua Ling, Ming Lei, Li-Rong Dai:
Considering Global Variance of the Log Power Spectrum Derived from Mel-Cepstrum in HMM-based Parametric Speech Synthesis. INTERSPEECH 2012: 1147-1150
[c81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiangSGD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiangSGD12
Bing Jiang, Yan Song, Wu Guo, Li-Rong Dai:
Exemplar-Based Sparse Representation for Language Recognition on I-Vectors. INTERSPEECH 2012: 2057-2060
[c80]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangLD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangLD12
Xin Wang, Zhen-Hua Ling, Li-Rong Dai:
Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis. ISCSLP 2012: 84-87
[c79]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XuGSD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XuGSD12
Yong Xu, Wu Guo, Shan Su, Li-Rong Dai:
Spoken term detection for OOV terms based on triphone confusion matrix. ISCSLP 2012: 98-102
[c78]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XiaLYD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XiaLYD12
Xian-Jun Xia, Zhen-Hua Ling, Chen-Yu Yang, Li-Rong Dai:
Improved unit selection speech synthesis method utilizing subjective evaluation results on synthetic speech. ISCSLP 2012: 160-164
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WuSGD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WuSGD12
Kui Wu, Yan Song, Wu Guo, Li-Rong Dai:
Intra-conversation intra-speaker variability compensation for speaker clustering. ISCSLP 2012: 330-334
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XuGD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XuGD12
Yong Xu, Wu Guo, Li-Rong Dai:
A hybrid fragment / syllable-based system for improved OOV term detection. ISCSLP 2012: 378-382
2011
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuHDJ11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuHDJ11
Cong Liu, Yu Hu, Li-Rong Dai, Hui Jiang:
Trust Region-Based Optimization for Maximum Mutual Information Estimation of HMMs in Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2474-2485 (2011)
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/acpr/SongTLTD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acpr/SongTLTD11
Yan Song, Jinhui Tang, Xia Li, Qi Tian, Li-Rong Dai:
Effective image representation based on bi-layer visual codebook. ACPR 2011: 224-228
[c74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/ChenYL000W11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/ChenYL000W11
Ling-Hui Chen, Chen-Yu Yang, Zhen-Hua Ling, Yuan Jiang, Li-Rong Dai, Yu Hu, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2011. Blizzard Challenge 2011
[c73]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LongYSDG11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LongYSDG11
Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo:
Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model. ICASSP 2011: 4520-4523
[c72]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeiLD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeiLD11
Ming Lei, Zhen-Hua Ling, Li-Rong Dai:
Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis. ICASSP 2011: 4712-4715
[c71]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLMLGD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLMLGD11
Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai:
Factored covariance modeling for text-independent speaker verification. ICASSP 2011: 4856-4859
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLD11
Ling-Hui Chen, Zhen-Hua Ling, Li-Rong Dai:
Non-parallel training for voice conversion based on FT-GMM. ICASSP 2011: 5116-5119
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuLDW11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuLDW11
Heng Lu, Zhen-Hua Ling, Li-Rong Dai, Ren-Hua Wang:
Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score. ICASSP 2011: 5352-5355
[c68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LongYSDG11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LongYSDG11
Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo:
Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model. INTERSPEECH 2011: 373-376
[c67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenNZTLD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenNZTLD11
Ling-Hui Chen, Yoshihiko Nankaku, Heiga Zen, Keiichi Tokuda, Zhen-Hua Ling, Li-Rong Dai:
Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis. INTERSPEECH 2011: 1801-1804
[c66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeiYRLKD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeiYRLKD11
Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Li-Rong Dai:
Formant-Controlled HMM-Based Speech Synthesis. INTERSPEECH 2011: 2777-2780
2010
[j3]
- view
  - electronic edition @ aclclp.org.tw (open access)
  - details & citations
- export record
  dblp key:
  - journals/ijclclp/0002LDW10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijclclp/0002LDW10
Heng Lu, Zhen-Hua Ling, Li-Rong Dai, Ren-Hua Wang:
Cross-Validation and Minimum Generation Error based Decision Tree Pruning for HMM-based Speech Synthesis. Int. J. Comput. Linguistics Chin. Lang. Process. 15(1) (2010)
[c65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0006LLW000W10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0006LLW000W10
Yuan Jiang, Zhen-Hua Ling, Ming Lei, Cheng-Cheng Wang, Heng Lu, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2010. Blizzard Challenge 2010
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LeiLD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LeiLD10
Ming Lei, Zhen-Hua Ling, Li-Rong Dai:
Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis. ICASSP 2010: 4230-4233
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuoZLD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuoZLD10
Wu Guo, Zhao Zhang, Yanhua Long, Li-Rong Dai:
N-gram nearest neighbor algorithm for voice password system. ICASSP 2010: 4438-4441
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DuHDW10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DuHDW10
Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
HMM-based pseudo-clean speech synthesis for splice algorithm. ICASSP 2010: 4570-4573
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuHJD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuHJD10
Cong Liu, Yu Hu, Hui Jiang, Li-Rong Dai:
A bounded trust region optimization for discriminative training of HMMS in speech recognition. ICASSP 2010: 4914-4917
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/SongTWLD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/SongTWLD10
Yan Song, Qi Tian, Mengyue Wang, Heng Liu, Li-Rong Dai:
Multiple instance learning using visual phrases for object classification. ICME 2010: 649-654
[c59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuLWDW10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuLWDW10
Heng Lu, Zhen-Hua Ling, Si Wei, Li-Rong Dai, Ren-Hua Wang:
Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier. INTERSPEECH 2010: 162-165
[c58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LingHD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LingHD10
Zhen-Hua Ling, Yu Hu, Li-Rong Dai:
Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis. INTERSPEECH 2010: 825-828
[c57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLMLGD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLMLGD10
Eryu Wang, Kong-Aik Lee, Bin Ma, Haizhou Li, Wu Guo, Li-Rong Dai:
The estimation and kernel metric of spectral correlation for text-independent speaker verification. INTERSPEECH 2010: 1065-1068
[c56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LongDMG10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LongDMG10
Yanhua Long, Li-Rong Dai, Bin Ma, Wu Guo:
Effects of the phonological relevance in speaker verification. INTERSPEECH 2010: 2130-2133
[c55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeiWSLD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeiWSLD10
Ming Lei, Yi-Jian Wu, Frank K. Soong, Zhen-Hua Ling, Li-Rong Dai:
A hierarchical F0 modeling method for HMM-based speech synthesis. INTERSPEECH 2010: 2170-2173
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhaoLLDL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhaoLLDL10
Tian-Yi Zhao, Zhen-Hua Ling, Ming Lei, Li-Rong Dai, Qingfeng Liu:
Minimum generation error training for HMM-based prediction of articulatory movements. ISCSLP 2010: 99-102
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LingWD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LingWD10
Zhen-Hua Ling, Zhiguo Wang, Li-Rong Dai:
Statistical modeling of syllable-level F0 features for HMM-based unit selection speech synthesis. ISCSLP 2010: 144-147
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/XuSLZD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/XuSLZD10
Ying Xu, Yan Song, Yanhua Long, Hai-Bing Zhong, Li-Rong Dai:
The description of iFlyTek Speech Lab system for NIST2009 Language Recognition Evaluation. ISCSLP 2010: 157-161
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangGDLML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangGDLML10
Eryu Wang, Wu Guo, Li-Rong Dai, Kong-Aik Lee, Bin Ma, Haizhou Li:
Factor analysis based spatial correlation modeling for speaker verification. ISCSLP 2010: 166-170
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangLWHD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangLWHD10
Zhiguo Wang, Cong Liu, Hai-Kun Wang, Yu Hu, Li-Rong Dai:
Phonetic clustering based confidence measure for embedded speech recognition. ISCSLP 2010: 186-189
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LongDWMG10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LongDWMG10
Yanhua Long, Li-Rong Dai, Eryu Wang, Bin Ma, Wu Guo:
Non-negative matrix factorization based discriminative features for speaker verification. ISCSLP 2010: 291-295
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChenGD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChenGD10
LianWu Chen, Wu Guo, Li-Rong Dai:
Speaker verification against synthetic speech. ISCSLP 2010: 309-312
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChenLGD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChenLGD10
Ling-Hui Chen, Zhen-Hua Ling, Wu Guo, Li-Rong Dai:
GMM-based voice conversion with explicit modelling on feature transform. ISCSLP 2010: 364-368
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/YangL0GD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/YangL0GD10
Chen-Yu Yang, Zhen-Hua Ling, Heng Lu, Wu Guo, Li-Rong Dai:
Automatic phrase boundary labeling for Mandarin TTS corpus using context-dependent HMM. ISCSLP 2010: 374-377

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/cviu/WangHMHQSD09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cviu/WangHMHQSD09
Meng Wang, Xian-Sheng Hua, Tao Mei, Richang Hong, Guo-Jun Qi, Yan Song, Li-Rong Dai:
Semi-supervised kernel density estimation for video annotation. Comput. Vis. Image Underst. 113(3): 384-396 (2009)
[c45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0002LLWZC00W09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0002LLWZC00W09
Heng Lu, Zhen-Hua Ling, Ming Lei, Cheng-Cheng Wang, Huan-huan Zhao, Ling-Hui Chen, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2009. Blizzard Challenge 2009
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuWTDW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuWTDW09
Heng Lu, Yi-Jian Wu, Keiichi Tokuda, Li-Rong Dai, Ren-Hua Wang:
Full covariance state duration modeling for HMM-based speech synthesis. ICASSP 2009: 4033-4036
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiMLSZSYTKHPGLDNTEASSJ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMLSZSYTKHPGLDNTEASSJ09
Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Kärkkäinen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Li-Rong Dai, Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Chng Eng Siong, Tanja Schultz, Qin Jin:
The I4U system in NIST 2008 speaker recognition evaluation. ICASSP 2009: 4201-4204
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuoLLPWD09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuoLLPWD09
Wu Guo, Yanhua Long, Yijie Li, Lei Pan, Eryu Wang, Li-Rong Dai:
iFLY system for the NIST 2008 speaker recognition evaluation. ICASSP 2009: 4209-4212
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LongMLGSD09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LongMLGSD09
Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, Chng Eng Siong, Li-Rong Dai:
Exploiting prosodic information for Speaker Recognition. ICASSP 2009: 4225-4228
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/SongDW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/SongDW09
Yan Song, Li-Rong Dai, Ren-Hua Wang:
An automatic language identification method based on subspace analysis. ICME 2009: 598-601
[c39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLD09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLD09
Cheng-Cheng Wang, Zhen-Hua Ling, Li-Rong Dai:
Asynchronous F0 and spectrum modeling for HMM-based speech synthesis. INTERSPEECH 2009: 404-407
2008
[c38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/Ling0H0W08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/Ling0H0W08
Zhen-Hua Ling, Heng Lu, Guoping Hu, Li-Rong Dai, Ren-Hua Wang:
The USTC System for Blizzard Challenge 2008. Blizzard Challenge 2008
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QinWLWD08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QinWLWD08
Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang, Li-Rong Dai:
Minumum generation error linear regression based model adaptation for HMM-based speech synthesis. ICASSP 2008: 3953-3956
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QinWLWD08a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QinWLWD08a
Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang, Li-Rong Dai:
Minimum generation error criterion considering global/local variance for HMM-based speech synthesis. ICASSP 2008: 4621-4624
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhuYHWDW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhuYHWDW08
Bo Zhu, Zhi-Jie Yan, Yu Hu, Zhiguo Wang, Li-Rong Dai, Ren-Hua Wang:
Investigation on Adaptation Using Different Discriminative Training Criteria Based Linear Regression and Map. ISCSLP 2008: 93-96
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangLZD08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangLZD08
Cheng-Cheng Wang, Zhen-Hua Ling, Bu-Fan Zhang, Li-Rong Dai:
Multi-Layer F0 Modeling for HMM-Based Speech Synthesis. ISCSLP 2008: 129-132
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/0002LWHDW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/0002LWHDW08
Heng Lu, Zhen-Hua Ling, Si Wei, Yu Hu, Li-Rong Dai, Ren-Hua Wang:
Heteronym Verification for Mandarin Speech Synthesis. ISCSLP 2008: 137-140
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/GuoDW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/GuoDW08
Wu Guo, Li-Rong Dai, Ren-Hua Wang:
Double Gauss Based Unsupervised Score Normalization in Speaker Verification. ISCSLP 2008: 165-168
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LiuHLWDW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LiuHLWDW08
Cong Liu, Yu Hu, Xiong-Guo Lei, Zhiguo Wang, Li-Rong Dai, Ren-Hua Wang:
Exploiting Non-Target Region Information for Confidence Measure Based on Bayesian Information Criterion. ISCSLP 2008: 229-232
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LongGD08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LongGD08
Yanhua Long, Wu Guo, Li-Rong Dai:
Interfusing the Confused Region Score of Speaker Verification Systems. ISCSLP 2008: 314-317
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangGD08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangGD08
Eryu Wang, Wu Guo, Li-Rong Dai:
Parallel Phone Recognizer based MLLR Speaker Recognition. ISCSLP 2008: 318-321
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/SongD08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/SongD08
Yan Song, Li-Rong Dai:
A Sample and Feature Selection Scheme for GMM-SVM Based Language Recognition. ISCSLP 2008: 326-329
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/BingYD08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/BingYD08
Xu Bing, Yan Song, Li-Rong Dai:
The Adaptation Schemes In PR-SVM Based Language Recognition. ISCSLP 2008: 334-337
2007
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijsc/WangHMTQSD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijsc/WangHMTQSD07
Meng Wang, Xian-Sheng Hua, Tao Mei, Jinhui Tang, Guo-Jun Qi, Yan Song, Li-Rong Dai:
Interactive Video Annotation by Multi-Concept Multi-Modality Active Learning. Int. J. Semantic Comput. 1(4): 459-477 (2007)
[c26]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/blizzard/LingQ0G0W0ZYCH07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/LingQ0G0W0ZYCH07
Zhen-Hua Ling, Long Qin, Heng Lu, Yu Gao, Li-Rong Dai, Ren-Hua Wang, Yuan Jiang, Zhi-Wei Zhao, Jin-Hui Yang, Jie Chen, Guo-Ping Hu:
The USTC and iflytek speech synthesis systems for Blizzard Challenge 2007. Blizzard Challenge 2007
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/fskd/GuoLWD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/fskd/GuoLWD07
Wu Guo, Lei Pan, Ren-Hua Wang, Li-Rong Dai:
Angle of Models Distance as Test Algorithm in Speaker Verification. FSKD (4) 2007: 231-234
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangHSDW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangHSDW07
Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, Ren-Hua Wang:
An Interactive Video Annotation Frameowrk with Multiple Modalities. ICASSP (1) 2007: 957-960
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangHSHD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangHSHD07
Meng Wang, Xian-Sheng Hua, Yan Song, Richang Hong, Li-Rong Dai:
Lazy Learning Based Efficient Video Annotation. ICME 2007: 607-610
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangHYSD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangHYSD07
Meng Wang, Xian-Sheng Hua, Xun Yuan, Yan Song, Li-Rong Dai:
Multi-Graph Semi-Supervised Learning for Video Semantic Feature Extraction. ICME 2007: 1978-1981
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangMYSD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangMYSD07
Meng Wang, Tao Mei, Xun Yuan, Yan Song, Li-Rong Dai:
Video annotation by graph-based learning with neighborhood similarity. ACM Multimedia 2007: 325-328
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/WangHYSD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/WangHYSD07
Meng Wang, Xian-Sheng Hua, Xun Yuan, Yan Song, Li-Rong Dai:
Optimizing multi-graph learning: towards a unified video annotation scheme. ACM Multimedia 2007: 862-871
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/mmm/WangHSLDW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmm/WangHSLDW07
Meng Wang, Xian-Sheng Hua, Yan Song, Wei Lai, Li-Rong Dai, Ren-Hua Wang:
An Efficient Automatic Video Shot Size Annotation Scheme. MMM (1) 2007: 649-658
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/semco/WangHSTD07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/semco/WangHSTD07
Meng Wang, Xian-Sheng Hua, Yan Song, Jinhui Tang, Li-Rong Dai:
RMulti-Concept Multi-Modality Active Learning for Interactive Video Annotation. ICSC 2007: 321-328
2006
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/QiSHZD06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/QiSHZD06
Guo-Jun Qi, Yan Song, Xian-Sheng Hua, Hong-Jiang Zhang, Li-Rong Dai:
Video Annotation by Active Learning and Cluster Tuning. CVPR Workshops 2006: 114
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SongHDWW06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SongHDWW06
Yan Song, Xian-Sheng Hua, Li-Rong Dai, Meng Wang, Ren-Hua Wang:
An Automatic Video Semantic Annotation Scheme Based on Combination of Complementary Predictors. ICASSP (5) 2006: 501-504
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icdm/WangHSDZ06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icdm/WangHSDZ06
Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, HongJiang Zhang:
Semi-Supervised Kernel Regression. ICDM 2006: 1130-1135
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/SongQHDW06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/SongQHDW06
Yan Song, Guo-Jun Qi, Xian-Sheng Hua, Li-Rong Dai, Ren-Hua Wang:
Video Annotation by Active Learning and Semi-Supervised Ensembling. ICME 2006: 933-936
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/WangHDS06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WangHDS06
Meng Wang, Xian-Sheng Hua, Li-Rong Dai, Yan Song:
Enhanced Semi-Supervised Learning for Automatic Video Annotation. ICME 2006: 1485-1488
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/iscas/WangHSDL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscas/WangHSDL06
Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, Shipeng Li:
Automatic video annotation based on co-adaptation and label correction. ISCAS 2006
[c11]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/GuoW006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/GuoW006
Wu Guo, Renhua Wang, Lirong Dai:
Feature Extraction and Test Algorithm for Speaker Verification. ISCSLP 2006
[c10]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iscslp/Zhang00W06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Zhang00W06
Feng Zhang, Yan Song, Lirong Dai, Ren-Hua Wang:
Two-layer Distance Scheme in Matching Engine for Query by Humming System. ISCSLP 2006
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/mir/SongHQDWZ06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mir/SongHQDWZ06
Yan Song, Xian-Sheng Hua, Guo-Jun Qi, Li-Rong Dai, Meng Wang, HongJiang Zhang:
Efficient semantic annotation method for indexing large personal video database. Multimedia Information Retrieval 2006: 289-296
2005
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QinCLD05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QinCLD05
Long Qin, Gao Peng Chen, Zhen-Hua Ling, Li-Rong Dai:
An Improved Spectral and Prosodic Transformation Method in STRAIGHT-based Voice Conversion. ICASSP (1) 2005: 21-24
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiHWD05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiHWD05
Jianfeng Li, Guoping Hu, Ren-Hua Wang, Li-Rong Dai:
Sliding Window Smoothing For Maximum Entropy Based Intonational Phrase Prediction In Chinese. ICASSP (1) 2005: 285-288
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/mir/SongHDW05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mir/SongHDW05
Yan Song, Xian-Sheng Hua, Li-Rong Dai, Meng Wang:
Semi-automatic video annotation based on active learning with multiple complementary predictors. Multimedia Information Retrieval 2005: 97-104
2004
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiLWD04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiLWD04
Jin-Yu Li, Bo Liu, Ren-Hua Wang, Li-Rong Dai:
A complexity reduction of ETSI advanced front-end for DSR. ICASSP (1) 2004: 61-64
[c4]
- no documents available
  - details & citations
- export record
  dblp key:
  - conf/icip/LaiGWDZ04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/LaiGWDZ04
Wei Lai, Xiaodong Gu, Ren-Hua Wang, Li-Rong Dai, HongJiang Zhang:
A region based multiple frame-rate tradeoff of video streaming. ICIP 2004: 2067-2070
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LiDW04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LiDW04
Xiao-Bing Li, Li-Rong Dai, Ren-Hua Wang:
MCE-based training of subspace distribution clustering HMM. ISCSLP 2004: 113-116
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LiuDLW04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LiuDLW04
Bo Liu, Li-Rong Dai, Jin-Yu Li, Ren-Hua Wang:
Double Gaussian based feature normalization for robust speech recognition. ISCSLP 2004: 253-256
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/pcm/LaiGWDZ04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/pcm/LaiGWDZ04
Wei Lai, Xiaodong Gu, Ren-Hua Wang, Li-Rong Dai, HongJiang Zhang:
Perceptual Video Streaming by Adaptive Spatial-temporal Scalability. PCM (2) 2004: 431-438

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.