Xulong Zhang 0001

张旭龙

Person information

unicode name: 张旭龙
affiliation: Lab of Large Audio Model (LLAM), Shanghai, China
affiliation: Ping An Technology, Shenzhen, China
affiliation (PhD 2021): Fudan University, Shanghai, China

Journal Articles

2023
[j3]
  - https://dblp.org/rec/journals/tomccap/Duan0ZTLO23
Wei Duan, Yi Yu, Xulong Zhang, Suhua Tang, Wei Li, Keizo Oyama:
Melody Generation from Lyrics with Local Interpretability. ACM Trans. Multim. Comput. Commun. Appl. 19(3): 124:1-124:21 (2023)
2017
[j2]
  - https://dblp.org/rec/journals/jsjkx/LiFWZ18
Wei Li, Xiangyi Feng, Yiming Wu, Xulong Zhang:
流行音乐主旋律提取技术综述 (Review on Main Melody Extraction from Pop Music). 计算机科学 (Computer Science) 44(5): 1-5 (2017)
2013
[j1]
  - https://dblp.org/rec/journals/ijoe/WuZZZ13
Qingtao Wu, Xulong Zhang, Ruijuan Zheng, Mingchuan Zhang:
Probability-Symmetric Storage Allocation for Distributed Storage Systems Based on Network Coding. Int. J. Online Biomed. Eng. 9(S4): 64-68 (2013)

Conference and Workshop Papers

2024
[c58]
  - https://dblp.org/rec/conf/apweb/ShiWZCYX24
Haoxiang Shi, Jianzong Wang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao:
RSET: Remapping-Based Sorting Method for Emotion Transfer Speech Synthesis. APWeb/WAIM (1) 2024: 90-104
[c57]
  - https://dblp.org/rec/conf/cscwd/WangL00024
Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
Medical Speech Symptoms Classification via Disentangled Representation. CSCWD 2024: 1110-1115
[c56]
  - https://dblp.org/rec/conf/emnlp/Li00W24
Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang:
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding. EMNLP 2024: 4500-4511
[c55]
  - https://dblp.org/rec/conf/icassp/DengT000W24
Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval. ICASSP 2024: 7150-7154
[c54]
  - https://dblp.org/rec/conf/icassp/Zhang0000W24
Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang:
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model. ICASSP 2024: 8276-8280
[c53]
  - https://dblp.org/rec/conf/icassp/Tang000W24
Haobin Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
ED-TTS: Multi-Scale Emotion Modeling Using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. ICASSP 2024: 12146-12150
[c52]
  - https://dblp.org/rec/conf/icic/WangSLZCX24
Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao:
RREH: Reconstruction Relations Embedded Hashing for Semi-paired Cross-Modal Retrieval. ICIC (LNAI 5) 2024: 374-385
[c51]
  - https://dblp.org/rec/conf/icic/ShiZCZYXW24
Haoxiang Shi, Xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang:
Enhancing Emotion Recognition in Conversation Through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning. ICIC (LNAI 3) 2024: 391-401
[c50]
  - https://dblp.org/rec/conf/ijcnn/DengWZCX24
Yimin Deng, Jianzong Wang, Xulong Zhang, Ning Cheng, Jing Xiao:
Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation. IJCNN 2024: 1-7
[c49]
  - https://dblp.org/rec/conf/ijcnn/LiWZZXC24
Pengcheng Li, Jianzong Wang, Xulong Zhang, Yong Zhang, Jing Xiao, Ning Cheng:
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion. IJCNN 2024: 1-7
[c48]
  - https://dblp.org/rec/conf/ijcnn/LiangWZZCX24
Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao:
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning. IJCNN 2024: 1-7
[c47]
  - https://dblp.org/rec/conf/ijcnn/OuyangWZLLZCX24
Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering. IJCNN 2024: 1-7
[c46]
  - https://dblp.org/rec/conf/ijcnn/WangLZCX24
Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
ConTuner: Singing Voice Beautifying with Pitch and Expressiveness Condition. IJCNN 2024: 1-6
[c45]
  - https://dblp.org/rec/conf/ijcnn/WangLZCX24a
Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization. IJCNN 2024: 1-7
2023
[c44]
  - https://dblp.org/rec/conf/adma/ZhangWCX23
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Voice Conversion with Denoising Diffusion Probabilistic GAN Models. ADMA (4) 2023: 154-167
[c43]
  - https://dblp.org/rec/conf/adma/ZhuZWCX23
Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Symbolic and Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music. ADMA (4) 2023: 168-181
[c42]
  - https://dblp.org/rec/conf/adma/ZhangWCSZX23
Xulong Zhang, Jianzong Wang, Ning Cheng, Yifu Sun, Chuanyao Zhang, Jing Xiao:
Machine Unlearning Methodology Based on Stochastic Teacher Network. ADMA (5) 2023: 250-261
[c41]
  - https://dblp.org/rec/conf/apweb/LiuGJTZLZL23
Wenting Liu, Zhaozhong Gui, Guilin Jiang, Lihua Tang, Lichun Zhou, Wan Leng, Xulong Zhang, Yujiang Liu:
Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data. APWeb-WAIM (2) 2023: 74-88
[c40]
  - https://dblp.org/rec/conf/apweb/YeZSXHLZHY23
Yu Ye, Gongjin Zhang, Hongbiao Si, Liang Xu, Shenghua Hu, Yong Li, Xulong Zhang, Kaiyu Hu, Fangzhou Ye:
A Hierarchy-Based Analysis Approach for Blended Learning: A Case Study with Chinese Students. APWeb-WAIM (2) 2023: 89-102
[c39]
  - https://dblp.org/rec/conf/apweb/ZhouYLGZSTSZC23
Shanyi Zhou, Ning Yan, Zhijun Li, Mo Geng, Xulong Zhang, Hongbiao Si, Lihua Tang, Wenyuan Sun, Longda Zhang, Yi Cao:
Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model. APWeb-WAIM (2) 2023: 147-161
[c38]
  - https://dblp.org/rec/conf/apweb/GuoSJZLZZL23
Hao Guo, Hongbiao Si, Guilin Jiang, Wei Zhang, Zhiyan Liu, Xuanyi Zhu, Xulong Zhang, Yang Liu:
An Empirical Study of Attention Networks for Semantic Segmentation. APWeb-WAIM (1) 2023: 222-235
[c37]
  - https://dblp.org/rec/conf/bdcloud/WangDL00023
Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding. ISPA/BDCloud/SocialCom/SustainCom 2023: 752-757
[c36]
  - https://dblp.org/rec/conf/bdcloud/WangL00023
Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. ISPA/BDCloud/SocialCom/SustainCom 2023: 923-928
[c35]
  - https://dblp.org/rec/conf/bdcloud/Deng0W0023
Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation. ISPA/BDCloud/SocialCom/SustainCom 2023: 1143-1148
[c34]
  - https://dblp.org/rec/conf/icassp/RuZWCX23
Ganghui Ru, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Music Genre Classification from multi-modal Properties of Music and Genre Correlations Perspective. ICASSP 2023: 1-5
[c33]
  - https://dblp.org/rec/conf/icassp/TangZWCX23
Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Learning Speech Representations with Flexible Hidden Feature Dimensions. ICASSP 2023: 1-5
[c32]
  - https://dblp.org/rec/conf/icassp/TangZWCX23a
Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
VQ-CL: Learning Disentangled Speech Representations with Contrastive Learning and Vector Quantization. ICASSP 2023: 1-5
[c31]
  - https://dblp.org/rec/conf/icassp/TangZWCX23b
Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. ICASSP 2023: 1-5
[c30]
  - https://dblp.org/rec/conf/icassp/ZhangTWCLX23
Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
Dynamic Alignment Mask CTC: Improved Mask CTC With Aligned Cross Entropy. ICASSP 2023: 1-5
[c29]
  - https://dblp.org/rec/conf/icassp/ZhuZWCX23
Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving EEG-based Emotion Recognition by Fusing Time-Frequency and Spatial Representations. ICASSP 2023: 1-5
[c28]
  - https://dblp.org/rec/conf/ictai/Si0YW0023
Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao:
AOSR-Net: All-in-One Sandstorm Removal Network. ICTAI 2023: 641-645
[c27]
  - https://dblp.org/rec/conf/ictai/Wang0S0X23
Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao:
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework. ICTAI 2023: 905-912
[c26]
  - https://dblp.org/rec/conf/ictai/Luo0WL0X23
Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao:
Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval. ICTAI 2023: 913-917
[c25]
  - https://dblp.org/rec/conf/ijcnn/WangZTSCX23
Jianzong Wang, Xulong Zhang, Haobin Tang, Aolan Sun, Ning Cheng, Jing Xiao:
SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model. IJCNN 2023: 1-7
[c24]
  - https://dblp.org/rec/conf/interspeech/Tang0W0023
Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis. INTERSPEECH 2023: 12-16
[c23]
  - https://dblp.org/rec/conf/interspeech/Sun0W0H023
Yifu Sun, Xulong Zhang, Jianzong Wang, Ning Cheng, Kaiyu Hu, Jing Xiao:
Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning. INTERSPEECH 2023: 5456-5460
[c22]
  - https://dblp.org/rec/conf/mm/DengT0W0023
Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. ACM Multimedia 2023: 184-192
2022
[c21]
  - https://dblp.org/rec/conf/apweb/ZhangWCXX22
Xulong Zhang, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao:
Shallow Diffusion Motion Model for Talking Face Generation from Speech. APWeb/WAIM (2) 2022: 144-157
[c20]
  - https://dblp.org/rec/conf/icassp/WangZWCX22
Qiqi Wang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning. ICASSP 2022: 3184-3188
[c19]
  - https://dblp.org/rec/conf/icassp/ZhaoZWCX22
Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech. ICASSP 2022: 4293-4297
[c18]
  - https://dblp.org/rec/conf/icassp/TangZWCX22
Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Avqvc: One-Shot Voice Conversion By Vector Quantization With Applying Contrastive Learning. ICASSP 2022: 4613-4617
[c17]
  - https://dblp.org/rec/conf/iconip/SiW0QC022
Shijing Si, Jianzong Wang, Xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Boosting StarGANs for Voice Conversion with Contrastive Discriminator. ICONIP (2) 2022: 355-366
[c16]
  - https://dblp.org/rec/conf/ictai/SunZLWCX22
Aolan Sun, Xulong Zhang, Tiandong Ling, Jianzong Wang, Ning Cheng, Jing Xiao:
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar. ICTAI 2022: 1002-1006
[c15]
  - https://dblp.org/rec/conf/ijcnn/ZhangWCX22
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
SUSing: SU-net for Singing Voice Synthesis. IJCNN 2022: 1-7
[c14]
  - https://dblp.org/rec/conf/ijcnn/ZhangWCX22a
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MDCNN-SID: Multi-scale Dilated Convolution Network for Singer Identification. IJCNN 2022: 1-7
[c13]
  - https://dblp.org/rec/conf/ijcnn/ZhangWCX22b
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. IJCNN 2022: 1-7
[c12]
  - https://dblp.org/rec/conf/ijcnn/ZhangWCX22c
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features. IJCNN 2022: 1-7
[c11]
  - https://dblp.org/rec/conf/ijcnn/ZhangWCX22d
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSID: Singer Identification with Domain Adaptation for Metaverse. IJCNN 2022: 1-7
[c10]
  - https://dblp.org/rec/conf/interspeech/LuoWCX0022
Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Xulong Zhang, Jing Xiao:
Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation. INTERSPEECH 2022: 5313-5317
[c9]
  - https://dblp.org/rec/conf/msn/ZhangWCX22
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data. MSN 2022: 456-460
[c8]
  - https://dblp.org/rec/conf/msn/ZhangWCZX22
Xulong Zhang, Jianzong Wang, Ning Cheng, Kexin Zhu, Jing Xiao:
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach. MSN 2022: 485-489
[c7]
  - https://dblp.org/rec/conf/msn/ZhangWCX22a
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse. MSN 2022: 841-846
[c6]
  - https://dblp.org/rec/conf/msn/ZhangWCZZX22
Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao:
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition. MSN 2022: 915-920
[c5]
  - https://dblp.org/rec/conf/msn/ZhangWCX22b
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Semi-Supervised Learning Based on Reference Model for Low-resource TTS. MSN 2022: 966-971
[c4]
  - https://dblp.org/rec/conf/msn/ZhangWCX22c
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Imbalanced Text Classification with Dynamic Curriculum Learning. MSN 2022: 1031-1036
2021
[c3]
  - https://dblp.org/rec/conf/asru/ZhangWCXX21
Xulong Zhang, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao:
Cyclegean: Cycle Generative Enhanced Adversarial Network for Voice Conversion. ASRU 2021: 930-937
[c2]
  - https://dblp.org/rec/conf/asru/TangZWCZXX21
Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Zhen Zeng, Edward Xiao, Jing Xiao:
TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training. ASRU 2021: 938-945
[c1]
  - https://dblp.org/rec/conf/icassp/ZhangQYSL21
Xulong Zhang, Jiale Qian, Yi Yu, Yifu Sun, Wei Li:
Singer Identification Using Deep Timbre Feature Learning with KNN-NET. ICASSP 2021: 3380-3384

Informal and Other Publications

2024
[i57]
  - https://dblp.org/rec/journals/corr/abs-2401-08049
Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang:
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model. CoRR abs/2401.08049 (2024)
[i56]
  - https://dblp.org/rec/journals/corr/abs-2401-08096
Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval. CoRR abs/2401.08096 (2024)
[i55]
  - https://dblp.org/rec/journals/corr/abs-2401-08166
Haobin Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. CoRR abs/2401.08166 (2024)
[i54]
  - https://dblp.org/rec/journals/corr/abs-2403-05000
Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
Medical Speech Symptoms Classification via Disentangled Representation. CoRR abs/2403.05000 (2024)
[i53]
  - https://dblp.org/rec/journals/corr/abs-2404-19187
Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition. CoRR abs/2404.19187 (2024)
[i52]
  - https://dblp.org/rec/journals/corr/abs-2404-19212
Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao:
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning. CoRR abs/2404.19212 (2024)
[i51]
  - https://dblp.org/rec/journals/corr/abs-2404-19214
Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization. CoRR abs/2404.19214 (2024)
[i50]
  - https://dblp.org/rec/journals/corr/abs-2404-19316
Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering. CoRR abs/2404.19316 (2024)
[i49]
  - https://dblp.org/rec/journals/corr/abs-2405-00603
Yimin Deng, Jianzong Wang, Xulong Zhang, Ning Cheng, Jing Xiao:
Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation. CoRR abs/2405.00603 (2024)
[i48]
  - https://dblp.org/rec/journals/corr/abs-2405-00930
Pengcheng Li, Jianzong Wang, Xulong Zhang, Yong Zhang, Jing Xiao, Ning Cheng:
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion. CoRR abs/2405.00930 (2024)
[i47]
  - https://dblp.org/rec/journals/corr/abs-2405-17028
Haoxiang Shi, Jianzong Wang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao:
RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis. CoRR abs/2405.17028 (2024)
[i46]
  - https://dblp.org/rec/journals/corr/abs-2405-17777
Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao:
RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval. CoRR abs/2405.17777 (2024)
[i45]
  - https://dblp.org/rec/journals/corr/abs-2405-17900
Haoxiang Shi, Xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang:
Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning. CoRR abs/2405.17900 (2024)
[i44]
  - https://dblp.org/rec/journals/corr/abs-2409-19627
Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang:
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding. CoRR abs/2409.19627 (2024)
[i43]
  - https://dblp.org/rec/journals/corr/abs-2410-21897
Yifu Sun, Xulong Zhang, Monan Zhou, Wei Li:
Semi-Supervised Self-Learning Enhanced Music Emotion Recognition. CoRR abs/2410.21897 (2024)
[i42]
  - https://dblp.org/rec/journals/corr/abs-2411-13089
Xulong Zhang, Xiaoyang Qu, Haoxiang Shi, Chunguang Xiao, Jianzong Wang:
ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations. CoRR abs/2411.13089 (2024)
2023
[i41]
  - https://dblp.org/rec/journals/corr/abs-2303-07667
Ganghui Ru, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Music Genre Classification from multi-modal properties of music and genre correlations Perspective. CoRR abs/2303.07667 (2023)
[i40]
  - https://dblp.org/rec/journals/corr/abs-2303-07682
Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. CoRR abs/2303.07682 (2023)
[i39]
  - https://dblp.org/rec/journals/corr/abs-2303-07687
Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy. CoRR abs/2303.07687 (2023)
[i38]
  - https://dblp.org/rec/journals/corr/abs-2303-11421
Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving EEG-based Emotion Recognition by Fusing Time-frequency And Spatial Representations. CoRR abs/2303.11421 (2023)
[i37]
  dblp key:
  - journals/corr/abs-2304-11547
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-11547
Jianzong Wang, Xulong Zhang, Haobin Tang, Aolan Sun, Ning Cheng, Jing Xiao:
SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model. CoRR abs/2304.11547 (2023)
[i36]
  dblp key:
  - journals/corr/abs-2306-00648
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00648
Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis. CoRR abs/2306.00648 (2023)
[i35]
  dblp key:
  - journals/corr/abs-2308-11084
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-11084
Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. CoRR abs/2308.11084 (2023)
[i34]
  dblp key:
  - journals/corr/abs-2308-12792
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-12792
Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller:
Sparks of Large Audio Models: A Survey and Outlook. CoRR abs/2308.12792 (2023)
[i33]
  dblp key:
  - journals/corr/abs-2308-14317
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14317
Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Symbolic & Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music. CoRR abs/2308.14317 (2023)
[i32]
  dblp key:
  - journals/corr/abs-2308-14319
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14319
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Voice Conversion with Denoising Diffusion Probabilistic GAN Models. CoRR abs/2308.14319 (2023)
[i31]
  dblp key:
  - journals/corr/abs-2308-14322
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14322
Xulong Zhang, Jianzong Wang, Ning Cheng, Yifu Sun, Chuanyao Zhang, Jing Xiao:
Machine Unlearning Methodology base on Stochastic Teacher Network. CoRR abs/2308.14322 (2023)
[i30]
  dblp key:
  - journals/corr/abs-2309-07509
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07509
Zipeng Qi, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks. CoRR abs/2309.07509 (2023)
[i29]
  dblp key:
  - journals/corr/abs-2309-08837
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08837
Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao:
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework. CoRR abs/2309.08837 (2023)
[i28]
  dblp key:
  - journals/corr/abs-2309-08838
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08838
Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao:
AOSR-Net: All-in-One Sandstorm Removal Network. CoRR abs/2309.08838 (2023)
[i27]
  dblp key:
  - journals/corr/abs-2309-08839
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08839
Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao:
Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval. CoRR abs/2309.08839 (2023)
[i26]
  dblp key:
  - journals/corr/abs-2309-10217
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10217
Hao Guo, Hongbiao Si, Guilin Jiang, Wei Zhang, Zhiyan Liu, Xuanyi Zhu, Xulong Zhang, Yang Liu:
An Empirical Study of Attention Networks for Semantic Segmentation. CoRR abs/2309.10217 (2023)
[i25]
  dblp key:
  - journals/corr/abs-2309-10218
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10218
Yu Ye, Gongjin Zhang, Hongbiao Si, Liang Xu, Shenghua Hu, Yong Li, Xulong Zhang, Kaiyu Hu, Fangzhou Ye:
A Hierarchy-based Analysis Approach for Blended Learning: A Case Study with Chinese Students. CoRR abs/2309.10218 (2023)
[i24]
  dblp key:
  - journals/corr/abs-2309-10986
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10986
Shanyi Zhou, Ning Yan, Zhijun Li, Mo Geng, Xulong Zhang, Hongbiao Si, Lihua Tang, Wenyuan Sun, Longda Zhang, Yi Cao:
Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model. CoRR abs/2309.10986 (2023)
[i23]
  dblp key:
  - journals/corr/abs-2311-07965
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-07965
Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. CoRR abs/2311.07965 (2023)
[i22]
  dblp key:
  - journals/corr/abs-2311-08670
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-08670
Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation. CoRR abs/2311.08670 (2023)
[i21]
  dblp key:
  - journals/corr/abs-2311-08673
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-08673
Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding. CoRR abs/2311.08673 (2023)
2022
[i20]
  dblp key:
  - journals/corr/abs-2202-10020
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-10020
Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learning. CoRR abs/2202.10020 (2022)
[i19]
  dblp key:
  - journals/corr/abs-2202-10712
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-10712
Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-shot Multi-speaker Text-to-Speech. CoRR abs/2202.10712 (2022)
[i18]
  dblp key:
  - journals/corr/abs-2202-10976
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-10976
Qiqi Wang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning. CoRR abs/2202.10976 (2022)
[i17]
  dblp key:
  - journals/corr/abs-2205-11817
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-11817
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features. CoRR abs/2205.11817 (2022)
[i16]
  dblp key:
  - journals/corr/abs-2205-11821
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-11821
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSID: Singer Identification with Domain Adaptation for Metaverse. CoRR abs/2205.11821 (2022)
[i15]
  dblp key:
  - journals/corr/abs-2205-11824
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-11824
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. CoRR abs/2205.11824 (2022)
[i14]
  dblp key:
  - journals/corr/abs-2205-11841
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-11841
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
SUSing: SU-net for Singing Voice Synthesis. CoRR abs/2205.11841 (2022)
[i13]
  dblp key:
  - journals/corr/abs-2206-13689
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-13689
Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Xulong Zhang, Jing Xiao:
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation. CoRR abs/2206.13689 (2022)
[i12]
  dblp key:
  - journals/corr/abs-2208-04035
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-04035
Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Zhen Zeng, Edward Xiao, Jing Xiao:
TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training. CoRR abs/2208.04035 (2022)
[i11]
  dblp key:
  - journals/corr/abs-2209-10088
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-10088
Shijing Si, Jianzong Wang, Xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Boosting Star-GANs for Voice Conversion with Contrastive Discriminator. CoRR abs/2209.10088 (2022)
[i10]
  dblp key:
  - journals/corr/abs-2210-06877
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-06877
Aolan Sun, Xulong Zhang, Tiandong Ling, Jianzong Wang, Ning Cheng, Jing Xiao:
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar. CoRR abs/2210.06877 (2022)
[i9]
  dblp key:
  - journals/corr/abs-2210-13803
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13803
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data. CoRR abs/2210.13803 (2022)
[i8]
  dblp key:
  - journals/corr/abs-2210-13805
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13805
Xulong Zhang, Jianzong Wang, Ning Cheng, Kexin Zhu, Jing Xiao:
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach. CoRR abs/2210.13805 (2022)
[i7]
  dblp key:
  - journals/corr/abs-2210-13811
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13811
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse. CoRR abs/2210.13811 (2022)
[i6]
  dblp key:
  - journals/corr/abs-2210-14723
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-14723
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Semi-Supervised Learning Based on Reference Model for Low-resource TTS. CoRR abs/2210.14723 (2022)
[i5]
  dblp key:
  - journals/corr/abs-2210-14724
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-14724
Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Imbalanced Text Classification with Dynamic Curriculum Learning. CoRR abs/2210.14724 (2022)
[i4]
  dblp key:
  - journals/corr/abs-2210-14725
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-14725
Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao:
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition. CoRR abs/2210.14725 (2022)
2021
[i3]
  dblp key:
  - journals/corr/abs-2102-10236
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-10236
Xulong Zhang, Jiale Qian, Yi Yu, Yifu Sun, Wei Li:
Singer Identification Using Deep Timbre Feature Learning with KNN-Net. CoRR abs/2102.10236 (2021)
2020
[i2]
  dblp key:
  - journals/corr/abs-2004-04040
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-04040
Xulong Zhang, Yi Yu, Xi Chen, Wei Li:
Comparison for Improvements of Singing Voice Detection System Based on Vocal Separation. CoRR abs/2004.04040 (2020)
[i1]
  dblp key:
  - journals/corr/abs-2004-04371
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-04371
Xulong Zhang, Yongwei Gao, Yi Yu, Wei Li:
Music Artist Classification with WaveNet Classifier for Raw Waveform Audio Data. CoRR abs/2004.04371 (2020)
