default search action
Xulong Zhang 0001
Person information
- unicode name: 张旭龙
- affiliation: Lab of Large Audio Model (LLAM), Shanghai, China
- affiliation: Ping An Technology, Shenzhen, China
- affiliation (PhD 2021): Fudan University, Shanghai, China
Other persons with the same name
- Xulong Zhang 0002 — Shanxi Datong University, School of Computer and Network Engineering, Datong, China (and 1 more)
- Xulong Zhang 0003 — Beihang University, Robotic Research Institute, Beijing, China
- Xulong Zhang 0004 — Harbin Institute of Technology, Harbin, China
- Xulong Zhang 0005 — Wuhan University of Technology, Wuhan, Hubei, China
- Xulong Zhang 0006 — Lamar University, Beaumont, TX, USA
- Xulong Zhang 0007 — Nanjing University, Nanjing, China
- Xulong Zhang 0008 — Central South University, School of Automation, Changsha, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c58]Haoxiang Shi, Jianzong Wang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao:
RSET: Remapping-Based Sorting Method for Emotion Transfer Speech Synthesis. APWeb/WAIM (1) 2024: 90-104 - [c57]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
Medical Speech Symptoms Classification via Disentangled Representation. CSCWD 2024: 1110-1115 - [c56]Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang:
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding. EMNLP 2024: 4500-4511 - [c55]Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval. ICASSP 2024: 7150-7154 - [c54]Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang:
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model. ICASSP 2024: 8276-8280 - [c53]Haobin Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
ED-TTS: Multi-Scale Emotion Modeling Using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. ICASSP 2024: 12146-12150 - [c52]Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao:
RREH: Reconstruction Relations Embedded Hashing for Semi-paired Cross-Modal Retrieval. ICIC (LNAI 5) 2024: 374-385 - [c51]Haoxiang Shi, Xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang:
Enhancing Emotion Recognition in Conversation Through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning. ICIC (LNAI 3) 2024: 391-401 - [c50]Yimin Deng, Jianzong Wang, Xulong Zhang, Ning Cheng, Jing Xiao:
Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation. IJCNN 2024: 1-7 - [c49]Pengcheng Li, Jianzong Wang, Xulong Zhang, Yong Zhang, Jing Xiao, Ning Cheng:
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion. IJCNN 2024: 1-7 - [c48]Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao:
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning. IJCNN 2024: 1-7 - [c47]Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering. IJCNN 2024: 1-7 - [c46]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
ConTuner: Singing Voice Beautifying with Pitch and Expressiveness Condition. IJCNN 2024: 1-6 - [c45]Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization. IJCNN 2024: 1-7 - [i56]Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang:
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model. CoRR abs/2401.08049 (2024) - [i55]Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval. CoRR abs/2401.08096 (2024) - [i54]Haobin Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. CoRR abs/2401.08166 (2024) - [i53]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
Medical Speech Symptoms Classification via Disentangled Representation. CoRR abs/2403.05000 (2024) - [i52]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition. CoRR abs/2404.19187 (2024) - [i51]Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao:
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning. CoRR abs/2404.19212 (2024) - [i50]Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization. CoRR abs/2404.19214 (2024) - [i49]Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering. CoRR abs/2404.19316 (2024) - [i48]Yimin Deng, Jianzong Wang, Xulong Zhang, Ning Cheng, Jing Xiao:
Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation. CoRR abs/2405.00603 (2024) - [i47]Pengcheng Li, Jianzong Wang, Xulong Zhang, Yong Zhang, Jing Xiao, Ning Cheng:
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion. CoRR abs/2405.00930 (2024) - [i46]Haoxiang Shi, Jianzong Wang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao:
RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis. CoRR abs/2405.17028 (2024) - [i45]Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao:
RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval. CoRR abs/2405.17777 (2024) - [i44]Haoxiang Shi, Xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang:
Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning. CoRR abs/2405.17900 (2024) - [i43]Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang:
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding. CoRR abs/2409.19627 (2024) - [i42]Yifu Sun, Xulong Zhang, Monan Zhou, Wei Li:
Semi-Supervised Self-Learning Enhanced Music Emotion Recognition. CoRR abs/2410.21897 (2024) - 2023
- [j3]Wei Duan, Yi Yu, Xulong Zhang, Suhua Tang, Wei Li, Keizo Oyama:
Melody Generation from Lyrics with Local Interpretability. ACM Trans. Multim. Comput. Commun. Appl. 19(3): 124:1-124:21 (2023) - [c44]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Voice Conversion with Denoising Diffusion Probabilistic GAN Models. ADMA (4) 2023: 154-167 - [c43]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Symbolic and Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music. ADMA (4) 2023: 168-181 - [c42]Xulong Zhang, Jianzong Wang, Ning Cheng, Yifu Sun, Chuanyao Zhang, Jing Xiao:
Machine Unlearning Methodology Based on Stochastic Teacher Network. ADMA (5) 2023: 250-261 - [c41]Wenting Liu, Zhaozhong Gui, Guilin Jiang, Lihua Tang, Lichun Zhou, Wan Leng, Xulong Zhang, Yujiang Liu:
Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data. APWeb-WAIM (2) 2023: 74-88 - [c40]Yu Ye, Gongjin Zhang, Hongbiao Si, Liang Xu, Shenghua Hu, Yong Li, Xulong Zhang, Kaiyu Hu, Fangzhou Ye:
A Hierarchy-Based Analysis Approach for Blended Learning: A Case Study with Chinese Students. APWeb-WAIM (2) 2023: 89-102 - [c39]Shanyi Zhou, Ning Yan, Zhijun Li, Mo Geng, Xulong Zhang, Hongbiao Si, Lihua Tang, Wenyuan Sun, Longda Zhang, Yi Cao:
Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model. APWeb-WAIM (2) 2023: 147-161 - [c38]Hao Guo, Hongbiao Si, Guilin Jiang, Wei Zhang, Zhiyan Liu, Xuanyi Zhu, Xulong Zhang, Yang Liu:
An Empirical Study of Attention Networks for Semantic Segmentation. APWeb-WAIM (1) 2023: 222-235 - [c37]Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding. ISPA/BDCloud/SocialCom/SustainCom 2023: 752-757 - [c36]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. ISPA/BDCloud/SocialCom/SustainCom 2023: 923-928 - [c35]Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation. ISPA/BDCloud/SocialCom/SustainCom 2023: 1143-1148 - [c34]Ganghui Ru, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Music Genre Classification from multi-modal Properties of Music and Genre Correlations Perspective. ICASSP 2023: 1-5 - [c33]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Learning Speech Representations with Flexible Hidden Feature Dimensions. ICASSP 2023: 1-5 - [c32]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
VQ-CL: Learning Disentangled Speech Representations with Contrastive Learning and Vector Quantization. ICASSP 2023: 1-5 - [c31]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. ICASSP 2023: 1-5 - [c30]Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
Dynamic Alignment Mask CTC: Improved Mask CTC With Aligned Cross Entropy. ICASSP 2023: 1-5 - [c29]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving EEG-based Emotion Recognition by Fusing Time-Frequency and Spatial Representations. ICASSP 2023: 1-5 - [c28]Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao:
AOSR-Net: All-in-One Sandstorm Removal Network. ICTAI 2023: 641-645 - [c27]Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao:
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework. ICTAI 2023: 905-912 - [c26]Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao:
Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval. ICTAI 2023: 913-917 - [c25]Jianzong Wang, Xulong Zhang, Haobin Tang, Aolan Sun, Ning Cheng, Jing Xiao:
SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model. IJCNN 2023: 1-7 - [c24]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis. INTERSPEECH 2023: 12-16 - [c23]Yifu Sun, Xulong Zhang, Jianzong Wang, Ning Cheng, Kaiyu Hu, Jing Xiao:
Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning. INTERSPEECH 2023: 5456-5460 - [c22]Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. ACM Multimedia 2023: 184-192 - [i41]Ganghui Ru, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Music Genre Classification from multi-modal properties of music and genre correlations Perspective. CoRR abs/2303.07667 (2023) - [i40]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. CoRR abs/2303.07682 (2023) - [i39]Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy. CoRR abs/2303.07687 (2023) - [i38]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving EEG-based Emotion Recognition by Fusing Time-frequency And Spatial Representations. CoRR abs/2303.11421 (2023) - [i37]Jianzong Wang, Xulong Zhang, Haobin Tang, Aolan Sun, Ning Cheng, Jing Xiao:
SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model. CoRR abs/2304.11547 (2023) - [i36]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis. CoRR abs/2306.00648 (2023) - [i35]Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. CoRR abs/2308.11084 (2023) - [i34]Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller:
Sparks of Large Audio Models: A Survey and Outlook. CoRR abs/2308.12792 (2023) - [i33]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Symbolic & Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music. CoRR abs/2308.14317 (2023) - [i32]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Voice Conversion with Denoising Diffusion Probabilistic GAN Models. CoRR abs/2308.14319 (2023) - [i31]Xulong Zhang, Jianzong Wang, Ning Cheng, Yifu Sun, Chuanyao Zhang, Jing Xiao:
Machine Unlearning Methodology base on Stochastic Teacher Network. CoRR abs/2308.14322 (2023) - [i30]Zipeng Qi, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks. CoRR abs/2309.07509 (2023) - [i29]Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao:
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework. CoRR abs/2309.08837 (2023) - [i28]Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao:
AOSR-Net: All-in-One Sandstorm Removal Network. CoRR abs/2309.08838 (2023) - [i27]Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao:
Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval. CoRR abs/2309.08839 (2023) - [i26]Hao Guo, Hongbiao Si, Guilin Jiang, Wei Zhang, Zhiyan Liu, Xuanyi Zhu, Xulong Zhang, Yang Liu:
An Empirical Study of Attention Networks for Semantic Segmentation. CoRR abs/2309.10217 (2023) - [i25]Yu Ye, Gongjin Zhang, Hongbiao Si, Liang Xu, Shenghua Hu, Yong Li, Xulong Zhang, Kaiyu Hu, Fangzhou Ye:
A Hierarchy-based Analysis Approach for Blended Learning: A Case Study with Chinese Students. CoRR abs/2309.10218 (2023) - [i24]Shanyi Zhou, Ning Yan, Zhijun Li, Mo Geng, Xulong Zhang, Hongbiao Si, Lihua Tang, Wenyuan Sun, Longda Zhang, Yi Cao:
Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model. CoRR abs/2309.10986 (2023) - [i23]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. CoRR abs/2311.07965 (2023) - [i22]Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation. CoRR abs/2311.08670 (2023) - [i21]Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding. CoRR abs/2311.08673 (2023) - 2022
- [c21]Xulong Zhang, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao:
Shallow Diffusion Motion Model for Talking Face Generation from Speech. APWeb/WAIM (2) 2022: 144-157 - [c20]Qiqi Wang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning. ICASSP 2022: 3184-3188 - [c19]Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech. ICASSP 2022: 4293-4297 - [c18]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Avqvc: One-Shot Voice Conversion By Vector Quantization With Applying Contrastive Learning. ICASSP 2022: 4613-4617 - [c17]Shijing Si, Jianzong Wang, Xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Boosting StarGANs for Voice Conversion with Contrastive Discriminator. ICONIP (2) 2022: 355-366 - [c16]Aolan Sun, Xulong Zhang, Tiandong Ling, Jianzong Wang, Ning Cheng, Jing Xiao:
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar. ICTAI 2022: 1002-1006 - [c15]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
SUSing: SU-net for Singing Voice Synthesis. IJCNN 2022: 1-7 - [c14]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MDCNN-SID: Multi-scale Dilated Convolution Network for Singer Identification. IJCNN 2022: 1-7 - [c13]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. IJCNN 2022: 1-7 - [c12]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features. IJCNN 2022: 1-7 - [c11]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSID: Singer Identification with Domain Adaptation for Metaverse. IJCNN 2022: 1-7 - [c10]Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Xulong Zhang, Jing Xiao:
Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation. INTERSPEECH 2022: 5313-5317 - [c9]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data. MSN 2022: 456-460 - [c8]Xulong Zhang, Jianzong Wang, Ning Cheng, Kexin Zhu, Jing Xiao:
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach. MSN 2022: 485-489 - [c7]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse. MSN 2022: 841-846 - [c6]Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao:
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition. MSN 2022: 915-920 - [c5]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Semi-Supervised Learning Based on Reference Model for Low-resource TTS. MSN 2022: 966-971 - [c4]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Imbalanced Text Classification with Dynamic Curriculum Learning. MSN 2022: 1031-1036 - [i20]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learning. CoRR abs/2202.10020 (2022) - [i19]Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-shot Multi-speaker Text-to-Speech. CoRR abs/2202.10712 (2022) - [i18]Qiqi Wang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning. CoRR abs/2202.10976 (2022) - [i17]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features. CoRR abs/2205.11817 (2022) - [i16]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSID: Singer Identification with Domain Adaptation for Metaverse. CoRR abs/2205.11821 (2022) - [i15]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. CoRR abs/2205.11824 (2022) - [i14]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
SUSing: SU-net for Singing Voice Synthesis. CoRR abs/2205.11841 (2022) - [i13]Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Xulong Zhang, Jing Xiao:
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation. CoRR abs/2206.13689 (2022) - [i12]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Zhen Zeng, Edward Xiao, Jing Xiao:
TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training. CoRR abs/2208.04035 (2022) - [i11]Shijing Si, Jianzong Wang, Xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Boosting Star-GANs for Voice Conversion with Contrastive Discriminator. CoRR abs/2209.10088 (2022) - [i10]Aolan Sun, Xulong Zhang, Tiandong Ling, Jianzong Wang, Ning Cheng, Jing Xiao:
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar. CoRR abs/2210.06877 (2022) - [i9]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data. CoRR abs/2210.13803 (2022) - [i8]Xulong Zhang, Jianzong Wang, Ning Cheng, Kexin Zhu, Jing Xiao:
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach. CoRR abs/2210.13805 (2022) - [i7]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse. CoRR abs/2210.13811 (2022) - [i6]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Semi-Supervised Learning Based on Reference Model for Low-resource TTS. CoRR abs/2210.14723 (2022) - [i5]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Imbalanced Text Classification with Dynamic Curriculum Learning. CoRR abs/2210.14724 (2022) - [i4]Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao:
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition. CoRR abs/2210.14725 (2022) - 2021
- [c3]Xulong Zhang, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao:
Cyclegean: Cycle Generative Enhanced Adversarial Network for Voice Conversion. ASRU 2021: 930-937 - [c2]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Zhen Zeng, Edward Xiao, Jing Xiao:
TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training. ASRU 2021: 938-945 - [c1]Xulong Zhang, Jiale Qian, Yi Yu, Yifu Sun, Wei Li:
Singer Identification Using Deep Timbre Feature Learning with KNN-NET. ICASSP 2021: 3380-3384 - [i3]Xulong Zhang, Jiale Qian, Yi Yu, Yifu Sun, Wei Li:
Singer Identification Using Deep Timbre Feature Learning with KNN-Net. CoRR abs/2102.10236 (2021) - 2020
- [i2]Xulong Zhang, Yi Yu, Xi Chen, Wei Li:
Comparison for Improvements of Singing Voice Detection System Based on Vocal Separation. CoRR abs/2004.04040 (2020) - [i1]Xulong Zhang, Yongwei Gao, Yi Yu, Wei Li:
Music Artist Classification with WaveNet Classifier for Raw Waveform Audio Data. CoRR abs/2004.04371 (2020)
2010 – 2019
- 2017
- [j2]Wei Li, Xiangyi Feng, Yiming Wu, Xulong Zhang:
流行音乐主旋律提取技术综述 (Review on Main Melody Extraction from Pop Music). 计算机科学 44(5): 1-5 (2017) - 2013
- [j1]Qingtao Wu, Xulong Zhang, Ruijuan Zheng, Mingchuan Zhang:
Probability-Symmetric Storage Allocation for Distributed Storage Systems Based on Network Coding. Int. J. Online Biomed. Eng. 9(S4): 64-68 (2013)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-26 01:52 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint