default search action

combined dblp search
author search
venue search
publication search

ask others

Jian Wu 0027

> Home > Persons

Person information

affiliation: Microsoft Corporation, USA
affiliation: Northwestern Polytechnical University, Xi'an, China
not to be confused with: Jian Wu 0031

Other persons with the same name

see FAQ

Jian Wu — disambiguation page
Jian Wu 0001 — Zhejiang University, College of Computer Science & Technology, Hangzhou, China (and 2 more)
Jian Wu 0002 — Soochow University, School of Computer Science and Technology, Suzhou, China
Jian Wu 0003 — Shanghai Maritime University, School of Economics and Management, China (and 3 more)
Jian Wu 0004 — Taiwan Semiconductor Manufacturing Company, Shanghai, China (and 1 more)
Jian Wu 0005 — Shanghai University, School of Communication and Information Engineering, China
Jian Wu 0006 — Old Dominion University, Department of Computer Science, Norfolk, VA, USA (and 1 more)
Jian Wu 0007 — University of California, Davis, Department of Computer Science, CA, USA (and 1 more)
Jian Wu 0008 — Anqing Normal University, School of Computer and Information, China (and 1 more)
Jian Wu 0009 — Washington University in St. Louis, MO, USA

Jian Wu 0010 — Beijing University of Posts and Telecommunications, State Key Laboratory of Information Photonics and Optical Communications, China
Jian Wu 0011 — Beijing University of Technology, Faculty of Information Technology, China (and 1 more)
Jian Wu 0012 — Tsinghua University, Graduate School at Shenzhen, Institute of Biomedical Engineering, China (and 1 more)
Jian Wu 0013 — Liaocheng University, School of Mechanical and Automotive Engineering, China (and 2 more)
Jian Wu 0014 — Anhui University of Technology, School of Mathematical Science and Engineering, Ma'anshan, China
Jian Wu 0015 — Harbin Institute of Technology, Department of Electrical Engineering, Harbin, China
Jian Wu 0016 — Texas A&M University, Department of Computer Science and Engineering, College Station, TX, USA (and 1 more)
Jian Wu 0017 — Shanghai University of Sport, China
Jian Wu 0018 — University of Twente, Enschede, Netherlands
Jian Wu 0019 — National University of Defense Technology, College of Electronic Science, Changsha, China
Jian Wu 0020 — Central South University of Forestry and Technology, School of Logistics and Transportation, Changsha, China
Jian Wu 0021 — Tongji University, College of Electronics and Information Engineering, Shanghai
Jian Wu 0022 — Tongji University, College of Electronics and Information Engineering, Shanghai, China (and 1 more)
Jian Wu 0023 — Harbin Institute of Technology, Department of Control Science and Engineering, China
Jian Wu 0024 — Jilin University, State Key Laboratory of Automotive Simulation and Control, Changchun, China
Jian Wu 0026 — Chinese Academy of Sciences, Science and Technology on Micro-system Laboratory, Shanghai, China
Jian Wu 0028 — University of Michigan, Department of Electrical Engineering and Computer Science, Ann Arbor, USA
Jian Wu 0029 — University of Hong Kong, HKU, Department of Computer Science, Hong Kong (and 1 more)
Jian Wu 0030 — Tsinghua University, Beijing, Tsinghua National Laboratory for Information Science and Technology, Beijing, China
Jian Wu 0031 — Microsoft, Redmond, WA, USA

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangGLZSXCZBXZCWWCL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangGLZSXCZBXZCWWCL24
He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li:
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge. ICASSP Workshops 2024: 63-64
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuKY00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuKY00024
Jian Wu, Naoyuki Kanda, Takuya Yoshioka, Rui Zhao, Zhuo Chen, Jinyu Li:
T-SOT FNT: Streaming Multi-Talker ASR with Text-Only Domain Adaptation Capability. ICASSP 2024: 11531-11535
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-03473
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-03473
He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li:
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge. CoRR abs/2401.03473 (2024)
2023
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiangSYLZDCXQWCLYB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiangSYLZDCXQWCLYB23
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR. ASRU 2023: 1-8
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WuGCZZWLLRLW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WuGCZZWLLRLW23
Jian Wu, Yashesh Gaur, Zhuo Chen, Long Zhou, Yimeng Zhu, Tianrui Wang, Jinyu Li, Shujie Liu, Bo Ren, Linquan Liu, Yu Wu:
On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration. ASRU 2023: 1-8
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenKWWWYLSE23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenKWWWYLSE23
Zhuo Chen, Naoyuki Kanda, Jian Wu, Yu Wu, Xiaofei Wang, Takuya Yoshioka, Jinyu Li, Sunit Sivasankaran, Sefik Emre Eskimez:
Speech Separation with Large-Scale Self-Supervised Learning. ICASSP 2023: 1-5
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangCKWWLYWW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangCKWWLYWW23
Zili Huang, Zhuo Chen, Naoyuki Kanda, Jian Wu, Yiming Wang, Jinyu Li, Takuya Yoshioka, Xiaofei Wang, Peidong Wang:
Self-Supervised Learning with Bi-Label Masked Speech Prediction for Streaming Multi-Talker Speech Recognition. ICASSP 2023: 1-5
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KandaWWCLY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KandaWWCLY23
Naoyuki Kanda, Jian Wu, Xiaofei Wang, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Vararray Meets T-Sot: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition. ICASSP 2023: 1-5
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SangZLHW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SangZLHW23
Mufan Sang, Yong Zhao, Gang Liu, John H. L. Hansen, Jian Wu:
Improving Transformer-Based Networks with Locality for Automatic Speaker Verification. ICASSP 2023: 1-5
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangXKYW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangXKYW23
Dongmei Wang, Xiong Xiao, Naoyuki Kanda, Takuya Yoshioka, Jian Wu:
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-To-End Neural Diarization. ICASSP 2023: 1-5
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuCHXL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuCHXL23
Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li:
Speaker Change Detection For Transformer Transducer ASR. ICASSP 2023: 1-5
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangKWWSCLY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangKWWSCLY23
Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, Sunit Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Simulating Realistic Speech Overlaps Improves Multi-Talker ASR. ICASSP 2023: 1-5
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08549
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08549
Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li:
Speaker Change Detection for Transformer Transducer ASR. CoRR abs/2302.08549 (2023)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08639
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08639
Mufan Sang, Yong Zhao, Gang Liu, John H. L. Hansen, Jian Wu:
Improving Transformer-based Networks With Locality For Automatic Speaker Verification. CoRR abs/2302.08639 (2023)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-03917
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-03917
Jian Wu, Yashesh Gaur, Zhuo Chen, Long Zhou, Yimeng Zhu, Tianrui Wang, Jinyu Li, Shujie Liu, Bo Ren, Linquan Liu, Yu Wu:
On decoder-only architecture for speech-to-text and large language model integration. CoRR abs/2307.03917 (2023)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-06327
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06327
Mohammad Soleymanpour, Mahmoud Al Ismail, Fahimeh Bahmaninezhad, Kshitiz Kumar, Jian Wu:
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss. CoRR abs/2308.06327 (2023)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08131
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08131
Jian Wu, Naoyuki Kanda, Takuya Yoshioka, Rui Zhao, Zhuo Chen, Jinyu Li:
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability. CoRR abs/2309.08131 (2023)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-13573
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13573
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR. CoRR abs/2309.13573 (2023)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-02248
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-02248
Jing Pan, Jian Wu, Yashesh Gaur, Sunit Sivasankaran, Zhuo Chen, Shujie Liu, Jinyu Li:
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning. CoRR abs/2311.02248 (2023)
2022
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ChenWCWLCLKYXWZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ChenWCWLCLKYXWZ22
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1505-1518 (2022)
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TompkinsKW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TompkinsKW22
Daniel Tompkins, Kshitiz Kumar, Jian Wu:
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, and Pretraining: an Ablation Study. ICASSP 2022: 1016-1020
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCWYWML22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCWYWML22
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li:
Continuous Speech Separation with Recurrent Selective Attention Network. ICASSP 2022: 6017-6021
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWWCCLWQWLY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWWCCLWQWLY22
Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu:
Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training. ICASSP 2022: 6152-6156
[c37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kanda0WXMWG00Y22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kanda0WXMWG00Y22
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. INTERSPEECH 2022: 521-525
[c36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chen0000WL00YW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chen0000WL00YW22
Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Zhuo Chen, Peidong Wang, Gang Liu, Jinyu Li, Jian Wu, Xiangzhan Yu, Furu Wei:
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? INTERSPEECH 2022: 3699-3703
[c35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KandaWWXMWGC0Y22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KandaWWXMWGC0Y22
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. INTERSPEECH 2022: 3774-3778
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/snams/MiaoWBCP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/snams/MiaoWBCP22
Li Miao, Jian Wu, Piyush Behre, Shuangyu Chang, Sarangarajan Parthasarathy:
Multilingual Transformer Language Model for Speech Recognition in Low-resource Languages. SNAMS 2022: 1-5
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-00842
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-00842
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. CoRR abs/2202.00842 (2022)
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-03514
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-03514
Daniel Tompkins, Kshitiz Kumar, Jian Wu:
Maximizing Audio Event Detection Model Performance on Small Datasets Through Knowledge Transfer, Data Augmentation, And Pretraining: An Ablation Study. CoRR abs/2202.03514 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16685
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16685
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. CoRR abs/2203.16685 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-12765
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-12765
Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Zhuo Chen, Peidong Wang, Gang Liu, Jinyu Li, Jian Wu, Xiangzhan Yu, Furu Wei:
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? CoRR abs/2204.12765 (2022)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-12777
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-12777
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu:
Ultra Fast Speech Separation Model with Teacher Student Learning. CoRR abs/2204.12777 (2022)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-08598
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-08598
Mostafa Karimi, Changliang Liu, Ken'ichi Kumatani, Yao Qian, Tianyu Wu, Jian Wu:
Deploying self-supervised learning in the wild for hybrid automatic speech recognition. CoRR abs/2205.08598 (2022)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-04041
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-04041
Li Miao, Jian Wu, Piyush Behre, Shuangyu Chang, Sarangarajan Parthasarathy:
Multilingual Transformer Language Model for Speech Recognition in Low-resource Languages. CoRR abs/2209.04041 (2022)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-04974
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-04974
Naoyuki Kanda, Jian Wu, Xiaofei Wang, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition. CoRR abs/2209.04974 (2022)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-11266
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-11266
Gang Liu, Tianyan Zhou, Yong Zhao, Yu Wu, Zhuo Chen, Yao Qian, Jian Wu:
The Microsoft System for VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2209.11266 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15715
Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, Sunit Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Simulating realistic speech overlaps improves multi-talker ASR. CoRR abs/2210.15715 (2022)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-05172
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-05172
Zhuo Chen, Naoyuki Kanda, Jian Wu, Yu Wu, Xiaofei Wang, Takuya Yoshioka, Jinyu Li, Sunit Sivasankaran, Sefik Emre Eskimez:
Speech separation with large-scale self-supervised learning. CoRR abs/2211.05172 (2022)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-05564
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-05564
Zili Huang, Zhuo Chen, Naoyuki Kanda, Jian Wu, Yiming Wang, Jinyu Li, Takuya Yoshioka, Xiaofei Wang, Peidong Wang:
Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition. CoRR abs/2211.05564 (2022)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-06493
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-06493
Xiaofei Wang, Zhuo Chen, Yu Shi, Jian Wu, Naoyuki Kanda, Takuya Yoshioka:
Breaking trade-offs in speech separation with sparsely-gated mixture of experts. CoRR abs/2211.06493 (2022)
2021
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KandaXWZGWMCY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KandaXWZGWMCY21
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio. ASRU 2021: 296-303
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWCW0Y00021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWCW0Y00021
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Jinyu Li, Takuya Yoshioka, Chengyi Wang, Shujie Liu, Ming Zhou:
Continuous Speech Separation with Conformer. ICASSP 2021: 5749-5753
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoK0ZYC0L0W0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoK0ZYC0L0W0021
Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020. ICASSP 2021: 5824-5828
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DasKW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DasKW21
Amit Das, Kshitiz Kumar, Jian Wu:
Multi-Dialect Speech Recognition in English Using Attention on Ensemble of Experts. ICASSP 2021: 6244-6248
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenWCWY00Y21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenWCWY00Y21
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu:
Ultra Fast Speech Separation Model with Teacher Student Learning. Interspeech 2021: 3026-3030
[c28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuCCWYK0L21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuCCWYK0L21
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li:
Investigation of Practical Aspects of Single Channel Speech Separation for ASR. Interspeech 2021: 3066-3070
[c27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FuCLJKCHXWBXDC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FuCLJKCHXWBXDC21
Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen:
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario. Interspeech 2021: 3665-3669
[c26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AfshanKW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AfshanKW21
Amber Afshan, Kshitiz Kumar, Jian Wu:
Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models. Interspeech 2021: 4084-4088
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/KongWWGZWX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/KongWWGZWX21
Yuxiang Kong, Jian Wu, Quandong Wang, Peng Gao, Weiji Zhuang, Yujun Wang, Lei Xie:
Multi-Channel Automatic Speech Recognition Using Deep Complex Unet. SLT 2021: 104-110
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ZhouZW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ZhouZW21
Tianyan Zhou, Yong Zhao, Jian Wu:
ResNeXt and Res2Net Structures for Speaker Verification. SLT 2021: 301-307
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/FuWHXX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/FuWHXX21
Yihui Fu, Jian Wu, Yanxin Hu, Mengtao Xing, Lei Xie:
DESNet: A Multi-Channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation. SLT 2021: 857-864
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/FuYHWWYZXHBMO21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/FuYHWWYZXHBMO21
Yihui Fu, Zhuoyuan Yao, Weipeng He, Jian Wu, Xiong Wang, Zhanheng Yang, Shimin Zhang, Lei Xie, Dongyan Huang, Hui Bu, Petr Motlícek, Jean-Marc Odobez:
IEEE SLT 2021 Alpha-Mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines. SLT 2021: 1101-1108
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-03634
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-03634
Jixuan Wang, Xiong Xiao, Jian Wu, Ranjani Ramamurthy, Frank Rudzicz, Michael Brudno:
Speaker attribution with voice profiles by graph-based semi-supervised learning. CoRR abs/2102.03634 (2021)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-03603
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-03603
Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen:
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario. CoRR abs/2104.03603 (2021)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-00099
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-00099
Amber Afshan, Kshitiz Kumar, Jian Wu:
Sequence-level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models. CoRR abs/2107.00099 (2021)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-01922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-01922
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li:
Investigation of Practical Aspects of Single Channel Speech Separation for ASR. CoRR abs/2107.01922 (2021)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-02852
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-02852
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka:
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio. CoRR abs/2107.02852 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05752
Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu:
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training. CoRR abs/2110.05752 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-13900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-13900
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. CoRR abs/2110.13900 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-13900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-13900
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. CoRR abs/2110.13900 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14838
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li:
Continuous Speech Separation with Recurrent Selective Attention Network. CoRR abs/2110.14838 (2021)
2020
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoZCW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoZCW20
Yong Zhao, Tianyan Zhou, Zhuo Chen, Jian Wu:
Improving Deep CNN Networks with Long Temporal Context for Text-Independent Speaker Verification. ICASSP 2020: 6834-6838
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuZWGWKLLMY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuZWGWKLLMY20
Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu:
Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset. ICASSP 2020: 6984-6988
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangXWRRB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangXWRRB20
Jixuan Wang, Xiong Xiao, Jian Wu, Ranjani Ramamurthy, Frank Rudzicz, Michael Brudno:
Speaker Diarization with Session-Level Speaker Embedding Refinement Using Graph Neural Networks. ICASSP 2020: 7109-7113
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenYLZMLWXL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenYLZMLWXL20
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li:
Continuous Speech Separation: Dataset and Analysis. ICASSP 2020: 7284-7288
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SharmaYWZTWHLG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SharmaYWZTWHLG20
Eva Sharma, Guoli Ye, Wenning Wei, Rui Zhao, Yao Tian, Jian Wu, Lei He, Ed Lin, Yifan Gong:
Adaptation of RNN Transducer with Text-To-Speech Technology for Keyword Spotting. ICASSP 2020: 7484-7488
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuCLYTLLX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuCLYTLLX20
Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie:
An End-to-End Architecture of Online Multi-Channel Speech Separation. INTERSPEECH 2020: 81-85
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangXWRRB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangXWRRB20
Jixuan Wang, Xiong Xiao, Jian Wu, Ranjani Ramamurthy, Frank Rudzicz, Michael Brudno:
Speaker Attribution with Voice Profiles by Graph-Based Semi-Supervised Learning. INTERSPEECH 2020: 289-293
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KumarSKW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarSKW20
Kshitiz Kumar, Emilian Stoimenov, Hosam Khalil, Jian Wu:
Fast and Slow Acoustic Model. INTERSPEECH 2020: 541-545
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuXWY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuXWY20
Haohe Liu, Lei Xie, Jian Wu, Geng Yang:
Channel-Wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music. INTERSPEECH 2020: 1241-1245
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KumarRGW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarRGW20
Kshitiz Kumar, Bo Ren, Yifan Gong, Jian Wu:
Bandpass Noise Generation and Augmentation for Unified ASR. INTERSPEECH 2020: 1683-1687
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KumarLGW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumarLGW20
Kshitiz Kumar, Chaojun Liu, Yifan Gong, Jian Wu:
1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM. INTERSPEECH 2020: 2107-2111
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuLLXZFWZX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuLLXZFWZX20
Yanxin Hu, Yun Liu, Shubo Lv, Mengtao Xing, Shimin Zhang, Yihui Fu, Jian Wu, Bihong Zhang, Lei Xie:
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement. INTERSPEECH 2020: 2472-2476
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangWX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangWX20
Li Zhang, Jian Wu, Lei Xie:
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge. INTERSPEECH 2020: 3471-3475
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-01656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-01656
Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu:
Audio-visual Recognition of Overlapped speech for the LRS2 dataset. CoRR abs/2001.01656 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-11482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-11482
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li:
Continuous speech separation: dataset and analysis. CoRR abs/2001.11482 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-11371
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-11371
Jixuan Wang, Xiong Xiao, Jian Wu, Ranjani Ramamurthy, Frank Rudzicz, Michael Brudno:
Speaker diarization with session-level speaker embedding refinement using graph neural networks. CoRR abs/2005.11371 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-02480
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-02480
Tianyan Zhou, Yong Zhao, Jian Wu:
ResNeXt and Res2Net Structure for Speaker Verification. CoRR abs/2007.02480 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-00264
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-00264
Yanxin Hu, Yun Liu, Shubo Lv, Mengtao Xing, Shimin Zhang, Yihui Fu, Jian Wu, Bihong Zhang, Lei Xie:
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement. CoRR abs/2008.00264 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-03521
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-03521
Li Zhang, Jian Wu, Lei Xie:
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge. CoRR abs/2008.03521 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-05216
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-05216
Haohe Liu, Lei Xie, Jian Wu, Geng Yang:
Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music. CoRR abs/2008.05216 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-03141
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-03141
Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie:
An End-to-end Architecture of Online Multi-channel Speech Separation. CoRR abs/2009.03141 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11458
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11458
Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020. CoRR abs/2010.11458 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02131
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02131
Yihui Fu, Jian Wu, Yanxin Hu, Mengtao Xing, Lei Xie:
DESNet: A Multi-channel Network for Simultaneous Speech Dereverberation, Enhancement and Separation. CoRR abs/2011.02131 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02198
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02198
Yihui Fu, Zhuoyuan Yao, Weipeng He, Jian Wu, Xiong Wang, Zhanheng Yang, Shimin Zhang, Lei Xie, Dongyan Huang, Hui Bu, Petr Motlícek, Jean-Marc Odobez:
IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines. CoRR abs/2011.02198 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-09081
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-09081
Yuxiang Kong, Jian Wu, Quandong Wang, Peng Gao, Weiji Zhuang, Yujun Wang, Lei Xie:
Multi-Channel Automatic Speech Recognition Using Deep Complex Unet. CoRR abs/2011.09081 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WuXZCYXY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WuXZCYXY19
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Time Domain Audio Visual Speech Separation. ASRU 2019: 667-673
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhouZLGW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhouZLGW19
Tianyan Zhou, Yong Zhao, Jinyu Li, Yifan Gong, Jian Wu:
CNN with Phonetic Attention for Text-Independent Speaker Verification. ASRU 2019: 718-725
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuXZCYX019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuXZCYX019
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Improved Speaker-Dependent Separation for CHiME-5 Challenge. INTERSPEECH 2019: 466-470
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BahmaninezhadWG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BahmaninezhadWG19
Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu:
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation. INTERSPEECH 2019: 4574-4578
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-03760
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-03760
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Time Domain Audio Visual Speech Separation. CoRR abs/1904.03760 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-03792
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-03792
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Improved Speaker-Dependent Separation for CHiME-5 Challenge. CoRR abs/1904.03792 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-06286
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-06286
Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
End-to-End Multi-Channel Speech Separation. CoRR abs/1905.06286 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-07497
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-07497
Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu:
A comprehensive study of speech separation: spectrogram vs waveform separation. CoRR abs/1905.07497 (2019)

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuDLWGA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuDLWGA09
Dong Yu, Li Deng, Peng Liu, Jian Wu, Yifan Gong, Alex Acero:
Cross-lingual speech recognition under runtime resource constraints. ICASSP 2009: 4193-4196
2008
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YuDDWGA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YuDDWGA08
Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero:
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor. IEEE Trans. Speech Audio Process. 16(5): 1061-1070 (2008)
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuDDWGA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuDDWGA08
Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero:
A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition. ICASSP 2008: 4041-4044
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiDYWGA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiDYWGA08
Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong, Alex Acero:
Adaptation of compressed HMM parameters for resource-constrained speech recognition. ICASSP 2008: 4333-4336
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/YuDWGA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/YuDWGA08
Dong Yu, Li Deng, Jian Wu, Yifan Gong, Alex Acero:
Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition. ISCSLP 2008: 69-72
2005
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/DengWDA05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/DengWDA05
Li Deng, Jian Wu, Jasha Droppo, Alex Acero:
Analysis and comparison of two speech feature extraction/compensation algorithms. IEEE Signal Process. Lett. 12(6): 477-480 (2005)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.