default search action
Xin Wang 0037
Person information
- affiliation: Graduate University for Advanced Studies (SOKENDAI), National Institute of Informatics, Department of Informatics, Tokyo, Japan
Other persons with the same name
- Xin Wang — disambiguation page
- Xin Wang 0001 — Stony Brook University, State University of New York at Stony Brook, Department of Electrical and Computer Engineering, NY, USA (and 2 more)
- Xin Wang 0002 — Fudan University, School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, China (and 1 more)
- Xin Wang 0003 — Fudan University, Department of Communication Science and Engineering, State Key Laboratory of ASIC and System, Shanghai, China (and 3 more)
- Xin Wang 0004 — Northwest University, School of Information Science and Technology, Xi'an, China (and 2 more)
- Xin Wang 0005 — University of California, Santa Cruz, CA, USA
- Xin Wang 0006 (aka: Xin (Cindy) Wang) — University of Birmingham, UK
- Xin Wang 0007 — University of Alberta, Edmonton, AL, Canada
- Xin Wang 0008 — City University of Hong Kong, Department of Biomedical Sciences, Hong Kong, China (and 5 more)
- Xin Wang 0009 — Xerox (and 1 more)
- Xin Wang 0010 — Intelius (and 2 more)
- Xin Wang 0011 — Beihang University, School of Electronics and Information Engineering, Beijing, China
- Xin Wang 0012 — Shanghai Jiaotong University, Center of Electrical and Electronic Technology, China (and 2 more)
- Xin Wang 0013 — Arrowsight (and 2 more)
- Xin Wang 0014 — University of Southampton, UK
- Xin Wang 0015 — Zhejiang University, Department of Earth Sciences, Hangzhou, China
- Xin Wang 0016 — Hong Kong University of Science and Technology, Department of Industrial Engineering & Logistics Management (and 1 more)
- Xin Wang 0017 — Baidu Research, Cognitive Computing Lab (CCL), Beijing, China (and 1 more)
- Xin Wang 0018 — University of Tennessee at Knoxville, TN, USA (and 1 more)
- Xin Wang 0019 — Tsinghua University, Department of Computer Science and Technology, China (and 2 more)
- Xin Wang 0020 — Nankai University, College of Environmental Science and Engineering, Tianjin, China
- Xin Wang 0021 — The Johns Hopkins School of Medicine, Russell H. Morgan Department of Radiology and Radiological Science, Baltimore, MD, USA
- Xin Wang 0022 — Hong Kong University of Science and Technology, Thrust of Artificial Intelligence, Information Hub, Guangzhou, China (and 3 more)
- Xin Wang 0023 — University of Connecticut, Storrs, CT, USA
- Xin Wang 0024 — Norwegian University of Science and Technology, Department of Industrial Economics and Technology Management, Trondheim, Norway (and 1 more)
- Xin Wang 0025 — Monash University Malaysia, School of Engineering, Bandar Sunway, Malaysia
- Xin Wang 0026 — Air Force Engineering University, Information and Navigation College, Xi'an, China
- Xin Wang 0027 — Southwest University, College of Electronic and Information Engineering, Chongqing (and 1 more)
- Xin Wang 0028 — Southwest University, College of Electronic and Information Engineering, Chongqing, China (and 2 more)
- Xin Wang 0029 — Hunan University, College of Computer Science and Electronics Engineering, Changsha, China
- Xin Wang 0030 — Tianjin University, School of Computer Science and Technology, China
- Xin Wang 0031 — OmniVision Technologies Inc., San Jose, CA, USA (and 2 more)
- Xin Wang 0032 — Nanjing University, Department of Geographic Information Science, China
- Xin Wang 0033 — Nantong University, School of Electronic and Information Engineering, China
- Xin Wang 0034 — Shanghai Dianji University, School of Electronic Information Engineering, China
- Xin Wang 0035 — Jilin University (JLU), School of Artificial Intelligence, Changchun, China (and 4 more)
- Xin Wang 0036 (aka: Xin Tony Wang) — Tongji University, Department of Computer Science and Technology, Shanghai, China (and 1 more)
- Xin Wang 0038 — Dalian University of Technology, School of Software, China
- Xin Wang 0039 — Southern Illinois University, Department of Electrical and Computer Engineering, Edwardsville, IL, USA (and 2 more)
- Xin Wang 0040 — National University of Singapore, School of Computing, Singapore (and 1 more)
- Xin Wang 0041 — Shenzhen Academy of Aerospace Technology, China
- Xin Wang 0042 — Iowa State University, Department of Statistics, Ames, IA, USA
- Xin Wang 0043 — Heilongjiang University, Electronic Engineering College, Harbin, China
- Xin Wang 0044 — Qilu University of Technology, China (and 1 more)
- Xin Wang 0045 — State University of New York, Albany, College of Integrated Health Sciences, Department of Epidemiology and Biostatistics, NY, USA (and 2 more)
- Xin Wang 0046 — Beijing Jiaotong University, Institute of Information Science, Beijing Key Laboratory of Advanced Information Science and Network, China
- Xin Wang 0047 — Chinese Academy of Sciences, Northwest Institute of Eco-Environment and Resources, Key Laboratory of Land Surface Process and Climate Change in Cold and Arid Regions, Lanzhou, China (and 1 more)
- Xin Wang 0048 — Heilongjiang University, Heilongjiang Provincial Key Laboratory of the Theory and Computation of Complex Systems, Harbin, China (and 1 more)
- Xin Wang 0049 — Taizhou University, School of Mechanical Engineering, China (and 1 more)
- Xin Wang 0050 — Jilin University, College of Communication Engineering, Changchun, China
- Xin Wang 0051 — Chongqing University, College of Mechanical Engineering, State Key Laboratory of Mechanical Transmission, China
- Xin Wang 0052 — Beijing University of Technology, Beijing University of Technology, China
- Xin Wang 0053 — Pierre and Marie Curie University, Paris, France
- Xin Wang 0054 — Delft University of Technology, The Netherlands
- Xin Wang 0055 — Massachusetts Institute of Technology, Cambridge, MA, USA
- Xin Wang 0056 — Virginia Commonwealth University, Department of Electrical and Computer Engineering, Richmond, VA, USA
- Xin Wang 0057 — Jilin University, School of Transportation, Changchun, China
- Xin Wang 0058 — Shaanxi Normal University, School of Computer Science, Xian, China (and 2 more)
- Xin Wang 0059 — Fudan University, Department of Electronic Engineering, Shanghai, China
- Xin Wang 0060 — Zhejiang University, College of Computer Science and Technology, China
- Xin Wang 0061 (aka: Xin Eric Wang) — University of California, Santa Cruz, CA, USA (and 1 more)
- Xin Wang 0063 — Northwestern Polytechnical University, School of Astronautics, Xi'an, China
- Xin Wang 0064 — Southwest Petroleum University, Chengdu, China (and 2 more)
- Xin Wang 0065 — Soochow University, Department of Mathematics, Suzhou, China (and 2 more)
- Xin Wang 0066 — Microsoft Research, Redmond, WA, USA (and 1 more)
- Xin Wang 0067 — Harvard University, School of Public Health, Boston, MA, USA (and 1 more)
- Xin Wang 0068 — Hohai University, College of Computer and Information, Nanjing, China
- Xin Wang 0069 — Carleton University, Department of Mechanical and Aerospace Engineering, Ottawa, ON, Canada (and 1 more)
- Xin Wang 0070 — Hefei University of Technology, School of Computer and Information, China
- Xin Wang 0071 — Qualcomm Inc., San Jose, CA, USA (and 1 more)
- Xin Wang 0072 — Peking University, School of EECS, Key Laboratory of Machine Perception, Bejing, China
- Xin Wang 0073 — DOCOMO Beijing Communications Laboratories Co., Ltd., China
- Xin Wang 0074 — Dalian Maritime University, College of Science, China
- Xin Wang 0075 — Chongqing University, State Key Laboratory of Power Transmission Equipment and System Security and New Technology, China
- Xin Wang 0076 — National University of Defense Technology, School of Computer, Changsha, China
- Xin Wang 0077 — Dalian University of Technology, Dalian University of Technology, China
- Xin Wang 0078 — Changsha University of Science and Technology, School of Computer and Communication Engineering, China
- Xin Wang 0079 — Nanjing University of Aeronautics and Astronautics, College of Electronic and Information Engineering, China (and 1 more)
- Xin Wang 0080 — Beijing University of Posts and Telecommunications, State Key Laboratory of Information Photonics and Optical Communications, China
- Xin Wang 0081 — Dalian Maritime University, College of Navigation, China
- Xin Wang 0082 — Nanjing University of Posts and Telecommunications, Department of Communication Engineering, China (and 1 more)
- Xin Wang 0083 — Nanjing University of Information Science and Technology, Jiangsu Province Atmospheric Environment and Equipment Technology Collaborative Innovation Center, China
- Xin Wang 0084 — Shanghai University, School of Science, Department of Mathematics, China (and 1 more)
- Xin Wang 0085 — Chinese University of Hong Kong, Department of Biomedical Engineering, Hong Kong
- Xin Wang 0086 — Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China
- Xin Wang 0087 — Shanghai International Studies University, School of Business and Management, China (and 1 more)
- Xin Wang 0088 — Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems, China (and 1 more)
- Xin Wang 0089 — Inner Monglia University, College of Computer Science, Hohhot, China
- Xin Wang 0090 — University of Toledo, Department of Psychiatry, OH, USA (and 1 more)
- Xin Wang 0091 — Beijing Jiaotong University, School of Electronic and Information Engineering, China
- Xin Wang 0092 — Chinese Academy of Sciences, National Time Service Center, Xi'an, China (and 2 more)
- Xin Wang 0093 — Jiangsu Vocational Institute of Architectural Technology, Xuzhou, China (and 1 more)
- Xin Wang 0094 — Airdoc Co., Ltd, Beijing, China
- Xin Wang 0095 — China Institute of Marine Technology and Economy, CIMTEC, Beijing, China
- Xin Wang 0096 — University of Hannover, Institute of Photogrammetry and GeoInformation, Germany (and 1 more)
- Xin Wang 0097 — China University of Mining and Technology, School of Electrical and Power Engineering, Xuzhou, China
- Xin Wang 0098 — Changchun University of Science and Technology, School of Science, China
- Xin Wang 0099 — Liaoning Normal University, Department of Computing and Information Technology, Dalian City, China
- Xin Wang 0100 — Northwestern Polytechnical University, School of Marine Science and Technology, Xi'an, China
- Xin Wang 0101 — National University of Singapore, Department of Industrial Systems Engineering and Management, Singapore
- Xin Wang 0102 — North China University of Science and Technology, College of Artificial Intelligence, Tangshan, China
- Xin Wang 0103 — East China Jiaotong University, School of Information Engineering, Nanchang, China
- Xin Wang 0104 — Northwest University, Department of Computer Science, Xi'an, China
- Xin Wang 0105 — Xiamen University, Department of Electronic Science, Biomedical Intelligent Cloud Research and Development Center, China
- Xin Wang 0106 — Hunan Cancer Hospital (Affiliated Cancer Hospital of Xiangya School of Medicine, Central South University), Changsha, China
- Xin Wang 0107 — Harbin Institute of Technology, School of Mechanical Engineering and Automation, Shenzhen, China (and 1 more)
- Xin Wang 0108 — Shanghai Jiao Tong University, John Hopcroft Center / AI Institute, China
- Xin Wang 0109 — Tianjin University, School of Electrical and Information Engineering, China
- Xin Wang 0110 — Jilin University, College of Computer Science and Technology, Changchun, China (and 2 more)
- Xin Wang 0111 — National University of Defense Technology, College of Computer Science, Science and Technology on Parallel and Distributed Processing Laboratory, Changsha, China
- Xin Wang 0112 — Hunan University of Technology, School of Electrical and Information Engineering, Zhuzhou, China (and 1 more)
- Xin Wang 0113 — University of Washington, Department of Electrical and Computer Engineering, Seattle, WA, USA (and 2 more)
- Xin Wang 0114 — Wuhan University, School of Computer Science, Wuhan, China (and 1 more)
- Xin Wang 0115 — Illinois Institute of Technology, Department of Computer Science, Chicago, IL, USA
- Xin Wang 0116 — Google Research, Mountain View, CA, USA (and 1 more)
- Xin Wang 0117 — Tsinghua University, Department of Computer Science and Technology, Centre for Computational Mental Healthcare, Beijing, China
- Xin Wang 0118 — Hong Kong Polytechnic University, Hong Kong
- Xin Wang 0119 — Fudan University, School of Computer Science, Shanghai, China (and 1 more)
- Xin Wang 0120 — Ohio State University, Columbus, OH, USA
- Xin Wang 0121 — Netherlands Cancer Institute (NKI), Department of Radiology, Amsterdam, The Netherlands (and 2 more)
- Xin Wang 0122 — University of Maryland Baltimore County, Department of Information Systems, Baltimore, MD, USA
- Xin Wang 0123 — Rutgers University, Department of Electrical and Computer Engineering, CAIP Center, Piscataway, NJ, USA
- Xin Wang 0124 — Harbin Institute of Technology, Faculty of Computing, School of Computer Science and Technology, Department of Computer Science and Engineering, Harbin, China (and 1 more)
- Xin Wang 0125 — Shanghai Jiao Tong University, School of Biomedical Engineering, Shanghai, China
- Xin Wang 0126 — Xi'an Research Institute of High Technology, Xi'an, China
- Xin Wang 0127 (aka: Xin A. Wang) — University of Texas MD Anderson Cancer Center, Department of Radiation Physics, Houston, TX, USA (and 1 more)
- Xin Wang 0128 — School of Computer Science, Wuhan University, Wuhan, China
- Xin Wang 0129 — Northwestern Polytechnical University, School of Civil Aviation, School of Aeronautics, Xi'an, China
- Xin Wang 0130 — Ghent University, IDLab Design Group, Gent, Belgium (and 1 more)
- Xin Wang 0131 — Donghua University, College of Textiles, JD AI Research, Shanghai, China
- Xin Wang 0132 — Sichuan Changhong Electric Co. Ltd, Mianyang, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j21]Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi:
Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances. Comput. Speech Lang. 86: 101619 (2024) - [j20]Mengxiao Zhu, Xin Wang, Xiantao Wang, Zihang Chen, Wei Huang:
Application of Prompt Learning Models in Identifying the Collaborative Problem Solving Skills in an Online Task. Proc. ACM Hum. Comput. Interact. 8(CSCW2): 1-23 (2024) - [j19]Michele Panariello, Natalia A. Tomashenko, Xin Wang, Xiaoxiao Miao, Pierre Champion, Hubert Nourtel, Massimiliano Todisco, Nicholas W. D. Evans, Emmanuel Vincent, Junichi Yamagishi:
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3477-3491 (2024) - [j18]Cheng Gong, Xin Wang, Erica Cooper, Dan Wells, Longbiao Wang, Jianwu Dang, Korin Richmond, Junichi Yamagishi:
ZMM-TTS: Zero-Shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-Supervised Discrete Speech Representations. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4036-4051 (2024) - [c71]Xin Wang, Junichi Yamagishi:
Can Large-Scale Vocoded Spoofed Data Improve Speech Spoofing Countermeasure with a Self-Supervised Front End? ICASSP 2024: 10311-10315 - [c70]Lauri Juvela, Xin Wang:
Collaborative Watermarking for Adversarial Speech Synthesis. ICASSP 2024: 11231-11235 - [c69]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Nicholas W. D. Evans, Massimiliano Todisco, Jean-François Bonastre, Mickael Rouvier:
Synvox2: Towards A Privacy-Friendly Voxceleb2 Dataset. ICASSP 2024: 11421-11425 - [c68]Wanying Ge, Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Nicholas W. D. Evans:
Spoofing Attack Augmentation: Can Differently-Trained Attack Models Improve Generalisation? ICASSP 2024: 12531-12535 - [i81]Natalia A. Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas W. D. Evans, Junichi Yamagishi, Massimiliano Todisco:
The VoicePrivacy 2024 Challenge Evaluation Plan. CoRR abs/2404.02677 (2024) - [i80]Jee-weon Jung, Xin Wang, Nicholas W. D. Evans, Shinji Watanabe, Hye-jin Shim, Hemlata Tak, Sidhhant Arora, Junichi Yamagishi, Joon Son Chung:
To what extent can ASV systems naturally defend against spoofing attacks? CoRR abs/2406.05339 (2024) - [i79]Lin Zhang, Xin Wang, Erica Cooper, Mireia Díez, Federico Landini, Nicholas W. D. Evans, Junichi Yamagishi:
Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. CoRR abs/2406.07816 (2024) - [i78]Cheng Gong, Erica Cooper, Xin Wang, Chunyu Qiang, Mengzhe Geng, Dan Wells, Longbiao Wang, Jianwu Dang, Marc Tessier, Aidan Pine, Korin Richmond, Junichi Yamagishi:
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios. CoRR abs/2406.08911 (2024) - [i77]Xin Wang, Tomi Kinnunen, Kong Aik Lee, Paul-Gauthier Noé, Junichi Yamagishi:
Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis. CoRR abs/2406.10836 (2024) - [i76]Xiaoxiao Miao, Ruijie Tao, Chang Zeng, Xin Wang:
A Benchmark for Multi-speaker Anonymization. CoRR abs/2407.05608 (2024) - [i75]Mengxiao Zhu, Xin Wang, Xiantao Wang, Zihang Chen, Wei Huang:
Application of Prompt Learning Models in Identifying the Collaborative Problem Solving Skills in an Online Task. CoRR abs/2407.12487 (2024) - [i74]Xiaoxiao Miao, Yuxiang Zhang, Xin Wang, Natalia A. Tomashenko, Donny Cheng Lock Soh, Ian McLoughlin:
Adapting General Disentanglement-Based Speaker Anonymization for Enhanced Emotion Preservation. CoRR abs/2408.05928 (2024) - [i73]Xin Wang, Héctor Delgado, Hemlata Tak, Jee-weon Jung, Hye-jin Shim, Massimiliano Todisco, Ivan Kukanov, Xuechen Liu, Md. Sahidullah, Tomi Kinnunen, Nicholas W. D. Evans, Kong Aik Lee, Junichi Yamagishi:
ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale. CoRR abs/2408.08739 (2024) - [i72]Massimiliano Todisco, Michele Panariello, Xin Wang, Héctor Delgado, Kong Aik Lee, Nicholas W. D. Evans:
Malacopula: adversarial automatic speaker verification attacks using a neural-based generalised Hammerstein model. CoRR abs/2408.09300 (2024) - [i71]Xuechen Liu, Xin Wang, Junichi Yamagishi:
A Preliminary Case Study on Long-Form In-the-Wild Audio Spoofing Detection. CoRR abs/2408.14066 (2024) - [i70]Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi:
Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches. CoRR abs/2409.06327 (2024) - [i69]Jee-weon Jung, Wangyou Zhang, Soumi Maiti, Yihan Wu, Xin Wang, Ji-Hoon Kim, Yuta Matsunaga, Seyun Um, Jinchuan Tian, Hye-jin Shim, Nicholas W. D. Evans, Joon Son Chung, Shinnosuke Takamichi, Shinji Watanabe:
Text-To-Speech Synthesis In The Wild. CoRR abs/2409.08711 (2024) - [i68]Lauri Juvela, Xin Wang:
Audio Codec Augmentation for Robust Collaborative Watermarking of Speech Synthesis. CoRR abs/2409.13382 (2024) - [i67]Jee-weon Jung, Yihan Wu, Xin Wang, Ji-Hoon Kim, Soumi Maiti, Yuta Matsunaga, Hye-jin Shim, Jinchuan Tian, Nicholas W. D. Evans, Joon Son Chung, Wangyou Zhang, Seyun Um, Shinnosuke Takamichi, Shinji Watanabe:
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild. CoRR abs/2409.17285 (2024) - 2023
- [j17]Shi Cheng, Jun Du, Shutong Niu, Alejandrina Cristià, Xin Wang, Qing Wang, Chin-Hui Lee:
Using iterative adaptation and dynamic mask for child speech extraction under real-world multilingual conditions. Speech Commun. 152: 102956 (2023) - [j16]Lin Zhang, Xin Wang, Erica Cooper, Nicholas W. D. Evans, Junichi Yamagishi:
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance. IEEE ACM Trans. Audio Speech Lang. Process. 31: 813-825 (2023) - [j15]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2507-2522 (2023) - [j14]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Speaker Anonymization Using Orthogonal Householder Neural Network. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3681-3695 (2023) - [c67]Mateusz Dubiel, Minoru Nakayama, Xin Wang:
Modelling Attention Levels with Ocular Responses in a Speech-in-Noise Recall Task. ETRA 2023: 89:1-89:7 - [c66]Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf:
Hiding Speaker's Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline. ICASSP 2023: 1-5 - [c65]Xuan Shi, Erica Cooper, Xin Wang, Junichi Yamagishi, Shrikanth Narayanan:
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural Midi-to-Audio Synthesis Systems? ICASSP 2023: 1-5 - [c64]Xin Wang, Junichi Yamagishi:
Spoofed Training Data for Speech Spoofing Countermeasure Can Be Efficiently Created Using Neural Vocoders. ICASSP 2023: 1-5 - [c63]Chang Zeng, Xin Wang, Xiaoxiao Miao, Erica Cooper, Junichi Yamagishi:
Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms. INTERSPEECH 2023: 1998-2002 - [c62]Lin Zhang, Xin Wang, Erica Cooper, Nicholas W. D. Evans, Junichi Yamagishi:
Range-Based Equal Error Rate for Spoof Localization. INTERSPEECH 2023: 3212-3216 - [c61]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings. INTERSPEECH 2023: 3989-3993 - [i66]Lin Zhang, Xin Wang, Erica Cooper, Nicholas W. D. Evans, Junichi Yamagishi:
Range-Based Equal Error Rate for Spoof Localization. CoRR abs/2305.17739 (2023) - [i65]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Language-independent speaker anonymization using orthogonal Householder neural network. CoRR abs/2305.18823 (2023) - [i64]Sung Hwan Mun, Hye-jin Shim, Hemlata Tak, Xin Wang, Xuechen Liu, Md. Sahidullah, Myeonghun Jeong, Min Hyun Han, Massimiliano Todisco, Kong Aik Lee, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Nam Soo Kim, Jee-weon Jung:
Towards single integrated spoofing-aware speaker verification embeddings. CoRR abs/2305.19051 (2023) - [i63]Xin Wang, Junichi Yamagishi:
Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end? CoRR abs/2309.06014 (2023) - [i62]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Nicholas W. D. Evans, Massimiliano Todisco, Jean-François Bonastre, Mickael Rouvier:
SynVox2: Towards a privacy-friendly VoxCeleb2 dataset. CoRR abs/2309.06141 (2023) - [i61]Nicolas Jonason, Xin Wang, Erica Cooper, Lauri Juvela, Bob L. T. Sturm, Junichi Yamagishi:
DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input. CoRR abs/2309.07658 (2023) - [i60]Wanying Ge, Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Nicholas W. D. Evans:
Spoofing attack augmentation: can differently-trained attack models improve generalisation? CoRR abs/2309.09586 (2023) - [i59]Lauri Juvela, Xin Wang:
Collaborative Watermarking for Adversarial Speech Synthesis. CoRR abs/2309.15224 (2023) - [i58]Xuechen Liu, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi:
Speaker-Text Retrieval via Contrastive Learning. CoRR abs/2312.06055 (2023) - [i57]Cheng Gong, Xin Wang, Erica Cooper, Dan Wells, Longbiao Wang, Jianwu Dang, Korin Richmond, Junichi Yamagishi:
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations. CoRR abs/2312.14398 (2023) - 2022
- [j13]Natalia A. Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier Noé, Andreas Nautsch, Nicholas W. D. Evans, Junichi Yamagishi, Benjamin O'Brien, Anaïs Chanclu, Jean-François Bonastre, Massimiliano Todisco, Mohamed Maouche:
The VoicePrivacy 2020 Challenge: Results and findings. Comput. Speech Lang. 74: 101362 (2022) - [j12]Brij Mohan Lal Srivastava, Mohamed Maouche, Md. Sahidullah, Emmanuel Vincent, Aurélien Bellet, Marc Tommasi, Natalia A. Tomashenko, Xin Wang, Junichi Yamagishi:
Privacy and Utility of X-Vector Based Speaker Anonymization. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2383-2395 (2022) - [c60]Xin Wang, Junichi Yamagishi:
Estimating the Confidence of Speech Spoofing Countermeasure. ICASSP 2022: 6372-6376 - [c59]Chang Zeng, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi:
Attention Back-End for Automatic Speaker Verification with Multiple Enrollment Utterances. ICASSP 2022: 6717-6721 - [c58]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions. INTERSPEECH 2022: 4426-4430 - [c57]Xin Wang, Junichi Yamagishi:
Investigating Self-Supervised Front Ends for Speech Spoofing Countermeasures. Odyssey 2022: 100-106 - [c56]Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas W. D. Evans:
Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0 and Data Augmentation. Odyssey 2022: 112-119 - [c55]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models. Odyssey 2022: 279-286 - [c54]Xin Wang, Junichi Yamagishi:
Investigating Active-Learning-Based Training Data Selection for Speech Spoofing Countermeasure. SLT 2022: 585-592 - [i56]Xin Wang, Junichi Yamagishi:
A Practical Guide to Logical Access Voice Presentation Attack Detection. CoRR abs/2201.03321 (2022) - [i55]Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas W. D. Evans:
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation. CoRR abs/2202.12233 (2022) - [i54]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models. CoRR abs/2202.13097 (2022) - [i53]Natalia A. Tomashenko, Xin Wang, Xiaoxiao Miao, Hubert Nourtel, Pierre Champion, Massimiliano Todisco, Emmanuel Vincent, Nicholas W. D. Evans, Junichi Yamagishi, Jean-François Bonastre:
The VoicePrivacy 2022 Challenge Evaluation Plan. CoRR abs/2203.12468 (2022) - [i52]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions. CoRR abs/2203.14834 (2022) - [i51]Lin Zhang, Xin Wang, Erica Cooper, Nicholas W. D. Evans, Junichi Yamagishi:
The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance. CoRR abs/2204.05177 (2022) - [i50]Natalia A. Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco:
The VoicePrivacy 2020 Challenge Evaluation Plan. CoRR abs/2205.07123 (2022) - [i49]Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi:
Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances. CoRR abs/2209.00485 (2022) - [i48]Xuechen Liu, Xin Wang, Md. Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas W. D. Evans, Andreas Nautsch, Kong Aik Lee:
ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild. CoRR abs/2210.02437 (2022) - [i47]Xin Wang, Junichi Yamagishi:
Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders. CoRR abs/2210.10570 (2022) - [i46]Xuan Shi, Erica Cooper, Xin Wang, Junichi Yamagishi, Shrikanth Narayanan:
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems? CoRR abs/2211.13868 (2022) - [i45]Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf:
Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline. CoRR abs/2211.16065 (2022) - 2021
- [j11]Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis. Comput. Speech Lang. 67: 101183 (2021) - [j10]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi H. Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech. IEEE Trans. Biom. Behav. Identity Sci. 3(2): 252-265 (2021) - [c53]Canasai Kruengkrai, Junichi Yamagishi, Xin Wang:
A Multi-Level Attention Model for Evidence-Based Fact Checking. ACL/IJCNLP (Findings) 2021: 2447-2460 - [c52]Mateusz Dubiel, Minoru Nakayama, Xin Wang:
Evaluating Synthetic Speech Workload with Oculo-motor Indices: Preliminary Observations for Japanese Speech. BIOSIGNALS 2021: 335-342 - [c51]Mateusz Dubiel, Minoru Nakayama, Xin Wang:
Combining Oculo-motor Indices to Measure Cognitive Load of Synthetic Speech in Noisy Listening Conditions. ETRA Short Papers 2021: 27:1-27:6 - [c50]Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
End-to-End Text-to-Speech Using Latent Duration Based on VQ-VAE. ICASSP 2021: 5694-5698 - [c49]Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Junichi Yamagishi:
How Similar or Different is Rakugo Speech Synthesizer to Professional Performers? ICASSP 2021: 6488-6492 - [c48]Xin Wang, Junichi Yamagishi:
A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection. Interspeech 2021: 4259-4263 - [c47]Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi, Jose Patino, Nicholas W. D. Evans:
An Initial Investigation for Detecting Partially Spoofed Audio. Interspeech 2021: 4264-4268 - [c46]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. Interspeech 2021: 4299-4303 - [c45]Yang Ai, Haoyu Li, Xin Wang, Junichi Yamagishi, Zhen-Hua Ling:
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation. SLT 2021: 477-484 - [c44]Erica Cooper, Xin Wang, Junichi Yamagishi:
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis. SSW 2021: 130-135 - [i44]Andreas Nautsch, Xin Wang, Nicholas W. D. Evans, Tomi Kinnunen, Ville Vestman, Massimiliano Todisco, Héctor Delgado, Md. Sahidullah, Junichi Yamagishi, Kong Aik Lee:
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech. CoRR abs/2102.05889 (2021) - [i43]Chang Zeng, Xin Wang, Erica Cooper, Junichi Yamagishi:
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances. CoRR abs/2104.01541 (2021) - [i42]Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi, Jose Patino, Nicholas W. D. Evans:
An Initial Investigation for Detecting Partially Spoofed Audio. CoRR abs/2104.02518 (2021) - [i41]Erica Cooper, Xin Wang, Junichi Yamagishi:
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis. CoRR abs/2104.12292 (2021) - [i40]Canasai Kruengkrai, Junichi Yamagishi, Xin Wang:
A Multi-Level Attention Model for Evidence-Based Fact Checking. CoRR abs/2106.00950 (2021) - [i39]Tomi Kinnunen, Andreas Nautsch, Md. Sahidullah, Nicholas W. D. Evans, Xin Wang, Massimiliano Todisco, Héctor Delgado, Junichi Yamagishi, Kong Aik Lee:
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. CoRR abs/2106.06362 (2021) - [i38]Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi:
Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection. CoRR abs/2107.14132 (2021) - [i37]Jean-François Bonastre, Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noé, Jose Patino, Md. Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia A. Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi:
Benchmarking and challenges in security and privacy for voice biometrics. CoRR abs/2109.00281 (2021) - [i36]Héctor Delgado, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Jose Patino, Md. Sahidullah, Massimiliano Todisco, Xin Wang, Junichi Yamagishi:
ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan. CoRR abs/2109.00535 (2021) - [i35]Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md. Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas W. D. Evans, Héctor Delgado:
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection. CoRR abs/2109.00537 (2021) - [i34]Natalia A. Tomashenko, Xin Wang, Emmanuel Vincent, Jose Patino, Brij Mohan Lal Srivastava, Paul-Gauthier Noé, Andreas Nautsch, Nicholas W. D. Evans, Junichi Yamagishi, Benjamin O'Brien, Anaïs Chanclu, Jean-François Bonastre, Massimiliano Todisco, Mohamed Maouche:
The VoicePrivacy 2020 Challenge: Results and findings. CoRR abs/2109.00648 (2021) - [i33]Xin Wang, Junichi Yamagishi:
Estimating the confidence of speech spoofing countermeasure. CoRR abs/2110.04775 (2021) - [i32]Xin Wang, Junichi Yamagishi:
Investigating self-supervised front ends for speech spoofing countermeasures. CoRR abs/2111.07725 (2021) - 2020
- [j9]Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Shinji Takaki, Junichi Yamagishi:
Modeling of Rakugo Speech and Its Limitations: Toward Speech Synthesis That Entertains Audiences. IEEE Access 8: 138149-138161 (2020) - [j8]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020) - [j7]Xin Wang, Shinji Takaki, Junichi Yamagishi, Simon King, Keiichi Tokuda:
A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 28: 157-170 (2020) - [j6]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Neural Source-Filter Waveform Models for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 28: 402-415 (2020) - [j5]Tomi Kinnunen, Héctor Delgado, Nicholas W. D. Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2195-2210 (2020) - [c43]Erica Cooper, Cheng-I Lai, Yusuke Yasuda, Fuming Fang, Xin Wang, Nanxin Chen, Junichi Yamagishi:
Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings. ICASSP 2020: 6184-6188 - [c42]Yi Zhao, Xin Wang, Lauri Juvela, Junichi Yamagishi:
Transferring Neural Speech Waveform Synthesizers to Musical Instrument Sounds Generation. ICASSP 2020: 6269-6273 - [c41]Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
Effect of Choice of Probability Distribution, Randomness, and Search Methods for Alignment Modeling in Sequence-to-Sequence Text-to-Speech Synthesis Using Hard Alignment. ICASSP 2020: 6724-6728 - [c40]Xin Wang, Jun Du, Alejandrina Cristià, Lei Sun, Chin-Hui Lee:
A Study of Child Speech Extraction Using Joint Speech Enhancement and Separation in Realistic Conditions. ICASSP 2020: 7304-7308 - [c39]Natalia A. Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco:
Introducing the VoicePrivacy Initiative. INTERSPEECH 2020: 1693-1697 - [c38]Brij Mohan Lal Srivastava, Natalia A. Tomashenko, Xin Wang, Emmanuel Vincent, Junichi Yamagishi, Mohamed Maouche, Aurélien Bellet, Marc Tommasi:
Design Choices for X-Vector Based Speaker Anonymization. INTERSPEECH 2020: 1713-1717 - [c37]Xin Wang, Junichi Yamagishi:
Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model. INTERSPEECH 2020: 1992-1996 - [c36]Yang Ai, Xin Wang, Junichi Yamagishi, Zhen-Hua Ling:
Reverberation Modeling for Source-Filter-Based Neural Vocoder. INTERSPEECH 2020: 3560-3564 - [c35]Xin Wang, Wei Huang, Qi Liu, Yu Yin, Zhenya Huang, Le Wu, Jianhui Ma, Xue Wang:
Fine-Grained Similarity Measurement between Educational Videos and Exercises. ACM Multimedia 2020: 331-339 - [c34]Leibny Paola García-Perera, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker Detection in the Wild: Lessons Learned from JSALT 2019. Odyssey 2020: 415-422 - [i31]Natalia A. Tomashenko, Brij Mohan Lal Srivastava, Xin Wang, Emmanuel Vincent, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Jose Patino, Jean-François Bonastre, Paul-Gauthier Noé, Massimiliano Todisco:
Introducing the VoicePrivacy Initiative. CoRR abs/2005.01387 (2020) - [i30]Yang Ai, Xin Wang, Junichi Yamagishi, Zhen-Hua Ling:
Reverberation Modeling for Source-Filter-based Neural Vocoder. CoRR abs/2005.07379 (2020) - [i29]Brij Mohan Lal Srivastava, Natalia A. Tomashenko, Xin Wang, Emmanuel Vincent, Junichi Yamagishi, Mohamed Maouche, Aurélien Bellet, Marc Tommasi:
Design Choices for X-vector Based Speaker Anonymization. CoRR abs/2005.08601 (2020) - [i28]Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis. CoRR abs/2005.10390 (2020) - [i27]Tomi Kinnunen, Héctor Delgado, Nicholas W. D. Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. CoRR abs/2007.05979 (2020) - [i26]Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE. CoRR abs/2010.09602 (2020) - [i25]Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Junichi Yamagishi:
How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers? CoRR abs/2010.11549 (2020) - [i24]Yang Ai, Haoyu Li, Xin Wang, Junichi Yamagishi, Zhen-Hua Ling:
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation. CoRR abs/2011.03955 (2020) - [i23]Erica Cooper, Xin Wang, Yi Zhao, Yusuke Yasuda, Junichi Yamagishi:
Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis. CoRR abs/2011.04839 (2020)
2010 – 2019
- 2019
- [c33]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Neural Source-filter-based Waveform Model for Statistical Parametric Speech Synthesis. ICASSP 2019: 5916-5920 - [c32]Fuming Fang, Xin Wang, Junichi Yamagishi, Isao Echizen:
Audiovisual Speaker Conversion: Jointly and Simultaneously Transforming Facial Expression and Acoustic Characteristics. ICASSP 2019: 6795-6799 - [c31]Yusuke Yasuda, Xin Wang, Shinji Takaki, Junichi Yamagishi:
Investigation of Enhanced Tacotron Text-to-speech Synthesis Systems with Self-attention for Pitch Accent Language. ICASSP 2019: 6905-6909 - [c30]Shinji Takaki, Toru Nakashika, Xin Wang, Junichi Yamagishi:
STFT Spectral Loss for Training a Neural Speech Waveform Model. ICASSP 2019: 7065-7069 - [c29]Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Tomi H. Kinnunen, Kong Aik Lee:
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. INTERSPEECH 2019: 1008-1012 - [c28]Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi:
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet. INTERSPEECH 2019: 1298-1302 - [c27]Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora. INTERSPEECH 2019: 1303-1307 - [c26]Chen-Chou Lo, Szu-Wei Fu, Wen-Chin Huang, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
MOSNet: Deep Learning-Based Objective Assessment for Voice Conversion. INTERSPEECH 2019: 1541-1545 - [c25]Xin Wang, Junichi Yamagishi:
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis. SSW 2019: 1-6 - [c24]Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Shinji Takaki, Junichi Yamagishi:
Rakugo speech synthesis using segment-to-segment neural transduction and style tokens - toward speech synthesis for entertaining audiences. SSW 2019: 111-116 - [c23]Fuming Fang, Xin Wang, Junichi Yamagishi, Isao Echizen, Massimiliano Todisco, Nicholas W. D. Evans, Jean-François Bonastre:
Speaker Anonymization Using X-vector and Neural Waveform Models. SSW 2019: 155-160 - [c22]Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
Initial investigation of encoder-decoder end-to-end TTS using marginalization of monotonic hard alignments. SSW 2019: 211-216 - [i22]Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi:
Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet. CoRR abs/1903.12389 (2019) - [i21]Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora. CoRR abs/1904.00771 (2019) - [i20]Massimiliano Todisco, Xin Wang, Ville Vestman, Md. Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas W. D. Evans, Tomi Kinnunen, Kong Aik Lee:
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. CoRR abs/1904.05441 (2019) - [i19]Chen-Chou Lo, Szu-Wei Fu, Wen-Chin Huang, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang:
MOSNet: Deep Learning based Objective Assessment for Voice Conversion. CoRR abs/1904.08352 (2019) - [i18]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Neural source-filter waveform models for statistical parametric speech synthesis. CoRR abs/1904.12088 (2019) - [i17]Fuming Fang, Xin Wang, Junichi Yamagishi, Isao Echizen, Massimiliano Todisco, Nicholas W. D. Evans, Jean-François Bonastre:
Speaker Anonymization Using X-vector and Neural Waveform Models. CoRR abs/1905.13561 (2019) - [i16]Xin Wang, Junichi Yamagishi:
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis. CoRR abs/1908.10256 (2019) - [i15]Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments. CoRR abs/1908.11535 (2019) - [i14]Yi Zhao, Xin Wang, Lauri Juvela, Junichi Yamagishi:
Transferring neural speech waveform synthesizers to musical instrument sounds generation. CoRR abs/1910.12381 (2019) - [i13]Yusuke Yasuda, Xin Wang, Junichi Yamagishi:
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment. CoRR abs/1910.12383 (2019) - [i12]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - [i11]Seyyed Saeed Sarfjoo, Xin Wang, Gustav Eje Henter, Jaime Lorenzo-Trueba, Shinji Takaki, Junichi Yamagishi:
Transformation of low-quality device-recorded speech to high-quality speech using improved SEGAN model. CoRR abs/1911.03952 (2019) - [i10]Paola García, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker detection in the wild: Lessons learned from JSALT 2019. CoRR abs/1912.00938 (2019) - 2018
- [b1]Xin Wang:
Fundamental Frequency Modeling for Neural-Network-Based Statistical Parametric Speech Synthesis. Graduate University for Advanced Studies, Japan, 2018 - [j4]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Investigating very deep highway networks for parametric speech synthesis. Speech Commun. 96: 1-9 (2018) - [j3]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 26(8): 1406-1419 (2018) - [c21]Gustav Eje Henter, Jaime Lorenzo-Trueba, Xin Wang, Mariko Kondo, Junichi Yamagishi:
Cyborg Speech: Deep Multilingual Speech Synthesis for Generating Segmental Foreign Accent with Natural Prosody. ICASSP 2018: 4799-4803 - [c20]Xin Wang, Jaime Lorenzo-Trueba, Shinji Takaki, Lauri Juvela, Junichi Yamagishi:
A Comparison of Recent Waveform Generation and Acoustic Modeling Methods for Neural-Network-Based Speech Synthesis. ICASSP 2018: 4804-4808 - [c19]Lauri Juvela, Bajibabu Bollepalli, Xin Wang, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speech Waveform Synthesis from MFCC Sequences with Generative Adversarial Networks. ICASSP 2018: 5679-5683 - [c18]Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects. INTERSPEECH 2018: 37-41 - [c17]Xin Wang, Jun Du, Lei Sun, Qing Wang, Chin-Hui Lee:
A Progressive Deep Learning Approach to Child Speech Separation. ISCSLP 2018: 76-80 - [c16]Jaime Lorenzo-Trueba, Fuming Fang, Xin Wang, Isao Echizen, Junichi Yamagishi, Tomi Kinnunen:
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data. Odyssey 2018: 240-247 - [i9]Jaime Lorenzo-Trueba, Fuming Fang, Xin Wang, Isao Echizen, Junichi Yamagishi, Tomi Kinnunen:
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data. CoRR abs/1803.00860 (2018) - [i8]Lauri Juvela, Bajibabu Bollepalli, Xin Wang, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speech waveform synthesis from MFCC sequences with generative adversarial networks. CoRR abs/1804.00920 (2018) - [i7]Xin Wang, Jaime Lorenzo-Trueba, Shinji Takaki, Lauri Juvela, Junichi Yamagishi:
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis. CoRR abs/1804.02549 (2018) - [i6]Gustav Eje Henter, Xin Wang, Junichi Yamagishi:
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis. CoRR abs/1807.11470 (2018) - [i5]Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects. CoRR abs/1808.00665 (2018) - [i4]Shinji Takaki, Toru Nakashika, Xin Wang, Junichi Yamagishi:
STFT spectral loss for training a neural speech waveform model. CoRR abs/1810.11945 (2018) - [i3]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Neural source-filter-based waveform model for statistical parametric speech synthesis. CoRR abs/1810.11946 (2018) - [i2]Yusuke Yasuda, Xin Wang, Shinji Takaki, Junichi Yamagishi:
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language. CoRR abs/1810.11960 (2018) - [i1]Fuming Fang, Xin Wang, Junichi Yamagishi, Isao Echizen:
Audiovisual speaker conversion: jointly and simultaneously transforming facial expression and acoustic characteristics. CoRR abs/1810.12730 (2018) - 2017
- [c15]Xin Wang, Jun Du, Yannan Wang:
A maximum likelihood approach to deep neural network based speech dereverberation. APSIPA 2017: 155-158 - [c14]Xin Wang, Shinji Takaki, Junichi Yamagishi:
An autoregressive recurrent mixture density network for parametric speech synthesis. ICASSP 2017: 4895-4899 - [c13]Xin Wang, Shinji Takaki, Junichi Yamagishi:
An RNN-Based Quantized F0 Model with Multi-Tier Feedback Links for Text-to-Speech Synthesis. INTERSPEECH 2017: 1059-1063 - [c12]Gustav Eje Henter, Jaime Lorenzo-Trueba, Xin Wang, Junichi Yamagishi:
Principles for Learning Controllable TTS from Annotated and Latent Variation. INTERSPEECH 2017: 3956-3960 - 2016
- [j2]Xin Wang, Zhen-Hua Ling, Li-Rong Dai:
Concept-to-Speech generation with knowledge sharing for acoustic modelling and utterance filtering. Comput. Speech Lang. 38: 46-67 (2016) - [j1]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Investigation of Using Continuous Representation of Various Linguistic Units in Neural Network Based Text-to-Speech Synthesis. IEICE Trans. Inf. Syst. 99-D(10): 2471-2480 (2016) - [c11]Lauri Juvela, Xin Wang, Shinji Takaki, Sangjin Kim, Manu Airaksinen, Junichi Yamagishi:
The NII speech synthesis entry for Blizzard Challenge 2016. Blizzard Challenge 2016 - [c10]Xin Wang, Minghui Dong, Zhen-Hua Ling:
A full training framework of cross-stream dependence modelling for HMM-based singing voice synthesis. ICASSP 2016: 5165-5169 - [c9]Cassia Valentini-Botinhao, Xin Wang, Shinji Takaki, Junichi Yamagishi:
Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks. INTERSPEECH 2016: 352-356 - [c8]Lauri Juvela, Xin Wang, Shinji Takaki, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks. INTERSPEECH 2016: 2283-2287 - [c7]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Enhance the Word Vector with Prosodic Information for the Recurrent Neural Network Based TTS System. INTERSPEECH 2016: 2856-2860 - [c6]Xin Wang, Shinji Takaki, Junichi Yamagishi:
A Comparative Study of the Performance of HMM, DNN, and RNN based Speech Synthesis Systems Trained on Very Large Speaker-Dependent Corpora. SSW 2016: 118-121 - [c5]Cassia Valentini-Botinhao, Xin Wang, Shinji Takaki, Junichi Yamagishi:
Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech. SSW 2016: 146-152 - [c4]Xin Wang, Shinji Takaki, Junichi Yamagishi:
Investigating Very Deep Highway Networks for Parametric Speech Synthesis. SSW 2016: 166-171 - 2014
- [c3]Xin Wang, Zhen-Hua Ling, Li-Rong Dai:
Concept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis. INTERSPEECH 2014: 2942-2946 - 2013
- [c2]Shen Liu, Jianguo Wei, Xin Wang, Wenhuan Lu, Qiang Fang, Jianwu Dang:
An anisotropic diffusion filter based on multidirectional separability. INTERSPEECH 2013: 3187-3190 - 2012
- [c1]Xin Wang, Zhen-Hua Ling, Li-Rong Dai:
Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis. ISCSLP 2012: 84-87
Coauthor Index
aka: Tomi H. Kinnunen
aka: Kong Aik Lee
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-19 23:12 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint