default search action
Yonghong Yan 0002
Person information
- affiliation: Chinese Academy of Sciences, Institute of Acoustics / Xinjiang Technical Institute of Physics and Chemistry, China
Other persons with the same name
- Yonghong Yan 0001 — University of South Carolina, Columbia, SC, USA (and 4 more)
- Yonghong Yan 0003 — Chongqing University, Faculty of Architecture and Urban Planning, China
- Yonghong Yan 0004 — University of North Carolina, College of Computing and Informatics, Charlotte, NC, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j94]Shengchang Xiao, Xueshuai Zhang, Pengyuan Zhang, Yonghong Yan:
Semi-supervised sound event detection with dynamic convolution and confidence-aware mean teacher. Digit. Signal Process. 156: 104794 (2025) - 2024
- [j93]Jiakun Shen, Xueshuai Zhang, Yu Lu, Pengfei Ye, Pengyuan Zhang, Yonghong Yan:
Novel audio characteristic-dependent feature extraction and data augmentation methods for cough-based respiratory disease classification. Comput. Biol. Medicine 179: 108843 (2024) - [j92]Chengzhang Li, Ming Zhang, Xuejun Zhang, Yonghong Yan:
MCRSpell: A metric learning of correct representation for Chinese spelling correction. Expert Syst. Appl. 237(Part B): 121513 (2024) - [j91]Jianjun Gu, Dingding Yao, Junfeng Li, Yonghong Yan:
A novel semi-blind source separation framework towards maximum signal-to-interference ratio. Signal Process. 217: 109359 (2024) - [j90]Haitian Lu, Gaofeng Cheng, Yonghong Yan:
Conversational Short-Phrase Speaker Diarization via Self-Adjusting Speech Segmentation and Embedding Extraction. IEEE Signal Process. Lett. 31: 2340-2344 (2024) - [j89]Han Zhu, Gaofeng Cheng, Jindong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan:
Boosting Cross-Domain Speech Recognition With Self-Supervision. IEEE ACM Trans. Audio Speech Lang. Process. 32: 471-485 (2024) - [j88]Yifan Chen, Gaofeng Cheng, Runyan Yang, Pengyuan Zhang, Yonghong Yan:
Interrelate Training and Clustering for Online Speaker Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1352-1364 (2024) - [c263]Zelin Qiu, Jianjun Gu, Dingding Yao, Junfeng Li, Yonghong Yan:
BMMSNet: Bidirectional Mapping and Multilevel Similarity Comparison for EEG-Speech Match-Mismatch Problem. ICASSP Workshops 2024: 117-118 - [c262]Aolin Hu, Xueshuai Zhang, Shaoxing Zhang, Pengyuan Zhang, Yu Lu, Pengfei Ye, Qingwei Zhao, Yonghong Yan:
Snore Sound Features Based on Percussive Enhancing and Positional Encoding Combined with Multi-Task Learning for Osahs Detection. ICASSP 2024: 901-905 - [c261]Jiakun Shen, Xueshuai Zhang, Pengyuan Zhang, Yonghong Yan, Qingwei Zhao, Ta Li, Yanfen Tang, Shaoxing Zhang:
One-Epoch Training with Single Test Sample in Test Time for Better Generalization of Cough-Based Covid-19 Detection Model. ICASSP 2024: 931-935 - [c260]Ke Chen, Zhihua Huang, Kexin Lu, Yonghong Yan:
CosDiff: Code-Switching TTS Model Based on A Multi-Task DDIM. ICME 2024: 1-6 - 2023
- [j87]Han Wang, Ruiliu Fu, Chengzhang Li, Xuejun Zhang, Jun Zhou, Xing Bai, Yonghong Yan, Qingwei Zhao:
Reminding the incremental language model via data-free self-distillation. Appl. Intell. 53(8): 9298-9320 (2023) - [j86]Yukun Liu, Ta Li, Pengyuan Zhang, Yonghong Yan:
SFA: Searching faster architectures for end-to-end automatic speech recognition models. Comput. Speech Lang. 81: 101500 (2023) - [j85]Jianjun Gu, Longbiao Cheng, Dingding Yao, Junfeng Li, Yonghong Yan:
The effect of source sparsity on independent vector analysis for blind source separation. Signal Process. 213: 109199 (2023) - [j84]Feng Dang, Hangting Chen, Qi Hu, Pengyuan Zhang, Yonghong Yan:
First coarse, fine afterward: A lightweight two-stage complex approach for monaural speech enhancement. Speech Commun. 146: 32-44 (2023) - [j83]Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan:
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3320-3330 (2023) - [c259]Jiakun Shen, Xueshuai Zhang, Pengyuan Zhang, Yonghong Yan, Shaoxing Zhang, Zhihua Huang, Yanfen Tang, Yu Wang, Fujie Zhang, Aijun Sun:
Piecewise Position Encoding in Convolutional Neural Network for Cough-Based Covid-19 Detection. ICASSP 2023: 1-5 - [i25]Changfeng Gao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Speech Corpora Divergence Based Unsupervised Data Selection for ASR. CoRR abs/2302.13222 (2023) - [i24]Feng Dang, Qi Hu, Pengyuan Zhang, Yonghong Yan:
ForkNet: Simultaneous Time and Time-Frequency Domain Modeling for Speech Enhancement. CoRR abs/2305.08292 (2023) - [i23]Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan:
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition. CoRR abs/2308.06547 (2023) - 2022
- [j82]Zeyu He, Wang Li, Yonghong Yan:
Modeling knowledge proficiency using multi-hierarchical capsule graph neural network. Appl. Intell. 52(7): 7230-7247 (2022) - [j81]Xingyue Zhou, Ning Wang, Yonghong Yan, Kunde Yang:
Underwater Detection of Small-Volume Weak Target Echo in Harbor Scene Under Multisource Interference. IEEE Geosci. Remote. Sens. Lett. 19: 1-5 (2022) - [j80]Daocheng Chen, Longbiao Cheng, Dingding Yao, Junfeng Li, Yonghong Yan:
A Secondary Path-Decoupled Active Noise Control Algorithm Based on Deep Learning. IEEE Signal Process. Lett. 29: 234-238 (2022) - [j79]Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
An E2E-ASR-Based Iteratively-Trained Timestamp Estimator. IEEE Signal Process. Lett. 29: 1654-1658 (2022) - [j78]Keqi Deng, Gaofeng Cheng, Runyan Yang, Yonghong Yan:
Alleviating ASR Long-Tailed Problem by Decoupling the Learning of Representation and Classification. IEEE ACM Trans. Audio Speech Lang. Process. 30: 340-354 (2022) - [j77]Gaofeng Cheng, Haoran Miao, Runyan Yang, Keqi Deng, Yonghong Yan:
ETEH: Unified Attention-Based End-to-End ASR and KWS Architecture. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1360-1373 (2022) - [j76]Changfeng Gao, Gaofeng Cheng, Ta Li, Pengyuan Zhang, Yonghong Yan:
Self-Supervised Pre-Training for Attention-Based Encoder-Decoder ASR Model. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1763-1774 (2022) - [c258]Yukun Liu, Ta Li, Pengyuan Zhang, Yonghong Yan:
NAS-SCAE: Searching Compact Attention-based Encoders For End-to-end Automatic Speech Recognition. INTERSPEECH 2022: 1011-1015 - [c257]Yifan Chen, Yifan Guo, Qingxuan Li, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization. INTERSPEECH 2022: 1456-1460 - [c256]Zehan Li, Haoran Miao, Keqi Deng, Gaofeng Cheng, Sanli Tian, Ta Li, Yonghong Yan:
Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies. INTERSPEECH 2022: 1671-1675 - [c255]Zehui Yang, Yifan Chen, Lei Luo, Runyan Yang, Lingxuan Ye, Gaofeng Cheng, Ji Xu, Yaohui Jin, Qingqing Zhang, Pengyuan Zhang, Lei Xie, Yonghong Yan:
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset. INTERSPEECH 2022: 1736-1740 - [c254]Xueshuai Zhang, Jiakun Shen, Jun Zhou, Pengyuan Zhang, Yonghong Yan, Zhihua Huang, Yanfen Tang, Yu Wang, Fujie Zhang, Shaoxing Zhang, Aijun Sun:
Robust Cough Feature Extraction and Classification Method for COVID-19 Cough Detection Based on Vocalization Characteristics. INTERSPEECH 2022: 2168-2172 - [c253]Han Zhu, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Decoupled Federated Learning for ASR with Non-IID Data. INTERSPEECH 2022: 2628-2632 - [c252]Sanli Tian, Keqi Deng, Zehan Li, Lingxuan Ye, Gaofeng Cheng, Ta Li, Yonghong Yan:
Knowledge Distillation For CTC-based Speech Recognition Via Consistent Acoustic Representation Learning. INTERSPEECH 2022: 2633-2637 - [c251]Lingxuan Ye, Gaofeng Cheng, Runyan Yang, Zehui Yang, Sanli Tian, Pengyuan Zhang, Yonghong Yan:
Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods. INTERSPEECH 2022: 3163-3167 - [c250]Han Zhu, Li Wang, Gaofeng Cheng, Jindong Wang, Pengyuan Zhang, Yonghong Yan:
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR. INTERSPEECH 2022: 4870-4874 - [c249]Qingxuan Li, Han Zhu, Liuping Luo, Gaofeng Cheng, Pengyuan Zhang, Jiasong Sun, Yonghong Yan:
Sequence Distribution Matching for Unsupervised Domain Adaptation in ASR. ISCSLP 2022: 21-25 - [c248]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. ISCSLP 2022: 488-492 - [c247]Shuhao Deng, Chengfei Li, Jinfeng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Summary On The ISCSLP 2022 Chinese-English Code-Switching ASR Challenge. ISCSLP 2022: 527-531 - [i22]Zehui Yang, Yifan Chen, Lei Luo, Runyan Yang, Lingxuan Ye, Gaofeng Cheng, Ji Xu, Yaohui Jin, Qingqing Zhang, Pengyuan Zhang, Lei Xie, Yonghong Yan:
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset. CoRR abs/2203.16844 (2022) - [i21]Han Zhu, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Decoupled Federated Learning for ASR with Non-IID Data. CoRR abs/2206.09102 (2022) - [i20]Han Zhu, Gaofeng Cheng, Jindong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan:
Boosting Cross-Domain Speech Recognition with Self-Supervision. CoRR abs/2206.09783 (2022) - [i19]Yifan Chen, Yifan Guo, Qingxuan Li, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization. CoRR abs/2206.13760 (2022) - [i18]Zehan Li, Haoran Miao, Keqi Deng, Gaofeng Cheng, Sanli Tian, Ta Li, Yonghong Yan:
Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies. CoRR abs/2207.02495 (2022) - [i17]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. CoRR abs/2208.08042 (2022) - [i16]Shuhao Deng, Chengfei Li, Jinfeng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge. CoRR abs/2210.06091 (2022) - 2021
- [j75]Dongni Hu, Chengxin Chen, Pengyuan Zhang, Junfeng Li, Yonghong Yan, Qingwei Zhao:
A Two-Stage Attention Based Modality Fusion Framework for Multi-Modal Speech Emotion Recognition. IEICE Trans. Inf. Syst. 104-D(8): 1391-1394 (2021) - [j74]Danyang Liu, Ji Xu, Pengyuan Zhang, Yonghong Yan:
A unified system for multilingual speech recognition and language identification. Speech Commun. 127: 17-28 (2021) - [j73]Longbiao Cheng, Junfeng Li, Yonghong Yan:
FSCNet: Feature-Specific Convolution Neural Network for Real-Time Speech Enhancement. IEEE Signal Process. Lett. 28: 1958-1962 (2021) - [j72]Longbiao Cheng, Xingwei Sun, Dingding Yao, Junfeng Li, Yonghong Yan:
Estimation Reliability Function Assisted Sound Source Localization With Enhanced Steering Vector Phase Difference. IEEE ACM Trans. Audio Speech Lang. Process. 29: 421-435 (2021) - [j71]Runyan Yang, Gaofeng Cheng, Haoran Miao, Ta Li, Pengyuan Zhang, Yonghong Yan:
Keyword Search Using Attention-Based End-to-End ASR and Frame-Synchronous Phoneme Alignments. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3202-3215 (2021) - [c246]Zhuo Li, Ce Fang, Runqiu Xiao, Wenchao Wang, Yonghong Yan:
SI-Net: Multi-Scale Context-Aware Convolutional Block for Speaker Verification. ASRU 2021: 220-227 - [c245]Yifan Guo, Yifan Chen, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Far-Field Speech Recognition Based on Complex-Valued Neural Networks and Inter-Frame Similarity Difference Method. ASRU 2021: 1003-1010 - [c244]Zeyu He, Jianzong Kuang, Wang Li, Yonghong Yan:
Using Cognitive Interest Graph and Knowledge-activated Attention for Learning Resource Recommendation. COMPSAC 2021: 93-102 - [c243]Ruiliu Fu, Han Wang, Xuejun Zhang, Jun Zhou, Yonghong Yan:
Decomposing Complex Questions Makes Multi-Hop QA Easier and More Interpretable. EMNLP (Findings) 2021: 169-180 - [c242]Keqi Deng, Gaofeng Cheng, Haoran Miao, Pengyuan Zhang, Yonghong Yan:
History Utterance Embedding Transformer LM for Speech Recognition. ICASSP 2021: 5914-5918 - [c241]Changfeng Gao, Gaofeng Cheng, Runyan Yang, Han Zhu, Pengyuan Zhang, Yonghong Yan:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Text Data. ICASSP 2021: 6543-6547 - [c240]Jianjun Gu, Longbiao Cheng, Xingwei Sun, Junfeng Li, Yonghong Yan:
Residual Echo and Noise Cancellation with Feature Attention Module and Multi-Domain Loss Function. Interspeech 2021: 1114-1118 - [c239]Zengqiang Shang, Zhihua Huang, Haozhe Zhang, Pengyuan Zhang, Yonghong Yan:
Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech. Interspeech 2021: 1619-1623 - [c238]Haozhe Zhang, Zhihua Huang, Zengqiang Shang, Pengyuan Zhang, Yonghong Yan:
LinearSpeech: Parallel Text-to-Speech with Linear Complexity. Interspeech 2021: 4129-4133 - [c237]Jiakun Shen, Xueshuai Zhang, Wenchao Wang, Zhihua Huang, Pengyuan Zhang, Yonghong Yan:
Cough-based COVID-19 Detection with Multi-band Long-Short Term Memory and Convolutional Neural Networks. ISAIMS 2021: 209-215 - [c236]Changfeng Gao, Gaofeng Cheng, Jun Zhou, Pengyuan Zhang, Yonghong Yan:
Non-autoregressive Deliberation-Attention based End-to-End ASR. ISCSLP 2021: 1-5 - [c235]Zheying Huang, Peng Li, Ji Xu, Pengyuan Zhang, Yonghong Yan:
Context-dependent Label Smoothing Regularization for Attention-based End-to-End Code-Switching Speech Recognition. ISCSLP 2021: 1-5 - [c234]Zhaoqi Li, Long Wu, Ta Li, Yonghong Yan:
Improves Neural Acoustic Word Embeddings Query by Example Spoken Term Detection with Wav2vec Pretraining and Circle Loss. ISCSLP 2021: 1-5 - [c233]Fan Yang, Junfeng Li, Yonghong Yan:
A New Method for Improving Generative Adversarial Networks in Speech Enhancement. ISCSLP 2021: 1-5 - [i15]Yukun Liu, Ta Li, Pengyuan Zhang, Yonghong Yan:
Improved Conformer-based End-to-End Speech Recognition Using Neural Architecture Search. CoRR abs/2104.05390 (2021) - [i14]Zhuo Li, Ce Fang, Runqiu Xiao, Zhigao Chen, Wenchao Wang, Yonghong Yan:
The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge. CoRR abs/2107.01329 (2021) - [i13]Jianju Gu, Longbiao Cheng, Junfeng Li, Yonghong Yan:
The Source Model Towards Maximizing The Output Signal-To-Interference Ratio For Independent Vector Analysis. CoRR abs/2110.03272 (2021) - [i12]Han Zhu, Li Wang, Ying Hou, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Wav2vec-S: Semi-Supervised Pre-Training for Speech Recognition. CoRR abs/2110.04484 (2021) - [i11]Han Wang, Ruiliu Fu, Chengzhang Li, Xuejun Zhang, Jun Zhou, Yonghong Yan:
Reminding the Incremental Language Model via Data-Free Self-Distillation. CoRR abs/2110.08745 (2021) - [i10]Ruiliu Fu, Han Wang, Xuejun Zhang, Jun Zhou, Yonghong Yan:
Decomposing Complex Questions Makes Multi-Hop QA Easier and More Interpretable. CoRR abs/2110.13472 (2021) - 2020
- [j70]Xiaoxiao Miao, Ian McLoughlin, Yonghong Yan:
A New Time-Frequency Attention Tensor Network for Language Identification. Circuits Syst. Signal Process. 39(5): 2744-2758 (2020) - [j69]Lu Yin, Junfeng Li, Yonghong Yan, Masato Akagi:
A Two-Stage Phase-Aware Approach for Monaural Multi-Talker Speech Separation. IEICE Trans. Inf. Syst. 103-D(7): 1732-1743 (2020) - [j68]Taisong Li, Zeyu He, Bing Wang, Yonghong Yan, Xianghong Tang:
基于循环时间卷积网络的序列流推荐算法 (Session-based Recommendation Algorithm Based on Recurrent Temporal Convolutional Network). 计算机科学 47(3): 103-109 (2020) - [j67]Fan Yang, Ziteng Wang, Junfeng Li, Risheng Xia, Yonghong Yan:
Improving generative adversarial networks for speech enhancement through regularization of latent representations. Speech Commun. 118: 1-9 (2020) - [j66]Haoran Miao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan:
Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1452-1465 (2020) - [j65]Xingwei Sun, Ze-Feng Gao, Zhong-Yi Lu, Junfeng Li, Yonghong Yan:
A Model Compression Method With Matrix Product Operators for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2837-2847 (2020) - [c232]Haoran Miao, Gaofeng Cheng, Changfeng Gao, Pengyuan Zhang, Yonghong Yan:
Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture. ICASSP 2020: 6084-6088 - [c231]Xuejun Zhang, Yujiang Li, Pengyuan Zhang, Yonghong Yan:
Lingual-Agnostic Meta-Learning for Low-Resource Part-of-Speech Tagging. ICIT 2020: 35-39 - [i9]Haoran Miao, Gaofeng Cheng, Changfeng Gao, Pengyuan Zhang, Yonghong Yan:
Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture. CoRR abs/2001.08290 (2020) - [i8]Xingwei Sun, Ze-Feng Gao, Zhong-Yi Lu, Junfeng Li, Yonghong Yan:
A Model Compression Method with Matrix Product Operators for Speech Enhancement. CoRR abs/2010.04950 (2020) - [i7]Han Zhu, Li Wang, Pengyuan Zhang, Yonghong Yan:
Multi-Accent Adaptation based on Gate Mechanism. CoRR abs/2011.02774 (2020)
2010 – 2019
- 2019
- [j64]Danyang Liu, Ji Xu, Pengyuan Zhang, Yonghong Yan:
Investigation of knowledge transfer approaches to improve the acoustic modeling of Vietnamese ASR system. IEEE CAA J. Autom. Sinica 6(5): 1187-1195 (2019) - [j63]Zhaoqiong Huang, Ji Xu, Zaixiao Gong, Haibin Wang, Yonghong Yan:
Multiple Source Localization in a Shallow Water Waveguide Exploiting Subarray Beamforming and Deep Neural Networks. Sensors 19(21): 4768 (2019) - [j62]Yike Zhang, Pengyuan Zhang, Yonghong Yan:
Tailoring an Interpretable Neural Language Model. IEEE ACM Trans. Audio Speech Lang. Process. 27(7): 1164-1178 (2019) - [c230]Wenjing Wei, Ge Zhan, Xun Wang, Pengyuan Zhang, Yonghong Yan:
A Novel Method for Automatic Heart Murmur Diagnosis Using Phonocardiogram. AIAM (ACM) 2019: 37:1-37:6 - [c229]Dingding Yao, Junfeng Li, Huaxing Xu, Risheng Xia, Yonghong Yan:
A Subband Energy Modification Method for Elevation Control in Median Plane. ICASSP 2019: 586-590 - [c228]Hangting Chen, Pengyuan Zhang, Yonghong Yan:
An Audio Scene Classification Framework with Embedded Filters and a DCT-based Temporal Module. ICASSP 2019: 835-839 - [c227]Xingwei Sun, Risheng Xia, Junfeng Li, Yonghong Yan:
A Deep Learning Based Binaural Speech Enhancement Approach with Spatial Cues Preservation. ICASSP 2019: 5766-5770 - [c226]Wenchao Wang, Yike Zhang, Ji Xu, Yonghong Yan:
Multiple Temporal Scales Based Speaker Embeddings Learning for Text-dependent Speaker Recognition. ICASSP 2019: 6311-6315 - [c225]Chunhui Lu, Pengyuan Zhang, Yonghong Yan:
Self-attention Based Prosodic Boundary Prediction for Chinese Speech Synthesis. ICASSP 2019: 7035-7039 - [c224]Haichuan Bai, Hangting Chen, Yonghong Yan:
Audio Scene Classification with Discriminatively-Trained Segment-Level Features. ICME Workshops 2019: 354-359 - [c223]Long Wu, Hangting Chen, Li Wang, Pengyuan Zhang, Yonghong Yan:
Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning. INTERSPEECH 2019: 431-435 - [c222]Han Zhu, Li Wang, Pengyuan Zhang, Yonghong Yan:
Multi-Accent Adaptation Based on Gate Mechanism. INTERSPEECH 2019: 744-748 - [c221]Haoran Miao, Gaofeng Cheng, Pengyuan Zhang, Ta Li, Yonghong Yan:
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition. INTERSPEECH 2019: 2623-2627 - [c220]Wenjie Li, Pengyuan Zhang, Yonghong Yan:
Target Speaker Recovery and Recognition Network with Average x-Vector and Global Training. INTERSPEECH 2019: 3233-3237 - [c219]Chang Liu, Zhen Zhang, Pengyuan Zhang, Yonghong Yan:
Character-Aware Sub-Word Level Language Modeling for Uyghur and Turkish ASR. INTERSPEECH 2019: 3495-3499 - [c218]Xiaoxiao Miao, Ian McLoughlin, Yonghong Yan:
A New Time-Frequency Attention Mechanism for TDNN and CNN-LSTM-TDNN, with Application to Language Identification. INTERSPEECH 2019: 4080-4084 - [i6]Hangting Chen, Zuozhen Liu, Zongming Liu, Pengyuan Zhang, Yonghong Yan:
Integrating the Data Augmentation Scheme with Various Classifiers for Acoustic Scene Modeling. CoRR abs/1907.06639 (2019) - 2018
- [j61]Taisong Li, Jiawei Zhang, Philip S. Yu, Yan Zhang, Yonghong Yan:
Deep Dynamic Network Embedding for Link Prediction. IEEE Access 6: 29219-29230 (2018) - [j60]Taisong Li, Bing Wang, Yasong Jiang, Yan Zhang, Yonghong Yan:
Restricted Boltzmann Machine-Based Approaches for Link Prediction in Dynamic Networks. IEEE Access 6: 29940-29951 (2018) - [j59]Ziteng Wang, Emmanuel Vincent, Romain Serizel, Yonghong Yan:
Rank-1 constrained Multichannel Wiener Filter for speech recognition in noisy environments. Comput. Speech Lang. 49: 37-51 (2018) - [c217]Ziteng Wang, Junfeng Li, Yonghong Yan, Emmanuel Vincent:
Semi-Supervised Learning with Deep Neural Networks for Relative Transfer Function Inverse Regression. ICASSP 2018: 191-195 - [c216]Ziteng Wang, Lu Yin, Junfeng Li, Yonghong Yan:
On SDW-MWF and Variable Span Linear Filter with Application to Speech Recognition in Noisy Environments. ICASSP 2018: 526-530 - [c215]Zhaoqiong Huang, Ji Xu, Zaixiao Gong, Haibin Wang, Yonghong Yan:
A Deep Neural Network Based Method of Source Localization in a Shallow Water Environment. ICASSP 2018: 3499-3503 - [c214]Yu Zhang, Wenjie Li, Pengyuan Zhang, Yonghong Yan:
Improving Multichannel Speech Recognition with Generalized Cross Correlation Inputs and Multitask Learning. ICASSP 2018: 5704-5708 - [c213]Xingwei Sun, Ziteng Wang, Risheng Xia, Junfeng Li, Yonghong Yan:
Effect of Steering Vector Estimation on MVDR Beamformer for Noisy Speech Recognition. DSP 2018: 1-5 - [c212]Yujiang Li, Xuemin Zhao, Weiqun Xu, Yonghong Yan:
Cross-Lingual Multi-Task Neural Architecture for Spoken Language Understanding. INTERSPEECH 2018: 566-570 - [c211]Lu Yin, Ziteng Wang, Risheng Xia, Junfeng Li, Yonghong Yan:
Multi-talker Speech Separation Based on Permutation Invariant Training and Beamforming. INTERSPEECH 2018: 851-855 - [c210]Gaofeng Cheng, Daniel Povey, Lu Huang, Ji Xu, Sanjeev Khudanpur, Yonghong Yan:
Output-Gate Projected Gated Recurrent Unit for Speech Recognition. INTERSPEECH 2018: 1793-1797 - [c209]Wenjie Li, Gaofeng Cheng, Fengpei Ge, Pengyuan Zhang, Yonghong Yan:
Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR. INTERSPEECH 2018: 2888-2892 - [c208]Hangting Chen, Pengyuan Zhang, Haichuan Bai, Qingsheng Yuan, Xiuguo Bao, Yonghong Yan:
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling. INTERSPEECH 2018: 3304-3308 - [c207]Yike Zhang, Pengyuan Zhang, Yonghong Yan:
Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition. INTERSPEECH 2018: 3348-3352 - [c206]Shengyu Yao, Houjun Huang, Ruohua Zhou, Yonghong Yan:
Text-dependent Speaker Verification Using Word-based Scoring. ISCSLP 2018: 314-318 - [c205]Gaofeng Cheng, Lu Huang, Jiasong Sun, Yonghong Yan:
Bidirectional LSTM with Extended Input Context. ISCSLP 2018: 364-368 - [c204]Long Wu, Li Wang, Pengyuan Zhang, Ta Li, Yonghong Yan:
Space-Time Residual LSTM Architechture for Distant Speech Recognition. ISCSLP 2018: 379-383 - [c203]Junqing He, Xian Huang, Xuemin Zhao, Yan Zhang, Yonghong Yan:
Discriminating between Similar Languages on Imbalanced Conversational Texts. LREC 2018 - [c202]Mingming Fu, Xuemin Zhao, Yonghong Yan:
HCCL at SemEval-2018 Task 8: An End-to-End System for Sequence Labeling from Cybersecurity Reports. SemEval@NAACL-HLT 2018: 874-877 - [c201]Xiaoxiao Miao, Ian McLoughlin, Shengyu Yao, Yonghong Yan:
Improved Conditional Generative Adversarial Net Classification For Spoken Language Recognition. SLT 2018: 98-104 - 2017
- [j58]Dongwen Ying, Ruohua Zhou, Junfeng Li, Yonghong Yan:
Window-Dominant Signal Subspace Methods for Multiple Short-Term Speech Source Localization. IEEE ACM Trans. Audio Speech Lang. Process. 25(4): 731-744 (2017) - [c200]Fengpei Ge, Yonghong Yan:
Deep neural network based wake-up-word speech recognition with two-stage detection. ICASSP 2017: 2761-2765 - [c199]Yike Zhang, Pengyuan Zhang, Qingwei Zhao, Yonghong Yan, Zhenjiang Dong, Xia Jia:
An improved lexicon generation method for mandarin speech recognition. ICNC-FSKD 2017: 661-665 - [c198]Ge Zhang, Pengyuan Zhang, Jielin Pan, Yonghong Yan:
Fast variable-frame-rate decoding of speech recognition based on deep neural networks. ICNC-FSKD 2017: 821-825 - [c197]Xu Li, Junfeng Li, Yonghong Yan:
Ideal Ratio Mask Estimation Using Deep Neural Networks for Monaural Speech Segregation in Noisy Reverberant Conditions. INTERSPEECH 2017: 1203-1207 - [c196]Gaofeng Cheng, Vijayaditya Peddinti, Daniel Povey, Vimal Manohar, Sanjeev Khudanpur, Yonghong Yan:
An Exploration of Dropout with LSTMs. INTERSPEECH 2017: 1586-1590 - [c195]Zhaoqiong Huang, Zhanzhong Cao, Dongwen Ying, Jielin Pan, Yonghong Yan:
Time Delay Histogram Based Speech Source Separation Using a Planar Array. INTERSPEECH 2017: 1879-1883 - [c194]Fengpei Ge, Kehuang Li, Bo Wu, Sabato Marco Siniscalchi, Yonghong Yan, Chin-Hui Lee:
Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition. INTERSPEECH 2017: 3847-3851 - [c193]Yu Zhang, Pengyuan Zhang, Yonghong Yan:
Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition. INTERSPEECH 2017: 3857-3861 - [c192]Junqing He, Long Wu, Xuemin Zhao, Yonghong Yan:
HCCL at SemEval-2017 Task 2: Combining Multilingual Word Embeddings and Transliteration Model for Semantic Similarity. SemEval@ACL 2017: 220-225 - [i5]Ziteng Wang, Emmanuel Vincent, Romain Serizel, Yonghong Yan:
Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments. CoRR abs/1707.00201 (2017) - [i4]Ziteng Wang, Emmanuel Vincent, Yonghong Yan:
Relative Transfer Function Inverse Regression from Low Dimensional Manifold. CoRR abs/1710.09091 (2017) - [i3]Xiaofei Wang, Yonghong Yan, Hynek Hermansky:
Stream Attention for far-field multi-microphone ASR. CoRR abs/1711.11141 (2017) - 2016
- [j57]Chao Wu, Xiaofei Wang, Yanmeng Guo, Qiang Fu, Yonghong Yan:
Robust Uncertainty Control of the Simplified Kalman Filter for Acoustic Echo Cancelation. Circuits Syst. Signal Process. 35(12): 4584-4595 (2016) - [j56]Hang Ren, Qingwei Zhao, Yonghong Yan:
Policy Optimization for Spoken Dialog Management Using Genetic Algorithm. IEICE Trans. Inf. Syst. 99-D(10): 2499-2507 (2016) - [j55]Xuyang Wang, Pengyuan Zhang, Qingwei Zhao, Jielin Pan, Yonghong Yan:
Improved End-to-End Speech Recognition Using Adaptive Per-Dimensional Learning Rate Methods. IEICE Trans. Inf. Syst. 99-D(10): 2550-2553 (2016) - [j54]Mengzhe Chen, Jielin Pan, Qingwei Zhao, Yonghong Yan:
Multi-Task Learning in Deep Neural Networks for Mandarin-English Code-Mixing Speech Recognition. IEICE Trans. Inf. Syst. 99-D(10): 2554-2557 (2016) - [j53]Anhao Xing, Qingwei Zhao, Yonghong Yan:
Speeding up Deep Neural Networks in Speech Recognition with Piecewise Quantized Sigmoidal Activation Function. IEICE Trans. Inf. Syst. 99-D(10): 2558-2561 (2016) - [j52]Chenglong Ma, Qingwei Zhao, Jielin Pan, Yonghong Yan:
Short Text Classification Based on Distributional Representations of Words. IEICE Trans. Inf. Syst. 99-D(10): 2562-2565 (2016) - [j51]Hang Ren, Yonghong Yan:
Structural Optimization and Online Evolutionary Learning for Spoken Dialog Management. IEEE Signal Process. Lett. 23(7): 1013-1017 (2016) - [j50]Yueyue Na, Yanmeng Guo, Qiang Fu, Yonghong Yan:
Cross Array and Rank-1 MUSIC Algorithm for Acoustic Highway Lane Detection. IEEE Trans. Intell. Transp. Syst. 17(9): 2502-2514 (2016) - [c191]Zhaoqiong Huang, Ge Zhan, Dongwen Ying, Yonghong Yan:
Robust multiple speech source localization using time delay histogram. ICASSP 2016: 3191-3195 - [c190]Ji Xu, Ge Zhang, Yonghong Yan:
Effective utilization of multiple examples in query-by-example spoken term detection. ICASSP 2016: 5440-5444 - [c189]Taisong Li, Jing Wang, Manshu Tu, Yan Zhang, Yonghong Yan:
Enhancing Link Prediction Using Gradient Boosting Features. ICIC (2) 2016: 81-92 - [c188]Xu Li, Ziteng Wang, Xiaofei Wang, Qiang Fu, Yonghong Yan:
Adaptive Group Sparsity for Non-Negative Matrix Factorization with Application to Unsupervised Source Separation. INTERSPEECH 2016: 3349-3353 - [c187]Ziteng Wang, Xu Li, Xiaofei Wang, Qiang Fu, Yonghong Yan:
A DNN-HMM Approach to Non-Negative Matrix Factorization Based Speech Enhancement. INTERSPEECH 2016: 3763-3767 - [c186]Zhaoqiong Huang, Ge Zhan, Dongwen Ying, Ruohua Zhou, Jielin Pan, Yonghong Yan:
Robust multiple speech source localization based on phase difference regression. ISCSLP 2016: 1-5 - [c185]Junfeng Li, Risheng Xia, Qiang Fang, Aijun Li, Yonghong Yan:
Speech intelligibility enhancement in noisy reverberant conditions. ISCSLP 2016: 1-5 - [c184]Ge Zhan, Zhaoqiong Huang, Dongwen Ying, Jielin Pan, Yonghong Yan:
Improvement of mask-based speech source separation using DNN. ISCSLP 2016: 1-5 - [c183]Xu Li, Xiaofei Wang, Qiang Fu, Yonghong Yan:
Dynamic group sparsity for non-negative matrix factorization with application to unsupervised source separation. IWAENC 2016: 1-5 - [c182]Ziteng Wang, Xiaofei Wang, Xu Li, Qiang Fu, Yonghong Yan:
Oracle performance investigation of the ideal masks. IWAENC 2016: 1-5 - [c181]Yike Zhang, Pengyuan Zhang, Ta Li, Yonghong Yan:
An unsupervised vocabulary selection technique for Chinese automatic speech recognition. SLT 2016: 420-425 - [i2]Hang Ren, Weiqun Xu, Yonghong Yan:
Optimizing human-interpretable dialog management policy using Genetic Algorithm. CoRR abs/1605.03915 (2016) - 2015
- [j49]Meixu Song, Jielin Pan, Qingwei Zhao, Yonghong Yan:
Discriminative Pronunciation Modeling Using the MPE Criterion. IEICE Trans. Inf. Syst. 98-D(3): 717-720 (2015) - [j48]Risheng Xia, Junfeng Li, Andrea Primavera, Stefania Cecchi, Yôiti Suzuki, Yonghong Yan:
A Hybrid Approach for Reverberation Simulation. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 98-A(10): 2101-2108 (2015) - [j47]Xiaofei Wang, Yanmeng Guo, Chao Wu, Qiang Fu, Yonghong Yan:
A reverberation robust target speech detection method using dual-microphone in distant-talking scene. Speech Commun. 72: 47-58 (2015) - [j46]Xianliang Wang, Yulong Wan, Lin Yang, Ruohua Zhou, Yonghong Yan:
Phonotactic language recognition using dynamic pronunciation and language branch discriminative information. Speech Commun. 75: 50-61 (2015) - [c180]Zhichao Wang, Xingyu Na, Xin Li, Jielin Pan, Yonghong Yan:
Two-stage ASGD framework for parallel training of DNN acoustic models using Ethernet. ASRU 2015: 59-64 - [c179]Hang Ren, Weiqun Xu, Yonghong Yan:
Optimizing human-interpretable dialog management policy using genetic algorithm. ASRU 2015: 791-797 - [c178]Meixu Song, Qingqing Zhang, Jielin Pan, Yonghong Yan:
Improving HMM/DNN in ASR of under-resourced languages using probabilistic sampling. ChinaSIP 2015: 20-24 - [c177]Xiaofei Wang, Xu Li, Yanmeng Guo, Qiang Fu, Yonghong Yan:
Reverberation robust multi-channel post-filtering using modified signal presence probability. ChinaSIP 2015: 692-696 - [c176]Jia Sun, Peijia Li, Weiqun Xu, Yonghong Yan:
A Shallow Discourse Parsing System Based On Maximum Entropy Model. CoNLL Shared Task 2015: 84-88 - [c175]Yang Liu, Naushin Nower, Yonghong Yan, Masashi Unoki:
Restoration of instantaneous amplitude and phase of speech signal in noisy reverberant environments. EUSIPCO 2015: 879-883 - [c174]Dongwen Ying, Ge Zhan, Zhaoqiong Huang, Yonghong Yan, Fei Li:
A closed-form method of spatial de-aliasing for multiple speech source localization. GlobalSIP 2015: 1205-1209 - [c173]Qingqing Zhang, Yong Liu, Jielin Pan, Yonghong Yan:
Continuous speech recognition based on convolutional neural network. ICDIP 2015: 963121 - [c172]Yasong Jiang, Yuan Huang, Peng Li, Shengxiang Gao, Yan Zhang, Yonghong Yan:
How to Detect Communities in Large Networks. ICIC (1) 2015: 76-84 - [c171]Qianqian Fang, Huaxing Xu, Risheng Xia, Junfeng Li, Yonghong Yan:
Equalization of Sound Reproduction System Based on the Human Perception Characteristics. IIH-MSP 2015: 380-383 - [c170]Ge Zhan, Zhaoqiong Huang, Dongwen Ying, Jielin Pan, Yonghong Yan:
Spectrographic speech mask estimation using the time-frequency correlation of speech presence. INTERSPEECH 2015: 2287-2291 - [c169]Zhaoqiong Huang, Ge Zhan, Dongwen Ying, Yonghong Yan:
Robust localization of single sound source based on phase difference regression. INTERSPEECH 2015: 3293-3297 - [c168]Chenglong Ma, Weiqun Xu, Peijia Li, Yonghong Yan:
Distributional Representations of Words for Short Text Classification. VS@HLT-NAACL 2015: 33-38 - [c167]Peijia Li, Weiqun Xu, Chenglong Ma, Jia Sun, Yonghong Yan:
IOA: Improving SVM Based Sentiment Classification Through Post Processing. SemEval@NAACL-HLT 2015: 545-550 - [c166]Jun Zhou, Zhen Zhang, Bing Wang, Yan Zhang, Yonghong Yan:
Predicting Who Will Retweet or Not in Microblogs Network. SMP 2015: 168-175 - [c165]Yueyue Na, Yanmeng Guo, Qiang Fu, Yonghong Yan:
An Acoustic Traffic Monitoring System: Design and Implementation. UIC/ATC/ScalCom 2015: 119-126 - [i1]Xiaofei Wang, Chao Wu, Pengyuan Zhang, Ziteng Wang, Yong Liu, Xu Li, Qiang Fu, Yonghong Yan:
Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge. CoRR abs/1509.06103 (2015) - 2014
- [j45]Yaohui Qi, Fuping Pan, Fengpei Ge, Qingwei Zhao, Yonghong Yan:
Smoothing Method for Improved Minimum Phone Error Linear Regression. IEICE Trans. Inf. Syst. 97-D(8): 2105-2113 (2014) - [j44]Hai Yang, Yunfei Xu, Houjun Huang, Ruohua Zhou, Yonghong Yan:
Voice biometrics using linear Gaussian model. IET Biom. 3(1): 9-15 (2014) - [j43]Ji Xu, Yeming Xiao, Jielin Pan, Yonghong Yan:
Coalescence Type based Confidence Warping for Agglutinative Language Keyword Spotting. J. Softw. 9(10): 2699-2705 (2014) - [j42]Kaiyu Jiang, Chao Wu, Yanmeng Guo, Qiang Fu, Yonghong Yan:
Acoustic Echo Control with Frequency-Domain Stage-Wise Regression. IEEE Signal Process. Lett. 21(10): 1265-1269 (2014) - [c164]Andrea Primavera, Stefania Cecchi, Francesco Piazza, Junfeng Li, Yonghong Yan:
An efficient time varying hybrid reverberator for room acoustic simulation. ChinaSIP 2014: 217-221 - [c163]Xiaofei Wang, Yanmeng Guo, Qiang Fu, Yonghong Yan:
Reverberation robust two-microphone Target Signal Detection algorithm with coherent interference. ChinaSIP 2014: 237-241 - [c162]Xianliang Wang, Yulong Wan, Lin Yang, Ruohua Zhou, Yonghong Yan:
Language recognition system using language branch discriminative information. ICASSP 2014: 5327-5331 - [c161]Anhao Xing, Xin Jin, Ta Li, Xuyang Wang, Jielin Pan, Yonghong Yan:
Speeding up deep neural networks for speech recognition on ARM Cortex-A series processors. ICNC 2014: 123-127 - [c160]Xuyang Wang, Ta Li, Yeming Xiao, Jielin Pan, Yonghong Yan:
Improved mandarin spoken term detection by using deep neural network for keyword verification. ICNC 2014: 144-148 - [c159]Liming Song, Ming Li, Yonghong Yan:
Melody Extraction for Vocal Polyphonic Music Based on Bayesian Framework. IIH-MSP 2014: 570-573 - [c158]Mengzhe Chen, Qingqing Zhang, Jielin Pan, Yonghong Yan:
Boosted Hybrid DNN/HMM System Based on Correlation-Generated Targets. IIH-MSP 2014: 590-593 - [c157]Xuyang Wang, Ta Li, Pengyuan Zhang, Jielin Pan, Yonghong Yan:
Enhanced Out of Vocabulary Word Detection Using Local Acoustic Information. IIH-MSP 2014: 594-597 - [c156]Xing Yang, Risheng Xia, Zhonghua Fu, Junfeng Li, Yonghong Yan, Shuichi Sakamoto, Yôiti Suzuki:
On the Performance and Robustness of Crosstalk Cancelation with Multiple Loudspeakers. IIH-MSP 2014: 618-621 - [c155]Dongwen Ying, Ruohua Zhou, Junfeng Li, Jielin Pan, Yonghong Yan:
Direction-of-arrival estimation of multiple speakers using a planar array. INTERSPEECH 2014: 2223-2227 - [c154]Chao Wu, Kaiyu Jiang, Yanmeng Guo, Qiang Fu, Yonghong Yan:
A robust step-size control algorithm for frequency domain acoustic echo cancellation. INTERSPEECH 2014: 2819-2823 - [c153]Dongying Liang, Yan Xiao, Yongqiang Feng, Yonghong Yan:
The role of auditory feedback in speech production: Implications for speech perception in the hearing impaired. ISIC 2014: 192-195 - [c152]Hang Ren, Weiqun Xu, Yonghong Yan:
Markovian Discriminative Modeling for Dialog State Tracking. SIGDIAL Conference 2014: 327-331 - [c151]Hang Ren, Weiqun Xu, Yonghong Yan:
Markovian discriminative modeling for cross-domain dialog state tracking. SLT 2014: 342-347 - 2013
- [j41]Junbo Zhang, Fuping Pan, Bin Dong, Qingwei Zhao, Yonghong Yan:
A Novel Discriminative Method for Pronunciation Quality Assessment. IEICE Trans. Inf. Syst. 96-D(5): 1145-1151 (2013) - [j40]Yanling Li, Qingwei Zhao, Yonghong Yan:
Fuzzy Matching of Semantic Class in Chinese Spoken Language Understanding. IEICE Trans. Inf. Syst. 96-D(8): 1845-1852 (2013) - [j39]Hai Yang, Yunfei Xu, Qinwei Zhao, Ruohua Zhou, Yonghong Yan:
Speaker Recognition Using Sparse Probabilistic Linear Discriminant Analysis. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 96-A(10): 1938-1945 (2013) - [j38]Xin Li, Jielin Pan, Qingwei Zhao, Yonghong Yan:
Discriminative Approach to Build Hybrid Vocabulary for Conversational Telephone Speech Recognition of Agglutinative Languages. IEICE Trans. Inf. Syst. 96-D(11): 2478-2482 (2013) - [j37]Yuhong Guo, Qingsheng Yuan, Xuemin Zhao, Jian Liu, Yonghong Yan:
Mixing-attack-proof Randomized Embedding Audio Watermarking System. J. Comput. 8(12): 3243-3250 (2013) - [j36]Zhen Zhang, Ji Xu, Yujing Si, Qingwei Zhao, Yonghong Yan:
Spoken Term Detection Based on Improved Index Structure. J. Softw. 8(11): 2807-2814 (2013) - [j35]Dongwen Ying, Yonghong Yan:
Robust and Fast Localization of Single Speech Source Using a Planar Array. IEEE Signal Process. Lett. 20(9): 909-912 (2013) - [j34]Dongwen Ying, Yonghong Yan:
Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain. IEEE Trans. Speech Audio Process. 21(6): 1145-1157 (2013) - [c150]Dongwen Ying, Junfeng Li, Yongqiang Feng, Yonghong Yan:
Direction of arrival estimation based on weighted minimum mean square error. ChinaSIP 2013: 318-321 - [c149]Junfeng Li, Masato Akagi, Yonghong Yan:
Objective Japanese intelligibility prediction for noisy speech signals before and after noise-reduction processing. ChinaSIP 2013: 352-355 - [c148]Mengzhe Chen, Qingqing Zhang, Zhichao Wang, Jielin Pan, Yonghong Yan:
Web-Based Language Model Domain Adaptation for Real World Voice Retrieval. CIS 2013: 100-104 - [c147]Liming Song, Ming Li, Yonghong Yan:
Automatic Vocal Segments Detection in Popular Music. CIS 2013: 349-352 - [c146]Ji Xu, Yujing Si, Jielin Pan, Yonghong Yan:
Automatic Allophone Deriving for Korean Speech Recognition. CIS 2013: 776-779 - [c145]Junbo Zhang, Bin Dong, Yonghong Yan:
A Computer-Assist Algorithm to Detect Repetitive Stuttering Automatically. IALP 2013: 249-252 - [c144]Junbo Zhang, Fuping Pan, Bin Dong, Yonghong Yan:
A novel discriminative method for pronunciation quality assessment. ICASSP 2013: 8223-8226 - [c143]Andrea Primavera, Stefania Cecchi, Francesco Piazza, Junfeng Li, Yonghong Yan:
Hybrid Reverberator Using Multiple Impulse Responses for Audio Rendering Improvement. IIH-MSP 2013: 314-317 - [c142]Jian Zhang, Risheng Xia, Chundong Xu, Junfeng Li, Yonghong Yan, Shuichi Sakamoto:
Head-Related Transfer Function Modeling Based on Finite-Impulse Response. IIH-MSP 2013: 318-321 - [c141]Junfeng Li, Fei Chen, Masato Akagi, Yonghong Yan:
Comparative investigation of objective speech intelligibility prediction measures for noise-reduced signals in Mandarin and Japanese. INTERSPEECH 2013: 1184-1187 - [c140]Meixu Song, Qingqing Zhang, Jielin Pan, Yonghong Yan:
Discriminative pronunciation modeling based on minimum phone error training. INTERSPEECH 2013: 1941-1945 - [c139]Fei Chen, Junfeng Li, Lena L. N. Wong, Yonghong Yan:
Effect of linguistic masker on the intelligibility of Mandarin sentences. INTERSPEECH 2013: 2099-2102 - [c138]Yujing Si, Qingqing Zhang, Ta Li, Jielin Pan, Yonghong Yan:
Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system. INTERSPEECH 2013: 3419-3423 - [c137]Hang Ren, Weiqun Xu, Yan Zhang, Yonghong Yan:
Dialog State Tracking using Conditional Random Fields. SIGDIAL Conference 2013: 457-461 - 2012
- [j33]Jinchao Yang, Xiang Zhang, Hongbin Suo, Li Lu, Jianping Zhang, Yonghong Yan:
Low-dimensional representation of Gaussian mixture model supervector for language recognition. EURASIP J. Adv. Signal Process. 2012: 47 (2012) - [j32]Jinchao Yang, Xiang Zhang, Hongbin Suo, Li Lu, Jianping Zhang, Yonghong Yan:
Maximum A Posteriori Linear Regression for language recognition. Expert Syst. Appl. 39(4): 4287-4291 (2012) - [j31]Xuemin Zhao, Yuhong Guo, Jian Liu, Yonghong Yan, Qiang Fu:
Logarithmic Adaptive Quantization Projection for Audio Watermarking. IEICE Trans. Inf. Syst. 95-D(5): 1436-1445 (2012) - [j30]Kai Li, Yanmeng Guo, Qiang Fu, Junfeng Li, Yonghong Yan:
Two-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation. IEICE Trans. Inf. Syst. 95-D(5): 1454-1464 (2012) - [j29]Shang Cai, Yeming Xiao, Jielin Pan, Qingwei Zhao, Yonghong Yan:
Noise Robust Feature Scheme for Automatic Speech Recognition Based on Auditory Perceptual Mechanisms. IEICE Trans. Inf. Syst. 95-D(6): 1610-1618 (2012) - [j28]Chunyan Liang, Lin Yang, Qingwei Zhao, Yonghong Yan:
Factor Analysis of Neighborhood-Preserving Embedding for Speaker Verification. IEICE Trans. Inf. Syst. 95-D(10): 2572-2576 (2012) - [j27]Junbo Zhang, Fuping Pan, Bin Dong, Qingwei Zhao, Yonghong Yan:
A Forced Alignment Based Approach for English Passage Reading Assessment. IEICE Trans. Inf. Syst. 95-D(12): 3046-3052 (2012) - [j26]Yali Li, Weiqun Xu, Yonghong Yan:
A Novel Similarity Measure to Induce Semantic Classes and Its Application for Language Model Adaptation in a Dialogue System. J. Comput. Sci. Technol. 27(2): 443-450 (2012) - [c136]Yong Liu, Yeming Xiao, Li Wang, Jielin Pan, Yonghong Yan:
Parallel implementation of neural networks training on graphic processing unit. BMEI 2012: 1571-1574 - [c135]Yujing Si, Ji Xu, Zhen Zhang, Jielin Pan, Yonghong Yan:
An Improved Mandarin Voice Input System Using Recurrent Neural Network Language Model. CIS 2012: 242-246 - [c134]Qingqing Zhang, Shang Cai, Jielin Pan, Yonghong Yan:
Improved acoustic models for Conversational Telephone Speech recognition. FSKD 2012: 1229-1232 - [c133]Yuhong Guo, Ta Li, Yujing Si, Jielin Pan, Yonghong Yan:
Optimized large vocabulary WFST speech recognition system. FSKD 2012: 1243-1247 - [c132]Jinchao Yang, Chunyan Liang, Lin Yang, Hongbin Suo, Junjie Wang, Yonghong Yan:
Factor analysis of Laplacian approach for speaker recognition. ICASSP 2012: 4221-4224 - [c131]Risheng Xia, Junfeng Li, Masato Akagi, Yonghong Yan:
Evaluation of objective intelligibility prediction measures for noise-reduced signals in mandarin. ICASSP 2012: 4465-4468 - [c130]Dongwen Ying, Xugang Lu, Junfeng Li, Yonghong Yan, Jianwu Dang, Frank K. Soong:
Noise estimation using a constrained sequential HMM IN log-spectral domain. ICASSP 2012: 4553-4556 - [c129]Yanmeng Guo, Kai Li, Qiang Fu, Yonghong Yan:
A two-microphone based voice activity detection for distant-talking speech in wide range of direction of arrival. ICASSP 2012: 4901-4904 - [c128]Kai Li, Yanmeng Guo, Qiang Fu, Yonghong Yan:
A two microphone-based approach for speech enhancement in adverse environments. ICCE 2012: 41-42 - [c127]Yanmeng Guo, Kai Li, Qiang Fu, Yonghong Yan:
Target speech detection based on microphone array using inter-channel phase differences. ICCE 2012: 247-248 - [c126]Yujing Si, Ta Li, Shang Cai, Jielin Pan, Yonghong Yan:
Recurrent neural network language model in mandarin voice input system. ICNC 2012: 270-274 - [c125]Chunyan Liang, Jinchao Yang, Lin Yang, Yonghong Yan:
Speaker Verification Using Neighborhood Preserving Embedding. INTERSPEECH 2012: 1560-1563 - [c124]Chunyan Liang, Xiang Zhang, Lin Yang, Yonghong Yan:
Discriminative Decision Function Based Scoring Method in Joint Factor Analysis for Speaker Verification. INTERSPEECH 2012: 1564-1567 - [c123]Yeming Xiao, Zhen Zhang, Shang Cai, Jielin Pan, Yonghong Yan:
A Initial Attempt on Task-Specific Adaptation for Deep Neural Network-based Large Vocabulary Continuous Speech Recognition. INTERSPEECH 2012: 2574-2577 - [c122]Hai Yang, Chunyan Liang, Yunfei Xu, Lin Yang, Yonghong Yan:
Sparse Probabilistic Linear Discriminant Analysis for Speaker Verification. INTERSPEECH 2012: 2658-2661 - [c121]Jian Zhang, Risheng Xia, Zhonghua Fu, Junfeng Li, Yonghong Yan:
A fast two-microphone noise reduction algorithm based on power level ratio for mobile phone. ISCSLP 2012: 206-209 - [c120]Junbo Zhang, Fuping Pan, Yonghong Yan:
Automatic Scoring on English Passage Reading Quality. ICSI (2) 2012: 18-25 - 2011
- [j25]Jie Gao, Qingwei Zhao, Yonghong Yan:
Towards precise and robust automatic synchronization of live speech and its transcripts. Speech Commun. 53(4): 508-523 (2011) - [j24]Dongwen Ying, Yonghong Yan, Jianwu Dang, Frank K. Soong:
Voice Activity Detection Based on an Unsupervised Learning Framework. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2624-2633 (2011) - [c119]Xuemin Zhao, Yuhong Guo, Jian Liu, Yonghong Yan:
Quantization Index Modulation audio watermarking system using a psychoacoustic model. ICICS 2011: 1-4 - [c118]Jinchao Yang, Xiang Zhang, Hongbin Suo, Li Lu, Jianping Zhang, Yonghong Yan:
Language recognition with language total variability. ICCC 2011: 6-9 - [c117]Weiqun Xu, Changchun Bao, Yali Li, Jielin Pan, Yonghong Yan:
Robust understanding of spoken Chinese through character-based tagging and prior knowledge exploitation. ASRU 2011: 413-418 - [c116]Shang Cai, Zhen Zhang, Ta Li, Jielin Pan, Yonghong Yan:
Development of a Chinese song name recognition system. ICNC 2011: 941-945 - [c115]Xuemin Zhao, Yuhong Guo, Jian Liu, Yonghong Yan:
A Spread Spectrum Audio Watermarking System with High Perceptual Quality. CMC 2011: 266-269 - [c114]Ming Li, Xiang Zhang, Yonghong Yan, Shrikanth S. Narayanan:
Speaker Verification Using Sparse Representations on Total Variability i-vectors. INTERSPEECH 2011: 2729-2732 - 2010
- [j23]Haipeng Wang, Xiang Xiao, Xiang Zhang, Jianping Zhang, Yonghong Yan:
A bayesian logistic regression approach to spoken language identification. IEICE Electron. Express 7(6): 390-396 (2010) - [j22]Yanqing Sun, Yu Zhou, Qingwei Zhao, Yonghong Yan:
Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition. IEICE Trans. Inf. Syst. 93-D(9): 2417-2430 (2010) - [j21]Yanqing Sun, Yu Zhou, Qingwei Zhao, Pengyuan Zhang, Fuping Pan, Yonghong Yan:
Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition. IEICE Trans. Inf. Syst. 93-D(9): 2431-2439 (2010) - [j20]Yu Zhou, Junfeng Li, Yanqing Sun, Jianping Zhang, Yonghong Yan, Masato Akagi:
A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features. IEICE Trans. Inf. Syst. 93-D(10): 2813-2821 (2010) - [j19]Qingqing Zhang, Jielin Pan, Yonghong Yan:
Development of a Mandarin-English Bilingual Speech Recognition System with Unified Acoustic Models. J. Inf. Sci. Eng. 26(4): 1491-1507 (2010) - [c113]Yali Li, Weiqun Xu, Yonghong Yan:
Semantic class induction and its application for a Chinese voice search system. CIPS-SIGHAN 2010 - [c112]Yanqing Sun, Jie Gao, Yu Zhou, Fuping Pan, Qingwei Zhao, Yonghong Yan:
TBNR: the ThinkIT Broadcast News speech Recognition system. IWACI 2010: 544-548 - [c111]Yanqing Sun, Qingwei Zhao, Qingqing Zhang, Yu Zhou, Yonghong Yan:
Subset selection for articulatory feature based confidence measures. IWACI 2010: 549-553 - [c110]Jie Gao, Qingwei Zhao, Yonghong Yan:
Automatic Synchronization of live speech and its Transcripts based on a frame-synchronous likelihood ratio test. ICASSP 2010: 1622-1625 - [c109]Xiang Zhang, Haipeng Wang, Xiang Xiao, Jianping Zhang, Yonghong Yan:
Maximum a posteriori linear regression for speaker recognition. ICASSP 2010: 4542-4545 - [c108]Qingqing Zhang, Frank K. Soong, Yao Qian, Zhijie Yan, Jielin Pan, Yonghong Yan:
Improved modeling for F0 generation and V/U decision in HMM-based TTS. ICASSP 2010: 4606-4609 - [c107]Changchun Bao, Yali Li, Ta Li, Jielin Pan, Yonghong Yan:
Robust character based tagging with domain lexical features for Chinese spoken language understanding. ICNC 2010: 3410-3414 - [c106]Kai Li, Qiang Fu, Yonghong Yan:
Speech enhancement using improved generalized sidelobe canceller in frequency domain with multi-channel postfiltering. INTERSPEECH 2010: 973-976 - [c105]Xiang Zhang, Chuan Cao, Lin Yang, Hongbin Suo, Jianping Zhang, Yonghong Yan:
Speaker recognition using the resynthesized speech via spectrum modeling. INTERSPEECH 2010: 2142-2145 - [c104]Junfeng Li, Lin Yang, Yonghong Yan, Duc Thanh Chau, Masato Akagi:
Intelligibility investigation of single-channel noise reduction algorithms for Chinese and Japanese. ISCSLP 2010: 7-11 - [c103]Changliang Liu, Fuping Pan, Fengpei Ge, Bin Dong, Yonghong Yan:
Forward optimal measures for automatic mispronunciation detection. ISCSLP 2010: 80-83 - [c102]Xin Li, Shang Cai, Jielin Pan, Yonghong Yan, Yafei Yang:
Large vocabulary Uyghur continuous speech recognition based on stems and suffixes. ISCSLP 2010: 220-223 - [c101]Huilan Wang, Keliang Zhang, Yali Li, Yonghong Yan:
A new linguistic feature for Automated Essay Scoring. IUCS 2010: 340-344
2000 – 2009
- 2009
- [j18]Xiang Zhang, Hongbin Suo, Qingwei Zhao, Yonghong Yan:
Using a Kind of Novel Phonotactic Information for SVM Based Speaker Recognition. IEICE Trans. Inf. Syst. 92-D(4): 746-749 (2009) - [j17]Chuan Cao, Ming Li, Xiao Wu, Hongbin Suo, Jian Liu, Yonghong Yan:
Automatic Singing Performance Evaluation for Untrained Singers. IEICE Trans. Inf. Syst. 92-D(8): 1596-1600 (2009) - [j16]Changliang Liu, Fuping Pan, Fengpei Ge, Bin Dong, Hongbin Suo, Yonghong Yan:
An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns. IEICE Trans. Inf. Syst. 92-D(9): 1716-1724 (2009) - [j15]Xiang Xiao, Xiang Zhang, Haipeng Wang, Hongbin Suo, Qingwei Zhao, Yonghong Yan:
Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification. IEICE Trans. Inf. Syst. 92-D(9): 1798-1802 (2009) - [c100]Xiang Wang, Jianping Zhang, Yonghong Yan:
Automatic Detection of Pathological Voices Using GMM-MLLR Approach. BMEI 2009: 1-4 - [c99]Xiang Wang, Jianping Zhang, Yonghong Yan:
Automatic Detection of Pathological Voices Using GMM-SVM Method. BMEI 2009: 1-4 - [c98]Yu Zhou, Jianping Zhang, Ling Wang, Yonghong Yan:
Emotion Recognition and Conversion for Mandarin Speech. FSKD (1) 2009: 179-183 - [c97]Jie Gao, Qingwei Zhao, Ran Xu, Yonghong Yan:
Improved Lattice-Based Confidence Measure for Speech Recognition via a Lattice Cutoff Procedure. FSKD (4) 2009: 473-476 - [c96]Ta Li, Weiqun Xu, Jielin Pan, Yonghong Yan:
Improving Automatic Speech Recognizer of Voice Search Using System Combination. FSKD (4) 2009: 477-480 - [c95]Ran Xu, Qingqing Zhang, Jielin Pan, Yonghong Yan:
Investigations to Minimum Phone Error Training in Bilingual Speech Recognition. FSKD (4) 2009: 486-490 - [c94]Li Lu, Fengpei Ge, Ta Li, Qingwei Zhao, Yonghong Yan:
Sample-Based Automatic Dictionary Generation for Keyword Spotting System. FSKD (5) 2009: 505-508 - [c93]Qingqing Zhang, Jielin Pan, Shui-duen Chan, Yonghong Yan:
Nonnative speech recognition based on bilingual model modification. FUZZ-IEEE 2009: 110-114 - [c92]Jingwei Sun, Jing Yang, Jianping Zhang, Yonghong Yan:
Chinese Prosody Structure Prediction Based on Conditional Random Fields. ICNC (3) 2009: 602-606 - [c91]Yu Zhou, Yanqing Sun, Junfeng Li, Jianping Zhang, Yonghong Yan:
Physiologically-inspired feature extraction for emotion recognition. INTERSPEECH 2009: 1975-1978 - [c90]Jie Gao, Qingwei Zhao, Yonghong Yan:
Online detecting end times of spoken utterances for synchronization of live speech and its transcripts. INTERSPEECH 2009: 2115-2118 - [c89]Qingqing Zhang, Jielin Pan, Yonghong Yan:
Tonal articulatory feature for Mandarin and its application to conversational LVCSR. INTERSPEECH 2009: 3007-3010 - [c88]Changliang Liu, Fengpei Ge, Fuping Pan, Bin Dong, Yonghong Yan:
A one-step tone recognition approach using MSD-HMM for continuous speech. INTERSPEECH 2009: 3015-3018 - [c87]Fengpei Ge, Fuping Pan, Changliang Liu, Bin Dong, Shui-duen Chan, Xinhua Zhu, Yonghong Yan:
An SVM-Based Mandarin Pronunciation Quality Assessment System. ISNN (4) 2009: 255-265 - [c86]Qingqing Zhang, Jielin Pan, Shui-duen Chan, Yonghong Yan:
Nonnative Speech Recognition Based on Bilingual Model Modification at State Level. ISNN (4) 2009: 299-309 - [c85]Changliang Liu, Fuping Pan, Fengpei Ge, Bin Dong, Shuiduen Chen, Yonghong Yan:
Dynamic Multiple Pronunciation Incorporation in a Refined Search Space for Reading Miscue Detection. ISNN (4) 2009: 379-389 - [c84]Jie Gao, Qingwei Zhao, Ta Li, Yonghong Yan:
Simultaneous Synchronization of Text and Speech for Broadcast News Subtitling. ISNN (3) 2009: 576-585 - [c83]Haipeng Wang, Xiang Zhang, Hongbin Suo, Qingwei Zhao, Yonghong Yan:
A Novel Fuzzy-Based Automatic Speaker Clustering Algorithm. ISNN (2) 2009: 639-646 - [c82]Ta Li, Changchun Bao, Weiqun Xu, Jielin Pan, Yonghong Yan:
Improving Voice Search Using Forward-Backward LVCSR System Combination. ISNN (4) 2009: 769-777 - [c81]Jie Gao, Yanqing Sun, Hongbin Suo, Qingwei Zhao, Yonghong Yan:
WAPS: An Audio Program Surveillance System for Large Scale Web Data Stream. WISM 2009: 116-128 - 2008
- [j14]Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan:
Using SVM as Back-End Classifier for Language Identification. EURASIP J. Audio Speech Music. Process. 2008 (2008) - [j13]Lin Yang, Jianping Zhang, Jian Shao, Yonghong Yan:
Effects of the Temporal Fine Structure in Different Frequency Bands on Mandarin Tone Perception. IEICE Trans. Inf. Syst. 91-D(2): 371-374 (2008) - [j12]Qingqing Zhang, Jielin Pan, Yang Lin, Jian Shao, Yonghong Yan:
Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval. IEICE Trans. Inf. Syst. 91-D(3): 514-521 (2008) - [j11]Jian Shao, Ta Li, Qingqing Zhang, Qingwei Zhao, Yonghong Yan:
A One-Pass Real-Time Decoder Using Memory-Efficient State Network. IEICE Trans. Inf. Syst. 91-D(3): 529-537 (2008) - [j10]Hongbin Suo, Ming Li, Ping Lu, Yonghong Yan:
Automatic Language Identification with Discriminative Language Characterization Based on SVM. IEICE Trans. Inf. Syst. 91-D(3): 567-575 (2008) - [j9]Xiao Wu, Ming Li, Hongbin Suo, Yonghong Yan:
Melody Track Selection Using Discriminative Language Model. IEICE Trans. Inf. Syst. 91-D(6): 1838-1840 (2008) - [j8]Fengpei Ge, Changliang Liu, Jian Shao, Fuping Pan, Bin Dong, Yonghong Yan:
Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech. IEICE Trans. Inf. Syst. 91-D(10): 2485-2492 (2008) - [j7]Xiang Zhang, Ping Lu, Hongbin Suo, Qingwei Zhao, Yonghong Yan:
Robust Speaker Clustering Using Affinity Propagation. IEICE Trans. Inf. Syst. 91-D(11): 2739-2741 (2008) - [j6]Heng Zhang, Qiang Fu, Yonghong Yan:
Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 91-A(12): 3812-3816 (2008) - [c80]Qingqing Zhang, Jielin Pan, Yonghong Yan:
Mandarin-English bilingual Speech Recognition for real world music retrieval. ICASSP 2008: 4253-4256 - [c79]Xiang Zhang, Jie Gao, Ping Lu, Yonghong Yan:
A novel speaker clustering algorithm via supervised affinity propagation. ICASSP 2008: 4369-4372 - [c78]Fuping Pan, Qingwei Zhao, Yonghong Yan:
Mandarin vowel pronunciation quality evaluation by a novel formant classification method and its combination with traditional algorithms. ICASSP 2008: 5061-5064 - [c77]Jie Gao, Jian Shao, Qingqing Zhang, Qingwei Zhao, Yonghong Yan:
Spoken Term Detection Using Dynamic Match Subword Confusion Network. ICNC (4) 2008: 250-254 - [c76]Ran Xu, Jielin Pan, Yonghong Yan:
Using Discriminative Training Techniques in Practical Intelligent Music Retrieval System. ICNC (4) 2008: 286-290 - [c75]Changliang Liu, Fuping Pan, Fengpei Ge, Bin Dong, Qingwei Zhao, Yonghong Yan:
Application of LVCSR to the Detection of Chinese Mandarin Reading Miscues. ICNC (5) 2008: 447-451 - [c74]Yonghong Yan, Yan Xie, Wei Peng, Yun Zeng:
Wide-Band Low-Noise Quadrature VCO Design. IIH-MSP 2008: 1217-1220 - [c73]Ming Li, Chuan Cao, Di Wang, Ping Lu, Qiang Fu, Yonghong Yan:
Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping. INTERSPEECH 2008: 151-154 - [c72]Heng Zhang, Qiang Fu, Yonghong Yan:
A frequency domain approach for speech enhancement with directionality using compact microphone array. INTERSPEECH 2008: 447-450 - [c71]Changchun Bao, Weiqun Xu, Yonghong Yan:
Recognizing named entities in spoken Chinese dialogues with a character-level maximum entropy tagger. INTERSPEECH 2008: 1145-1148 - [c70]Chuan Cao, Ming Li, Jian Liu, Yonghong Yan:
An objective singing evaluation approach by relating acoustic measurements to perceptual ratings. INTERSPEECH 2008: 2058-2061 - [c69]Jian Shao, Roger Peng Yu, Qingwei Zhao, Yonghong Yan, Frank Seide:
Towards vocabulary-independent speech indexing for large-scale repositories. INTERSPEECH 2008: 2150-2153 - [c68]Qingqing Zhang, Ta Li, Jielin Pan, Yonghong Yan:
Nonnative speech recognition based on state-candidate bilingual model modification. INTERSPEECH 2008: 2366-2369 - [c67]Jie Gao, Xiang Zhang, Qingwei Zhao, Yonghong Yan:
Robust speaker change detection using Kernel-Gaussian model. INTERSPEECH 2008: 2494-2497 - [c66]Fengpei Ge, Fuping Pan, Changliang Liu, Bin Dong, Yonghong Yan:
Forward optimal modeling of acoustic confusions in Mandarin CALL system. INTERSPEECH 2008: 2815-2818 - [c65]Ran Xu, Jielin Pan, Yonghong Yan:
Improved Semi-Parametric Mean Trajectory Model Using Discriminatively Trained Centroids. ISCSLP 2008: 205-208 - [c64]Bin Dong, Yonghong Yan:
A Synchronous Method for Automatic Scoring of Language Learning. ISCSLP 2008: 297-301 - [c63]Changliang Liu, Fuping Pan, Fengpei Ge, Bin Dong, Yonghong Yan:
Using Reference to Tune Language Model for Detection of Reading Miscues. ISCSLP 2008: 302-305 - [c62]Xiang Zhang, Xiang Xiao, Haipeng Wang, Hongbin Suo, Qingwei Zhao, Yonghong Yan:
Speaker Recognition using a Kind of Novel Phonotactic Information. ISCSLP 2008: 330-333 - [c61]Jie Gao, Qingwei Zhao, Yonghong Yan, Jian Shao:
Efficient System Combination for Syllable-Confusion-Network-Based Chinese Spoken Term Detection. ISCSLP 2008: 366-369 - 2007
- [c60]Kun Liu, Jianping Zhang, Yonghong Yan:
High Quality Voice Conversion through Phoneme-Based Linear Mapping Functions with STRAIGHT for Mandarin. FSKD (4) 2007: 410-414 - [c59]Pengyuan Zhang, Qingwei Zhao, Yonghong Yan:
A Spoken Dialogue System Based on Keyword Spotting Technology. HCI (3) 2007: 253-261 - [c58]Wei Wang, Ping Lv, Qingwei Zhao, Yonghong Yan:
A Decision-Tree-Based Online Speaker Clustering. IbPRIA (1) 2007: 555-562 - [c57]Yunfeng Du, Wei Hu, Yonghong Yan, Tao Wang, Yimin Zhang:
Audio Segmentation via Tri-Model Bayesian Information Criterion. ICASSP (1) 2007: 205-208 - [c56]Kun Liu, Zhiwei Shuang, Yong Qin, Jianping Zhang, Yonghong Yan:
Mandarin Accent Analysis Based on Formant Frequencies. ICASSP (4) 2007: 637-640 - [c55]Pengyuan Zhang, Jian Shao, Qingwei Zhao, Yonghong Yan:
Keyword Spotting Based on Syllable Confusion Network. ICNC (2) 2007: 656-659 - [c54]Qingwei Zhao, Yonghong Yan, Jielin Pan, Qiang Fu, Jianping Zhang, Ping Lv, Fuping Pan:
Large Vocabulary Mandarin Continuous Speech Recognition under Noisy Environment. ICNC (2) 2007: 660-664 - [c53]Hongbin Suo, Ming Li, Tantan Liu, Ping Lu, Yonghong Yan:
The Design of Backend Classifiers in PPRLM System for Language Identification. ICNC (1) 2007: 678-682 - [c52]Zhaojie Liu, Jian Shao, Pengyuan Zhang, Qingwei Zhao, Yonghong Yan, Ji Feng:
Real Context Model for Tone Recognition in Mandarin Conversational Telephone Speech. ICNC (2) 2007: 696-699 - [c51]Ming Li, Yun Lei, Xiang Zhang, Jian Liu, Yonghong Yan:
Authentication and Quality Monitoring based on Audio Watermark for Analog AM Shortwave Broadcasting. IIH-MSP 2007: 263-266 - [c50]Fuping Pan, Qingwei Zhao, Yonghong Yan:
Mandarin vowel pronunciation quality evaluation by using formant pattern recognition. INTERSPEECH 2007: 202-205 - [c49]Ming Li, Hongbin Suo, Xiao Wu, Ping Lu, Yonghong Yan:
Spoken language identification using score vector modeling and support vector machine. INTERSPEECH 2007: 350-353 - [c48]Lin Yang, Jianping Zhang, Yonghong Yan:
Contributions of temporal fine structure cues to Chinese speech recognition in cochlear implant simulation. INTERSPEECH 2007: 386-389 - [c47]Jian Shao, Qingwei Zhao, Pengyuan Zhang, Zhaojie Liu, Yonghong Yan:
A fast fuzzy keyword spotting algorithm based on syllable confusion network. INTERSPEECH 2007: 2405-2408 - [c46]Yanmeng Guo, Qian Qian, Yonghong Yan:
Robust voice activity detection based on adaptive sub-band energy sequence analysis and harmonic detection. INTERSPEECH 2007: 2949-2952 - [c45]Chuan Cao, Ming Li, Jian Liu, Yonghong Yan:
Singing Melody Extraction in Polyphonic Music by Harmonic Tracking. ISMIR 2007: 373-374 - [c44]Fuping Pan, Qingwei Zhao, Yonghong Yan:
New Machine Scores and Their Combinations for Automatic Mandarin Phonetic Pronunciation Quality Assessment. KES (1) 2007: 821-830 - 2006
- [c43]Ming Li, Yun Lei, Jian Liu, Yonghong Yan:
A Novel Audio Watermarking in Wavelet Domain. IIH-MSP 2006: 27-32 - [c42]Bin Dong, Qingwei Zhao, Yonghong Yan:
Automatic Scoring of Flat Tongue and Raised Tongue in Computer-assisted Mandarin Learning. ISCSLP 2006 - [c41]Ming Li, Jian Liu, Yonghong Yan:
An Efficient and Robust Approach to Audio ID Identification. ISCSLP 2006 - [c40]Yanmeng Guo, Qiang Fu, Yonghong Yan:
Speech Endpoint Detection Based on Sub-band Energy and Harmonic Structure of Voice. ISCSLP 2006 - [c39]Tantan Liu, Xiaoxing Liu, Yonghong Yan:
Speaker Diarization System Based on GMM and BIC. ISCSLP 2006 - [c38]Fuping Pan, Qingwei Zhao, Yonghong Yan:
Improvements in Tone Pronunciation Scoring for Strongly Accented Mandarin Speech. ISCSLP 2006 - [c37]Jian Shao, Pengyuan Zhang, Jiang Han, Jun Yang, Yonghong Yan:
Syllable Based Audio Search Using Confusion Network Arc as Indexing Unit. ISCSLP 2006 - [c36]Xiao Wu, Ming Li, Jian Liu, Jun Yang, Yonghong Yan:
A Top-down Approach to Melody Match in Pitch Contour for Query by Humming. ISCSLP 2006 - [c35]Heng Zhang, Qiang Fu, Yonghong Yan:
Adaptive Null-Forming Algorithm with Auditory Sub-bands. ISCSLP (Selected Papers) 2006: 248-257 - [c34]Pengyuan Zhang, Jian Shao, Jiang Han, Zhaojie Liu, Yonghong Yan:
Keyword Spotting Based on Phoneme Confusion Matrix. ISCSLP 2006 - 2005
- [c33]Bin Dong, Qingwei Zhao, Yonghong Yan:
Fast confidence measure algorithm for continuous speech recognition. INTERSPEECH 2005: 1457-1460 - 2004
- [j5]Chaojun Liu, Yonghong Yan:
Robust state clustering using phonetic decision trees. Speech Commun. 42(3-4): 391-408 (2004) - [j4]Xintian Wu, Yonghong Yan:
Speaker adaptation using constrained transformation. IEEE Trans. Speech Audio Process. 12(2): 168-174 (2004) - [c32]Chengyi Zheng, Yonghong Yan:
Fusion based speech segmentation in DARPA SPINE2 task. ICASSP (1) 2004: 885-888 - [c31]Bin Dong, Qingwei Zhao, Jianping Zhang, Yonghong Yan:
Automatic assessment of pronunciation quality. ISCSLP 2004: 137-140 - 2003
- [c30]Yonghong Yan, Chengyi Zheng, Jianping Zhang, Jielin Pan, Jiang Han, Jian Liu:
A dynamic cross-reference pruning strategy for multiple feature fusion at decoder run time. INTERSPEECH 2003: 1177-1180 - 2002
- [c29]Chengyi Zheng, Yonghong Yan:
Run time information fusion in speech recognition. INTERSPEECH 2002: 1077-1080 - 2001
- [c28]Xiaoxing Liu, Baosheng Yuan, Yonghong Yan:
A context adaptation approach for building context dependent models in LVCSR. INTERSPEECH 2001: 1237-1240 - 2000
- [c27]Xintian Wu, Yonghong Yan:
Markov Random Field Linear Regression. EUSIPCO 2000: 1-4 - [c26]Xintian Wu, Yonghong Yan:
Linear regression under maximum a posteriori criterion with Markov random field prior. ICASSP 2000: 997-1000 - [c25]Qing Guo, Yonghong Yan, Baosheng Yuan, Xiangdong Zhang, Ying Jia, Xiaoxing Liu:
Vocabulary-based acoustic model trim down and task adaptation. INTERSPEECH 2000: 109-112 - [c24]Ying Jia, Yonghong Yan, Baosheng Yuan:
Dynamic threshold setting via Bayesian information criterion (BIC) in HMM training. INTERSPEECH 2000: 169-171 - [c23]Qingwei Zhao, Zhiwei Lin, Baosheng Yuan, Yonghong Yan:
Improvements in search algorithm for large vocabulary continuous speech recognition. INTERSPEECH 2000: 306-309 - [c22]Jielin Pan, Baosheng Yuan, Yonghong Yan:
Effective vector quantization for a highly compact acoustic model for LVCSR. INTERSPEECH 2000: 318-321 - [c21]Chengyi Zheng, Yonghong Yan:
Efficiently using speaker adaptation data. INTERSPEECH 2000: 358-361 - [c20]Xiaoxing Liu, Baosheng Yuan, Yonghong Yan:
An orthogonal GMM based speaker verification system. INTERSPEECH 2000: 462-465 - [c19]Chaojun Liu, Yonghong Yan:
Speaker change detection using minimum message length criterion. INTERSPEECH 2000: 514-517 - [c18]Jiang Han, Yonghong Yan, Zhiwei Lin, Yong Wang, Jian Liu, Danjun Liu, Zhihui Wang:
Office message center - a spoken dialogue system. INTERSPEECH 2000: 704-706 - [c17]Qing Guo, Yonghong Yan, Zhiwei Lin, Baosheng Yuan, Qingwei Zhao, Jian Liu:
Keyword spotting in auto-attendant system. INTERSPEECH 2000: 1050-1052 - [c16]Yonghong Yan:
Toward Making Speech Part of People's Daily Life. ISCSLP 2000 - [c15]Qing Guo, Yonghong Yan, Zhiwei Lin, Baosheng Yuan, Qingwei Zhao, Jian Liu:
Keyword Spotting in Auto-Attendant System. ISCSLP 2000 - [c14]Ying Jia, Yonghong Yan, Baosheng Yuan, Jian Liu:
Word Error Rate Reduction by Bottom-Up Tone Integration to Chinese Continuous Speech Recognition System. ISCSLP 2000 - [c13]Xiangdong Zhang, Baosheng Yuan, Ying Jia, Lingyun Tuo, Yonghong Yan:
Develop Telephony Speech Recognition Systems for Real-world Application. ISCSLP 2000
1990 – 1999
- 1999
- [j3]Yonghong Yan:
Understanding speech recognition using correlation-generated neural network targets. IEEE Trans. Speech Audio Process. 7(3): 350-352 (1999) - [c12]DongHwa Kim, Chaojun Liu, Xintian Wu, Yonghong Yan:
High accuracy acoustic modeling based on multi-stage decision tree. EUROSPEECH 1999 - [c11]Chaojun Liu, Xintian Wu, Yonghong Yan:
High accuracy acoustic modeling using two-level decision-tree based state-tying. EUROSPEECH 1999: 1703-1706 - [c10]Xintian Wu, Yonghong Yan:
Development of the 1998 OGI-FONIX broadcast news transcription system. EUROSPEECH 1999 - 1998
- [c9]Ronald A. Cole, Stephen Sutton, Yonghong Yan, Pieter J. E. Vermeulen, Mark A. Fanty:
Accessible technology for interactive systems: a new approach to spoken language research. ICASSP 1998: 1037-1040 - [c8]Stephen Sutton, Ronald A. Cole, Jacques de Villiers, Johan Schalkwyk, Pieter J. E. Vermeulen, Michael W. Macon, Yonghong Yan, Edward C. Kaiser, Brian Rundle, Khaldoun Shobaki, John-Paul Hosom, Alexander Kain, Johan Wouters, Dominic W. Massaro, Michael M. Cohen:
Universal speech tools: the CSLU toolkit. ICSLP 1998 - 1997
- [j2]Etienne Barnard, Yonghong Yan:
Toward new language adaptation for language identification. Speech Commun. 21(4): 245-254 (1997) - [c7]Yonghong Yan, Mark A. Fanty, Ronald A. Cole:
Speech recognition using neural networks with forward-backward probability generated targets. ICASSP 1997: 3241-3244 - [c6]Xin Tu, Yonghong Yan, Ronald A. Cole:
Matching training and testing criteria in hybrid speech recognition systems. EUROSPEECH 1997: 1943-1946 - 1996
- [j1]Yonghong Yan, Etienne Barnard, Ronald A. Cole:
Development of an approach to automatic language identification based on phone recognition. Comput. Speech Lang. 10(1): 37-54 (1996) - [c5]Yonghong Yan, Etienne Barnard:
Experiments for an approach to language identification with conversational telephone speech. ICASSP 1996: 789-792 - [c4]Ronald A. Cole, Yonghong Yan, Brian Mak, Mark A. Fanty, Troy Bailey:
The contribution of consonants versus vowels to word recognition in fluent speech. ICASSP 1996: 853-856 - [c3]Ronald A. Cole, Yonghong Yan, Troy Bailey:
The influence of bigram constraints on word recognition by humans: implications for computer speech recognition. ICSLP 1996: 829-832 - 1995
- [c2]Yonghong Yan, Etienne Barnard:
An approach to automatic language identification based on language-dependent phone recognition. ICASSP 1995: 3511-3514 - [c1]Yonghong Yan, Etienne Barnard:
An approach to language identification with enhanced language model. EUROSPEECH 1995: 1351-1354
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-07 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint