default search action
11th ISCSLP 2018: Taipei City, Taiwan
- 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018, Taipei City, Taiwan, November 26-29, 2018. IEEE 2018, ISBN 978-1-5386-5627-3
- Haiwei Wu, Ming Li, Zexin Cai, Haibin Zhong:
Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances. 1-5 - Yi-Yang Ding, Ya-Jun Hu, Zhen-Hua Ling:
GTDNN-Based Voice Conversion Using DAEs with Binary Distributed Hidden Units. 1-5 - Haikun Wang, Zhongfu Ye, Jingdong Chen:
A Front-End Speech Enhancement System for Robust Automotive Speech Recognition. 1-5 - Yupeng Shi, Weicong Rong, Nengheng Zheng:
Speech Enhancement using Convolutional Neural Network with Skip Connections. 6-10 - Bin Liu, Jianhua Tao, Yibin Zheng:
A Novel Unified Framework for Speech Enhancement and Bandwidth Extension Based on Jointly Trained Neural Networks. 11-15 - Shih-Kuang Lee, Syu-Siang Wang, Yu Tsao, Jeih-weih Hung:
Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via DiscreteWavelet Transform. 16-20 - Quandong Wang, Sicheng Wang, Fengpei Ge, Chang Woo Han, Jaewon Lee, Lianghao Guo, Chin-Hui Lee:
Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech. 21-25 - Cunhang Fan, Bin Liu, Jianhua Tao, Zhengqi Wen, Jiangyan Yi, Ye Bai:
Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation. 26-30 - Xiaoyong Lu, Yanqin Li, Hongwu Yang:
A Method for Emotional Speech Synthesis Based on Speaker Adaptive Training. 31-35 - Xurong Xie, Xunying Liu, Tan Lee, Lan Wang:
Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion. 36-40 - Weizhao Zhang, Hongwu Yang, Pengpeng Zhi:
Emotional speech synthesis based on DNN and PAD emotional state model. 41-45 - Lijia Chen, Hongwu Yang, Hui Wang:
Research on Dungan speech synthesis based on Deep Neural Network. 46-50 - Wen-Chin Huang, Hsin-Te Hwang, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang:
Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders. 51-55 - Feng-Long Xie, Frank K. Soong, Xi Wang, Lei He, Haifeng Li:
Frame Selection in SI-DNN Phonetic Space with WaveNet Vocoder for Voice Conversion without Parallel Training Data. 56-60 - Yuanyuan Liu, Ying Qin, Siyuan Feng, Tan Lee, P. C. Ching:
Disordered Speech Assessment Using Kullback-Leibler Divergence Features with Multi-Task Acoustic Modeling. 61-65 - Ying Qin, Tan Lee, Yuzhong Wu, Anthony Pak-Hin Kong:
An End-to-End Approach to Automatic Speech Assessment for People with Aphasia. 66-70 - Yahui Shan, Jing Wang, Xiang Xie, Liuchen Meng, Jingming Kuang:
Non-intrusive Speech Quality Assessment Using Deep Belief Network and Backpropagation Neural Network. 71-75 - Xin Wang, Jun Du, Lei Sun, Qing Wang, Chin-Hui Lee:
A Progressive Deep Learning Approach to Child Speech Separation. 76-80 - Jen-Tzung Chien, Kai-Wei Tsou:
Convolutional Neural Turing Machine for Speech Separation. 81-85 - Danyang Liu, Xinxin Wan, Ji Xu, Pengyuan Zhang:
Multilingual Speech Recognition Training and Adaptation with Language-Specific Gate Units. 86-90 - Yuan Jia, Cuiping Li:
Acquisition of English Tense-lax Vowels by Chinese EFL Learners from Different Dialectal Regions. 91-95 - Yuan Jia, Huimin Zhang:
An Acoustic Study of English Monophthongs Acquisition by Chinese EFL Learners from Northeast Region. 96-100 - Yuan Jia, Xinyin Sun:
Chinese EFL Learners' Acquisition of English Monophthongs-A Typological Study of Fuzhou, Ningbo, and Beijing. 101-105 - Bin Li, Yuan Jia:
An Empirical Study of English Vowels Acquisition of EFL Learners in Tianjin and Zibo. 106-110 - Jingyong Hou, Wenping Hu, Frank K. Soong, Lei Xie:
A Refined Query-by-Example Approach to Spoken-Term-Detection on ESL learners' Speech. 111-115 - Wei Wang, Wei Wei, Yanlu Xie, Minghao Guo, Jinsong Zhang:
Improve the Accuracy of Non-native Speech Annotation with a Semi-automatic Approach. 116-120 - Peiyao Sheng, Zhuolin Yang, Hu Hu, Tian Tan, Yanmin Qian:
Data Augmentation using Conditional Generative Adversarial Networks for Robust Speech Recognition. 121-125 - Jie Li, Yahui Shan, Xiaorui Wang, Yan Li:
Improving Gated Recurrent Unit Based Acoustic Modeling with Batch Normalization and Enlarged Context. 126-130 - Yuan-Fu Liao, Matús Pleva, Daniel Hládek, Ján Stas, Peter Viszlay, Martin Lojka, Jozef Juhár:
Gated Module Neural Network for Multilingual Speech Recognition. 131-135 - Lahiru Samarakoon, Brian Mak, Albert Y. S. Lam:
Subspace Based Sequence Discriminative Training of LSTM Acoustic Models with Feed-Forward Layers. 136-140 - Hengguan Huang, Brian Mak:
WaveNet MH-SRU: Deep and Wide Multiple-history Simple Recurrent Unit for Speech Recognition. 141-145 - Zhangyu Xiao, Zhijian Ou, Wei Chu, Hui Lin:
Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units. 146-150 - Srinivas Kantheti, Rohan Kumar Das, Hemant A. Patil:
Combining Phase-based Features for Replay Spoof Detection System. 151-155 - Meng Ge, Longbiao Wang, Seiichi Nakagawa, Yuta Kawakami, Jianwu Dang, Xiangang Li:
Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition. 156-160 - Lei Wang, Fei Chen:
Visual Information Affects Auditory Frequency Discrimination with Random Stimulus Sequences: Evidence from ERPs. 161-164 - Di Zhou, Jinfeng Huang, Jianwu Dang:
Investigation of the Comprehension Process during Silent Reading based on Eye Movements. 165-169 - Ju Lin, Wei Zhang, Linxuan Wei, Yanlu Xie, Jinsong Zhang:
A Multi-modal Soft Targets Approach for Pronunciation Erroneous Tendency Detection. 170-174 - Zhenyu Wang, Qi Zhang, Shuang Zheng, Jinsong Zhang, Yanlu Xie:
A Study on Landmark Verification of Mandarin Alveolar-palatal Consonants. 175-179 - Jinghua Zhong, Helen Meng:
DNN i-vector based Fishervoice and PLDA SVM scoring for NIST SRE 2016. 180-184 - Madhu R. Kamble, Hemant A. Patil:
Novel Amplitude Weighted Frequency Modulation Features for Replay Spoof Detection. 185-189 - Yutian Li, Feng Gao, Zhijian Ou, Jiasong Sun:
Angular Softmax Loss for End-to-end Speaker Verification. 190-194 - Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. 195-199 - Yi Liu, Liang He, Weiwei Liu, Jia Liu:
Exploring a Unified Attention-Based Pooling Framework for Speaker Verification. 200-204 - Yexin Yang, Shuai Wang, Man Sun, Yanmin Qian, Kai Yu:
Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification. 205-209 - Long Zhang, Jia Jia, Fanbo Meng, Suping Zhou, Wei Chen, Cunjun Zhang, Runnan Li:
Emphasis Detection for Voice Dialogue Applications Using Multi-channel Convolutional Bidirectional Long Short-Term Memory Network. 210-214 - Yueheng Li, Biao Luo:
Topic and Prosody Interaction in Chinese Discourse. 215-219 - Xuanda Chen, Yuan Jia, Ziyu Xiong:
Measuring Prosodic Transfer in Vector Space by Weighted Tonal Events. 220-224 - Fang Yu, Chin-Tuan Tan, Fei Chen:
An ERP Study to Evaluate the Quality of Speech Processed by Wiener Filtering. 225-229 - Yongwei Li, Ken-Ichi Sakakibara, Masato Akagi:
Estimation of glottal source waveforms and vocal tract shapes from speech signals based on ARX-LF model. 230-234 - Zexin Cai, Xiaoyi Qin, Danwei Cai, Ming Li, Xinzhong Liu, Haibin Zhong:
The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion. 235-239 - Ivan Fung, Brian Mak:
Multi-Head Attention for End-to-End Neural Machine Translation. 250-254 - Zhaoheng Ni, Rutuja Ubale, Yao Qian, Michael I. Mandel, Su-Youn Yoon, Abhinav Misra, David Suendermann-Oeft:
Unusable Spoken Response Detection with BLSTM Neural Networks. 255-259 - Mu Wang, Zhiyong Wu, Shiyin Kang, Xixin Wu, Jia Jia, Dan Su, Dong Yu, Helen Meng:
Speech Super-Resolution Using Parallel WaveNet. 260-264 - Kun-Yi Huang, Chung-Hsien Wu, Qian-Bei Hong, Ming-Hsiang Su, Yuan-Rong Zeng:
Speech Emotion Recognition using Convolutional Neural Network with Audio Word-based Embedding. 265-269 - Yuan-Fu Liao, Wu-Hua Hsu, Yu-Chen Lin, Yung-Hsiang Shawn Chang, Matús Pleva, Jozef Juhár, Guang-Feng Deng:
Formosa Speech Recognition Challenge 2018: Data, Plan and Baselines. 270-274 - Ye Bai, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Cunhang Fan:
CLMAD: A Chinese Language Model Adaptation Dataset. 275-279 - Yao Qian, Rutuja Ubale, Patrick L. Lange, Keelan Evanini, Frank K. Soong:
From Speech Signals to Semantics - Tagging Performance at Acoustic, Phonetic and Word Levels. 280-284 - Minglu Liu, Miao Li, Ji Wu, Xiangling Fu, Ji Gao:
Using Dempster-Shafer Evidence Theory for Dialog State Tracking. 285-289 - Yuanyuan Liu, Tan Lee, Thomas K. T. Law, Kathy Y. S. Lee, P. C. Ching:
Prediction of Voice Disorder Severity: Contributions from Sustained Vowels and Continuous Speech. 290-294 - Qing Wang, Jun Du, Li Chai, Li-Rong Dai, Chin-Hui Lee:
A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network. 295-299 - Hongcui Wang, Dongxiao He, Jianwu Dang, Xi Liang:
Manifold-based incremental community detection method for online speaker identification. 300-303 - Ruifang Ji, Junhua Cao, Xinyuan Cai, Bo Xu:
Max Margin Cosine Loss for Speaker Identification on Short Utterances. 304-308 - Minxian Zhu, Xiang Xie, Liqiang Zhang, Jing Wang:
Automatic Personality Perception from Speech in Mandarin. 309-313 - Shengyu Yao, Houjun Huang, Ruohua Zhou, Yonghong Yan:
Text-dependent Speaker Verification Using Word-based Scoring. 314-318 - Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li:
End-to-end Language Identification using NetFV and NetVLAD. 319-323 - Meghna Pandharipande, Rupayan Chakraborty, Ashish Panda, Sunil Kumar Kopparapu:
Robust Front-End Processing For Emotion Recognition In Noisy Speech. 324-328 - Meng Liu, Longbiao Wang, Zeyan Oo, Jianwu Dang, Dongbo Li, Seiichi Nakagawa:
Replay Attacks Detection Using Phase and Magnitude Features with Various Frequency Resolutions. 329-333 - Madhu R. Kamble, Hemlata Tak, Maddala Venkata Siva Krishna, Hemant A. Patil:
Novel Demodulation-Based Features using Classifier-level Fusion of GMM and CNN for Replay Detection. 334-338 - Liang Zhang, Aijun Li, Yingyi Luo:
Chinese Causal Relation: Conjunction, Order and Focus-to-Stress Assignment. 339-343 - Tianyu Liang, Xianhong Chen, Can Xu, Liang He:
Parallel Double Audio Fingerprinting. 344-348 - Wei Zhang, Qi Zhang, Yanlu Xie, Jinsong Zhang:
LSTM-Based Pitch Range Estimation from Spectral Information of Brief Speech Input. 349-353 - Yue Sun, Manwa L. Ng, Chongyuan Lian, Lan Wang, Feng Yang, Nan Yan:
Acoustic and Kinematic Examination of Dysarthria in Cantonese Patients of Parkinson's Disease. 354-358 - Sonal Joshi, Ashish Panda, Biswajit Das:
Enhanced Denoising Auto-Encoder for Robust Speech Recognition in Unseen Noise Conditions. 359-363 - Gaofeng Cheng, Lu Huang, Jiasong Sun, Yonghong Yan:
Bidirectional LSTM with Extended Input Context. 364-368 - Wei Zou, Dongwei Jiang, Shuaijiang Zhao, Guilin Yang, Xiangang Li:
Comparable Study Of Modeling Units For End-To-End Mandarin Speech Recognition. 369-373 - Yiyan Wang, Yanhua Long:
Keyword Spotting Based On CTC and RNN For Mandarin Chinese Speech. 374-378 - Long Wu, Li Wang, Pengyuan Zhang, Ta Li, Yonghong Yan:
Space-Time Residual LSTM Architechture for Distant Speech Recognition. 379-383 - Dongwei Jiang, Wei Zou, Shuaijiang Zhao, Guilin Yang, Xiangang Li:
An Analysis of Decoding for Attention-Based End-to-End Mandarin Speech Recognition. 384-388 - Jiarui Wang, Si Ioi Ng, Dehua Tao, Wing Yee Ng, Tan Lee:
A Study on Acoustic Modeling for Child Speech Based on Multi-Task Learning. 389-393 - Dongbo Li, Longbiao Wang, Jianwu Dang, Meng Ge, Haotian Guan:
Distant-talking Speech Recognition Based on Multi-objective Learning using Phase and Magnitude-based Feature. 394-398 - Shuaishuai Ye, Ting Jiang, Shan Qin, Weixia Zou, Chengyun Deng:
Speech Enhancement Based on A New Architecture of Wasserstein Generative Adversarial Networks. 399-403 - Hengshun Zhou, Xue Bai, Jun Du:
An Investigation of Transfer Learning Mechanism for Acoustic Scene Classification. 404-408 - Junhao Ding, Bin Ren, Nengheng Zheng:
Microphone Array Acoustic Source Localization system based on Deep Learning. 409-413 - Chang Liu, Yike Zhang, Pengyuan Zhang, Yaofeng Wang:
Evaluating Modeling Units and Sub-word Features in Language Models for Turkish ASR. 414-418 - Jiyuan Zhang, Dong Wang:
Chinese Poetry Generation with Flexible Styles. 419-423 - Chao-yu Su, Chiu-yu Tseng:
Perceivable information structure in discourse prosody-Detecting prominent prosodic words in spoken discourse using F0 contour. 424-428 - Chunyu Ge, Aijun Li:
Declination and boundary effect in Cantonese declarative sentence. 429-433 - Xinyi Wen, Yuan Jia, Aijun Li:
Interaction of Syntax, Semantics and Pragmatics on Discourse Prosody in Standard Chinese. 434-438 - Wei Zhang, Yanlu Xie, Jinsong Zhang:
A Preliminary Study on Quantitative Calculation of Prosodic Strength in Mandarin Speech. 439-443 - Zhenyu Wang, Jinsong Zhang, Yanlu Xie:
L2 Mispronunciation Verification Based on Acoustic Phone Embedding and Siamese Networks. 444-448 - Lei Liu, Xuemei Zhai, Wentao Gu:
Comparing Mandarin Lexical Stress Produced by Native Speakers and L2 Learners in Hong Kong. 449-453 - Ziyu Xiong, Maolin Wang:
A study on the pitch realization of focus in Chinese. 454-457 - Ziyu Xiong, Maolin Wang:
Effect of Anticipatory Vowel-to-Vowel Coarticulation at Different Prosodic Boundaries in Chinese. 458-462 - Wai-Sum Lee, Yueh-Chin Chang, Feng-fan Hsieh:
Co-articulation between Consonant and Vowel in Cantonese and Taiwanese CVC Syllables. 463-467 - Qian Li, Yingyi Luo, Aijun Li:
Cross-Dialectal Perception of the Third-Tone Sandhi in Standard Chinese - Evidence from Eye Movements. 468-472 - Xin Li, Rene Kager:
An Acoustic Comparison between Two Pairs of Assimilatory and Dissimilatory Tone Sandhi Processes in Nanjing Mandarin in Categoricalness/Gradience. 473-477 - Aijun Li:
Response Acts in Chinese Conversation: the Coding Scheme and Analysis. 478-482 - Jingdong Li, Hui Zhang, Rui Liu, Xueliang Zhang, Feilong Bao:
End-to-End Mongolian Text-to-Speech System. 483-487 - Gan Huang, Lin Zhu, Aijun Li:
Syntactic Structure and Communicative Function of Echo Questions in Chinese Dialogues. 488-492 - Si Ioi Ng, Dehua Tao, Jiarui Wang, Yi Jiang, Wing Yee Ng, Tan Lee:
An Automated Assessment Tool for Child Speech Disorders. 493-494 - Ji-Yan Han, Wei-Zhong Zheng, Ren-Jie Huang, Yu Tsao, Ying-Hui Lai:
Hearing aids APP design based on deep learning technology. 495-496 - Wen-Huei Liao, Pei-Chun Li, Shuenn-Tsong Young, Ying-Hui Lai, Yu Tsao:
IOS-based Ear Scale application for Clinical Audiology and Otology Usage. 497-498
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.