![](https://dblp.uni-trier.de./img/logo.320x120.png)
![search dblp search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
default search action
ISCSLP 2012: Kowloon Tong, China
- 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, Kowloon Tong, China, December 5-8, 2012. IEEE 2012, ISBN 978-1-4673-2506-6
- Jia Pan, Cong Liu, Zhiguo Wang, Yu Hu, Hui Jiang:
Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMS in acoustic modeling. 301-305 - Maolin Wang, Shengnan Xiong, Jiayun Li, Ziyu Xiong:
A study on the coarticulation of bi-syllabic words in Chinese. 426-430 - Su Jun Leow, Tze Siong Lau, Alvina Goh, Han Meng Peh, Teck Khim Ng, Sabato Marco Siniscalchi
, Chin-Hui Lee:
A new confidence measure combining Hidden Markov Models and Artificial Neural Networks of phonemes for effective keyword spotting. 112-116 - Zhengqi Wen, Jianhua Tao, Hao Che:
Statistical modification based post-filtering technique for HMM-based speech synthesis. 146-149 - Xin Wang
, Zhen-Hua Ling, Li-Rong Dai:
Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis. 84-87 - Wai-Sum Lee
:
A cross-dialect comparison of vowel dispersion and vowel variability. 25-29 - Feng-Long Xie, Yi-Jian Wu, Frank K. Soong:
Cross validation and Minimum Generation Error for improved model clustering in HMM-based TTS. 60-63 - Ying-Lang Chang, Jen-Tzung Chien
:
Bayesian nonparametric language models. 188-192 - Chenhao Zhang, Thomas Fang Zheng, Ruxin Chen:
Text-Dependent Speaker Recognition with long-term features based on functional data analysis. 340-344 - Xuan Ji, Jing Wang, Hailong He, Jingming Kuang:
The lossless adaptive arithmetic coding based on context for ITU-T G.719 at variable rate. 210-214 - Qiang Wang, Zhiyuan Guo, Gang Liu, Jun Guo:
Boundary-expanding locality sensitive hashing. 358-362 - Wei-Fan Chen, Chin-Kuan Kuo, Yih-Ru Wang, Sin-Horng Chen:
A syllable-based prosody modeling for L1 and L2 English speeches. 281-285 - Kui Wu, Yan Song, Wu Guo, Li-Rong Dai:
Intra-conversation intra-speaker variability compensation for speaker clustering. 330-334 - Liang He, Jia Li:
Discriminant local information distance preserving projection for text-independent speaker recognition. 349-352 - Xingyu Na, Xiang Xie, Jingming Kuang, Yaling He:
An improved tone labeling and prediction method with non-uniform segmentation of F0 contour. 252-255 - Xiaotian Zhang, Yao Qian, Hai Zhao, Frank K. Soong:
Break index labeling of mandarin text via syntactic-to-prosodic tree mapping. 256-260 - Cheng Hsien Lin, Po Kai Huang, Cheng-Yuan Lin, Chih-Chung Kuo:
Effective sentence selection based on phone/model coverage maximization for speaker adaptation in HMM-based speech synthesis. 74-78 - I-Fan Su, Sin-Ting Yeung, Brendan S. Weekes
, Sam-Po Law:
Locus of orthographic facilitation effect in spoken word production: Evidence from cantonese Chinese. 440-444 - Lei Xie, Chenglin Xu, Xiaoxuan Wang:
Prosody-based sentence boundary detection in Chinese broadcast news. 261-265 - Wai-Sum Lee
:
Articulatory and spectral characteristics of Cantonese vowels. 45-49 - Cheng-Yuan Lin, Chien-Hung Huang, Chih-Chung Kuo:
A simple and effective pitch re-estimation method for rich prosody and speaking styles in HMM-based speech synthesis. 286-290 - Zhiyang He, Ping Lv, Wei Li, Ji Wu:
A synchronized pruning composition algorithm of weighted finite state transducers for large vocabulary speech recognition. 11-15 - Chao-Hong Liu
, Chung-Hsien Wu
, David Sarwono:
Alternative hypothesis generation using a weighted kernel feature matrix for ASR substitution error correction. 1-5 - Qinghua Wu, Xiao-Lei Zhang, Ping Lv, Ji Wu:
Perceptual similarity between audio clips and feature selection for its measurement. 387-391 - Hsin-Te Hwang, Yu Tsao
, Hsin-Min Wang
, Yih-Ru Wang, Sin-Horng Chen:
Exploring mutual information for GMM-based spectral conversion. 50-54 - Jinfu Ni, Yoshinori Shiga, Hisashi Kawai, Hideki Kashioka:
Resonance-based spectral deformation in HMM-based speech synthesis. 88-92 - Yong Xu, Wu Guo, Li-Rong Dai:
A hybrid fragment / syllable-based system for improved OOV term detection. 378-382 - Yong Xu, Wu Guo, Shan Su, Li-Rong Dai:
Spoken term detection for OOV terms based on triphone confusion matrix. 98-102 - Maolin Wang, Wei Shi, Ruixian Huang, Ziyu Xiong:
The temporal effect of speaking rate, focus and prosody in Chinese. 445-449 - Ruofei Chen, Cheung-Fat Chan
:
Hierarchical clustering and robust identification for block-based autoregressive speech parameter estimation. 103-107 - Shixiang Lu, Wei Wei, Xiaoyin Fu, Lichun Fan, Bo Xu:
Phrase-based data selection for language model adaptation in spoken language translation. 193-196 - Syu-Siang Wang
, Jeih-Weih Hung, Yu Tsao
:
A study on cepstral sub-band normalization for robust ASR. 141-145 - Chen Zhao, Hongcui Wang, Songgun Hyon, Jianguo Wei, Jianwu Dang:
Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese. 345-348 - Duy Khanh Ninh
, Masanori Morise, Yoichi Yamashita:
Incorporating dynamic features into minimum generation error training for HMM-based speech synthesis. 55-59 - Guo Li, Peggy Mok:
Preliminary study on the interlanguage speech intelligibility benefit for English-Mandarin bilingual l2 learners. 409-412 - Huijun Ding, Tan Lee
, Ing Yann Soon:
Two objective measures for speech distortion and noise reduction evaluation of enhanced speech signals. 117-121 - Tao Jiang, Zhiyong Wu, Jia Jia, Lianhong Cai:
Perceptual clustering based unit selection optimization for concatenative text-to-speech synthesis. 64-68 - Weifeng Li, Qingmin Liao:
Keyword-specific normalization based keyword spotting for spontaneous speech. 233-237 - Siu Wa Lee, Minghui Dong, Haizhou Li
:
A study of F0 modelling and generation with lyrics and shape characterization for singing voice synthesis. 150-154 - Yuguang Wang, Hongcui Wang, Jiaqi Gao, Jianguo Wei, Jianwu Dang:
Detailed morphological analysis of mandarin sustained steady vowels. 413-416 - Chunrong Li, Zhiyong Wu, Fanbo Meng, Helen M. Meng, Lianhong Cai:
Detection and emphatic realization of contrastive word pairs for expressive text-to-speech synthesis. 93-97 - Yan Li, Si Li, Weiran Xu, Jun Guo:
Analyzing semantic orientation of terms using Affinity Propagation. 30-34 - Xixin Wu, Zhiyong Wu, Jia Jia, Lianhong Cai:
Adaptive named entity recognition based on conditional random fields with automatic updated dynamic gazetteers. 363-367 - Van Hai Do, Xiong Xiao, Engsiong Chng
, Haizhou Li
:
Context dependant phone mapping for cross-lingual acoustic modeling. 16-20 - Xiaoyin Fu, Wei Wei, Lichun Fan, Shixiang Lu, Bo Xu:
Nesting hierarchical phrase-based model for speech-to-speech translation. 368-372 - Yu Zou, Yan Wang, Wei He:
Diachronic contrastive analysis on read speech in broadcast news: Evidence from pitch and duration. 291-295 - Masashi Unoki
, Xugang Lu:
Unified denoising and dereverberation method used in restoration of MTF-based power envelope. 215-219 - Xugang Lu, Masashi Unoki
, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Controlling the tradeoff property in a regularization framework for noise reduction. 201-205 - Xugang Lu, Yu Tsao
, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Acoustic space partition based on broad phonetic class for ensemble acoustic modeling. 311-314 - Fei Chen
, Tian Guan, Lena L. N. Wong:
Effects of excitation spread on the intelligibility of Mandarin speech in cochlear implant simulations. 35-39 - Duc Hoang Ha Nguyen, Xiong Xiao, Chng Eng Siong
, Haizhou Li
:
An analysis of vector Taylor series model compensation for non-stationary noise in speech recognition. 131-135 - Wenping Hu, Yao Qian, Frank K. Soong:
Pitch accent detection and prediction with DCT features and CRF model. 266-270 - Dazuo Wang, Xiuxiu Wang, Gang Peng
:
Effects of carriers on Mandarin tone categorical perception. 417-421 - Yao Qian, Frank K. Soong:
A unified trajectory tiling approach to high quality TTS and cross-lingual voice transformation. 165-169 - Pengfei Liu
, Ka-Wa Yuen, Wai-Kim Leung, Helen M. Meng:
mENUNCIATE: Development of a computer-aided pronunciation training system on a cross-platform framework for mobile, speech-enabled application development. 170-173 - Yingying Gao, Weibin Zhu:
How to describe speech emotion more completely - An investigation on Chinese broadcast news speech. 450-453 - Cheung-Chi Leung, Bin Ma, Haizhou Li
:
Phonotactic spoken language recognition: Using diversely adapted acoustic models in parallel phone recognizers. 108-111 - Cuiling Zhang:
Acoustic analysis of disguised voices with raised and lowered pitch. 353-357 - Jian Xu, Zhi-Jie Yan, Qiang Huo:
A comparative study of fMPE and RDLT approaches to LVCSR. 21-24 - Jian Zhang, Risheng Xia, Zhonghua Fu, Junfeng Li, Yonghong Yan:
A fast two-microphone noise reduction algorithm based on power level ratio for mobile phone. 206-209 - Jian Xu, Zhi-Jie Yan, Qiang Huo:
A feature-transform based approach to unsupervised task adaptation and personalization. 229-232 - Kuan-Lang Huang, Tai-Shih Chi:
TDOA information based vad for robust speech recognition in directional and diffuse noise field. 126-130 - Dac-Thang Hoang, Hsiao-Chuan Wang:
A phone segmentation method and its evaluation on Mandarin speech corpus. 373-377 - Mengxue Cao, Aijun Li, Qiang Fang, Jianguo Wei, Chan Song, Jianwu Dang:
Acoustic and articulatory analysis on Japanese vowels in emotional speech. 40-44 - Po-Yi Shih, Bo-Wei Chen, Jhing-Fa Wang, Jhing-Wei Wu:
Enhanced lengthening cancellation using bidirectional pitch similarity alignment for spontaneous speech. 238-242 - Jinfu Ni, Yoshinori Shiga, Hisashi Kawai, Hideki Kashioka:
Experiments on unsupervised statistical parametric speech synthesis. 155-159 - Yasuaki Kanai, Masashi Unoki
:
Robust voice activity detection using empirical mode decomposition and modulation spectrum analysis. 400-404 - Kun Li, Helen M. Meng:
Perceptually-motivated assessment of automatically detected lexical stress in L2 learners' speech. 179-183 - Na Li, Yu Qiao:
Voice conversion using Bayesian mixture of Probabilistic Linear Regressions and dynamic kernel features. 69-73 - Chen-Yu Yang, Georgina Brown, Liang Lu, Junichi Yamagishi, Simon King
:
Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation. 220-223 - Junhong Zhao, Weiqiang Zhang, Hua Yuan, Jia Liu, Shanhong Xia:
Automatic pitch accent detection using auto-context with acoustic features. 247-251 - Xian-Jun Xia, Zhen-Hua Ling, Chen-Yu Yang, Li-Rong Dai:
Improved unit selection speech synthesis method utilizing subjective evaluation results on synthetic speech. 160-164 - Zhanlei Yang, Wenju Liu, Hao Chao:
An improved steady segment based decoding algorithm by using response probability for LVCSR. 306-310 - Yang Li, Xunying Liu, Lan Wang:
Structured modeling based on generalized variable parameter HMMs and speaker adaptation. 136-140 - Wei Rao, Man-Wai Mak
:
Alleviating the small sample-size problem in i-vector based speaker verification. 335-339 - Chen-Yu Chiang, Sabato Marco Siniscalchi
, Yih-Ru Wang, Sin-Horng Chen, Chin-Hui Lee:
A study on cross-language knowledge integration in Mandarin LVCSR. 315-319 - Lichun Fan, Dengfeng Ke, Xiaoyin Fu, Shixiang Lu, Bo Xu:
Power-normalized PLP (PNPLP) feature for robust speech recognition. 224-228 - Jia Jia, Wai-Kim Leung, Ye Tian, Lianhong Cai, Helen M. Meng:
Analysis on mispronunciations in CAPT based on computational speech perception. 174-178 - Ching-feng Yeh, Yiu-Chang Lin, Lin-Shan Lee:
Minimum Phone Error model training on merged acoustic units for transcribing bilingual code-switched speech. 320-324 - Guoli Ye, Brian Mak
:
Speaker-ensemble hidden Markov modeling for automatic speech recognition. 6-10 - Song Wang
, Shen Liu, Jianguo Wei, Qiang Fang, Jianwu Dang:
Reconstruction of vocal tract based on multi-source image information. 396-399 - Ye Tian, Jia Jia, Yongxin Wang, Lianhong Cai:
A real-time tone enhancement method for continuous Mandarin speeches. 405-408 - Chiu-yu Tseng, Chao-yu Su:
Information allocation and prosodic expressiveness in continuous speech: A Mandarin cross-genre analysis. 243-246 - Yi-Chin Huang, Chung-Hsien Wu
, Sz-Ting Weng:
Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis. 79-83 - Chan Song, Jianguo Wei, Qiang Fang, Shen Liu, Yuguang Wang, Jianwu Dang:
Tongue shape synthesis based on Active Shape Model. 383-386 - Hua Yuan, Junhong Zhao, Jia Liu:
Improve mispronunciation detection with Tandem feature. 184-187 - Bin Li
, Rong Rong:
Tones in whispered Mandarin. 422-425 - Ting Zou, Jinsong Zhang
, Wen Cao:
A comparative study of perception of tone 2 and tone 3 in Mandarin by native speakers and Japanese learners. 431-435 - Sagun Dhakhwa, Jens Allwood:
Self documentation of endangered languages. 392-395 - Jun Du, Qiang Huo:
Synthesized stereo-based stochastic mapping with data selection for robust speech recognition. 122-125 - Hongwei Ding, Daniel Hirst:
A preliminary investigation of the third tone sandhi in standard Chinese with a prosodic corpus. 436-439 - Xin Chen, Jian Cheng:
Acoustic modeling for native and non-native Mandarin speech recognition. 325-329 - Yinghao Li, Jinghua Zhang, Jiangping Kong:
The coarticulation resistance of consonants in standard Chinese - An electropalatographic and acoustic study. 454-458 - Aijun Li, Qiang Fang, Yuan Jia, Jianwu Dang:
More targets? Simulating emotional intonation of mandarin with PENTA. 271-275 - Yuan Jia, Aijun Li:
Phonetic realization of accent from Chinese English learners in various dialectal regions. 296-300 - Xinhui Hu, Youzheng Wu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Collecting sentences from web resources for constructing spontaneous Chinese language model. 197-200 - Helen M. Meng:
Welcome message from the conference chair. - Brian Mak, Bin Ma:
Welcome message from the technical program chairs.
![](https://dblp.uni-trier.de./img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.