ISCSLP 2006: Singapore - Selected Papers
- Qiang Huo, Bin Ma, Chng Eng Siong, Haizhou Li:
Chinese Spoken Language Processing, 5th International Symposium, ISCSLP 2006, Singapore, December 13-16, 2006, Selected Papers. Lecture Notes in Computer Science 4274, Springer 2006, ISBN 3-540-49665-3
Plenary
- Stephanie Seneff:
Interactive Computer Aids for Acquiring Proficiency in Mandarin. 1-12
- Klaus R. Scherer:
The Affective and Pragmatic Coding of Prosody. 13-14
- Franz Josef Och:
Challenges in Machine Translation. 15
- Tat-Seng Chua:
Automatic Indexing and Retrieval of Large Broadcast News Video Collections - The TRECVID Experience. 16
Tutorial
- Keiichi Tokuda:
An HMM-Based Approach to Flexible Speech Synthesis. 17
- Hang Li:
Text Information Extraction and Retrieval. 18
Topics in Speech Science
- Jiahong Yuan:
Mechanisms of Question Intonation in Mandarin. 19-30
- Wentao Gu, Keikichi Hirose, Hiroya Fujisaki:
Comparison of Perceived Prosodic Boundaries and Global Characteristics of Voice Fundamental Frequency Contours in Mandarin Speech. 31-42
- Shu-Chuan Tseng:
Linguistic Markings of Units in Spontaneous Mandarin. 43-54
- Yuan Jia, Ziyu Xiong, Aijun Li:
Phonetic and Phonological Analysis of Focal Accents of Disyllabic Words in Standard Chinese. 55-66
- Lu Zhang, Yi-Qing Zu, Run-Qiang Yan:
Focus, Lexical Stress and Boundary Tone: Interaction of Three Prosodic Features. 67-75
Speech Analysis
- Dongwen Ying, Yu Shi, Frank K. Soong, Jianwu Dang, Xugang Lu:
A Robust Voice Activity Detection Based on Noise Eigenspace Projection. 76-86
- Jian Liu, Thomas Fang Zheng, Wenhu Wu:
Pitch Mean Based Frequency Warping. 87-94
- Kuang-Ting Sung, Hsiao-Chuan Wang:
A Study of Knowledge-Based Features for Obstruent Detection and Classification in Continuous Mandarin Speech. 95-105
- Yih-Ru Wang:
Speaker-and-Environment Change Detection in Broadcast News Using Maximum Divergence Common Component GMM. 106-115
- Jing Deng, Thomas Fang Zheng, Wenhu Wu:
UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection. 116-125
- Hemant A. Patil, T. K. Basu:
Design of Cubic Spline Wavelet for Open Set Speaker Classification in Marathi. 126-137
Speech Synthesis and Generation
- Min Chu, Yunjia Wang:
Rhythmic Organization of Mandarin Utterances - A Two-Stage Process. 138-148
- Xiaonan Zhang, Jun Xu, Lianhong Cai:
Prosodic Boundary Prediction Based on Maximum Entropy Model with Error-Driven Modification. 149-160
- Heng Kang, Wenju Liu:
Prosodic Words Prediction from Lexicon Words with CRF and TBL Joint Method. 161-168
- Honghui Dong, Jianhua Tao, Bo Xu:
Prosodic Word Prediction Using a Maximum Entropy Approach. 169-178
- Keh-Jiann Chen, Chiu-yu Tseng, Chia-Hung Tai:
Predicting Prosody from Text. 179-188
- Jianhua Tao, Jian Yu, Yongguo Kang:
Nonlinear Emotional Prosody Generation and Annotation. 189-199
- Guohong Fu, Min Zhang, Guodong Zhou, Kang-Kwong Luke:
A Unified Framework for Text Analysis in Chinese TTS. 200-210
- Qiang Fang, Jianwu Dang:
Speech Synthesis Based on a Physiological Articulatory Model. 211-222
- Yao Qian, Frank K. Soong, Yining Chen, Min Chu:
An HMM-Based Mandarin Chinese Text-To-Speech System. 223-232
- Long Qin, Zhen-Hua Ling, Yi-Jian Wu, Bu-Fan Zhang, Ren-Hua Wang:
HMM-Based Emotional Speech Synthesis Using Average Emotion Model. 233-240
- Hsiu-Min Yu, Hsin-Te Hwang, Dong-Yi Lin, Sin-Horng Chen:
A Hakka Text-To-Speech System. 241-247
Speech Enhancement
- Heng Zhang, Qiang Fu, Yonghong Yan:
Adaptive Null-Forming Algorithm with Auditory Sub-bands. 248-257
- Junfeng Li, Masato Akagi, Yôiti Suzuki:
Multi-channel Noise Reduction in Noisy Environments. 258-269
Acoustic Modeling for Automatic Speech Recognition
- Jia-Yu Chen, Chia-Yu Wan, Yi Chen, Berlin Chen, Lin-Shan Lee:
Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task. 270-281
- Linquan Liu, Thomas Fang Zheng, Wenhu Wu:
State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition. 282-293
- Peng Liu, Jian-Lai Zhou, Frank K. Soong:
Non-uniform Kernel Allocation Based Parsimonious HMM. 294-302
- Yiu-Pong Lai, Man-Hung Siu:
Consistent Modeling of the Static and Time-Derivative Cepstrums for Speech Recognition Using HSPTM. 303-314
Robust Speech Recognition
- Xiong Xiao, Haizhou Li, Engsiong Chng:
Vector Autoregressive Model for Missing Feature Reconstruction. 315-324
- Xugang Lu, Jianwu Dang:
Auditory Contrast Spectrum for Robust Speech Recognition. 325-334
- Zhi-Jie Yan, Jian-Lai Zhou, Frank K. Soong, Ren-Hua Wang:
Signal Trajectory Based Noise Compensation for Robust Speech Recognition. 335-345
- Yu Hu, Qiang Huo:
An HMM Compensation Approach Using Unscented Transformation for Noisy Speech Recognition. 346-357
- Jun Du, Peng Liu, Frank K. Soong, Jian-Lai Zhou, Ren-Hua Wang:
Noisy Speech Recognition Performance of Discriminative HMMs. 358-369
- Yih-Ru Wang, Bo-Xuan Lu, Yuan-Fu Liao, Sin-Horng Chen:
Distributed Speech Recognition of Mandarin Digits String. 370-379
Speech Adaptation/Normalization
- Tsz-Chung Lai, Brian Mak:
Unsupervised Speaker Adaptation Using Reference Speaker Weighting. 380-389
- Shih-Sian Cheng, Yeong-Yuh Xu, Hsin-Min Wang, Hsin-Chia Fu:
Automatic Construction of Regression Class Tree for MLLR Via Model-Based Hierarchical Clustering. 390-398
General Topics in Speech Recognition
- Jen-Wei Kuo, Hsin-Min Wang:
A Minimum Boundary Error Framework for Automatic Phonetic Segmentation. 399-409
Large Vocabulary Continuous Speech Recognition
- Yong Qin, Qin Shi, Yi Y. Liu, Hagai Aronowitz, Stephen M. Chu, Hong-Kwang Kuo, Geoffrey Zweig:
Advances in Mandarin Broadcast Speech Transcription at IBM Under the DARPA GALE Program. 410-421
- Yi-Sheng Fu, Yi-Cheng Pan, Lin-Shan Lee:
Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks. 422-434
- Yun Tang, Wenju Liu, Bo Xu:
All-Path Decoding Algorithm for Segmental Based Speech Recognition. 435-444
- Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou, Jiqing Han:
Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone Models. 445-453
- Tzan-Hwei Chen, Berlin Chen, Hsin-Min Wang:
On Using Entropy Information to Improve Posterior Probability-Based Confidence Measures. 454-463
- Quan Vu, Kris Demuynck, Dirk Van Compernolle:
Vietnamese Automatic Speech Recognition: The FLaVoR Approach. 464-474
Multilingual Recognition and Identification
- Dau-Cheng Lyu, Ren-Yuan Lyu, Yuang-Chin Chiang, Chun-Nan Hsu:
Language Identification by Using Syllable-Based Duration Classification on Code-Switching Speech. 475-484
Speaker Recognition and Characterization
- Thomas Fang Zheng, Zhanjiang Song, Lihong Zhang, Michael Brasser, Wei Wu, Jing Deng:
CCC Speaker Recognition Evaluation 2006: Overview, Methods, Data, Results and Perspective. 485-493
- Kong-Aik Lee, Hanwu Sun, Rong Tong, Bin Ma, Minghui Dong, Changhuai You, Donglai Zhu, Chin-Wei Eugene Koh, Lei Wang, Tomi Kinnunen, Chng Eng Siong, Haizhou Li:
The IIR Submission to CSLP 2006 Speaker Recognition Evaluation. 494-505
- Yi-Hsiang Chao, Hsin-Min Wang, Ruei-Chuan Chang:
A Novel Alternative Hypothesis Characterization Using Kernel Classifiers for LLR-Based Speaker Verification. 506-517
- Nengheng Zheng, Ning Wang, Tan Lee, P. C. Ching:
Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract. 518-528
- Carlos E. Vivaracho:
ISCSLP SR Evaluation, UVA-CS_es System Description. A System Based on ANNs. 529-538
- Shingo Kuroiwa, Satoru Tsuge, Masahiko Kita, Fuji Ren:
Evaluation of EMD-Based Speaker Recognition Using ISCSLP2006 Chinese Speaker Recognition Evaluation Corpus. 539-548
- Nengheng Zheng, P. C. Ching, Ning Wang, Tan Lee:
Integrating Complementary Features with a Confidence Measure for Speaker Identification. 549-557
- Hao Yang, Yuan Dong, Xianyu Zhao, Jian Zhao, Haila Wang:
Discriminative Transformation for Sufficient Adaptation in Text-Independent Speaker Verification. 558-565
- Rong Tong, Bin Ma, Kong-Aik Lee, Changhuai You, Donglai Zhu, Tomi Kinnunen, Hanwu Sun, Minghui Dong, Chng Eng Siong, Haizhou Li:
Fusion of Acoustic and Tokenization Features for Speaker Recognition. 566-577
Spoken Language Understanding
- Jui-Feng Yeh, Chung-Hsien Wu, Wei-Yen Wu:
Contextual Maximum Entropy Model for Edit Disfluency Detection of Spontaneous Speech. 578-589
Human Language Acquisition, Development and Learning
- Li Zhang, Chao Huang, Min Chu, Frank K. Soong, Xianda Zhang, Yudong Chen:
Automatic Detection of Tone Mispronunciation in Mandarin. 590-601
- Mitchell Peabody, Stephanie Seneff:
Towards Automatic Tone Correction in Non-native Mandarin. 602-613
Spoken and Multimodal Dialog Systems
- Zhiyong Wu, Helen M. Meng, Hui Ning, Sam C. Tse:
A Corpus-Based Approach for Cooperative Response Generation in a Dialog System. 614-626
- Lei Xie, Helen Meng, Zhi-Qiang Liu:
A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion. 627-639
- Sen Zhang, Yves Laprie:
The Implementation of Service Enabling with Spoken Language of a Multi-modal System Ozone. 640-647
- Bo-June Paul Hsu, James R. Glass:
Spoken Correction for Chinese Text Entry. 648-659
Speech Data Mining and Document Retrieval
- Yi-Ting Chen, Suhan Yu, Hsin-Min Wang, Berlin Chen:
Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models. 660-671
- Manuel Giuliani, Tin Lay Nwe, Haizhou Li:
Meeting Segmentation Using Two-Layer Cascaded Subband Filters. 672-682
- Lin-Shan Lee, Sheng-yi Kong, Yi-Cheng Pan, Yi-Sheng Fu, Yu-tsun Huang, Chien-Chih Wang:
A Multi-layered Summarization System for Multi-media Archives by Understanding and Structuring of Chinese Spoken Documents. 683-692
- Devon Li, Wai Kit Lo, Helen M. Meng:
Initial Experiments on Automatic Story Segmentation in Chinese Spoken Documents Using Lexical Cohesion of Extracted Named Entities. 693-703
Machine Translation of Speech
- Zhendong Yang, Wei Pang, Jinhua Du, Wei Wei, Bo Xu:
Some Improvements in Phrase-Based Statistical Machine Translation. 704-711
- Rile Hu, Xia Wang:
Automatic Spoken Language Translation Template Acquisition Based on Boosting Structure Extraction and Alignment. 712-723
Spoken Language Resources and Annotation
- Yi Liu, Pascale Fung, Yongsheng Yang, Christopher Cieri, Shudong Huang, David Graff:
HKUST/MTS: A Very Large Scale Mandarin Telephone Speech Corpus. 724-735
- Min Chu, Yong Zhao, Yining Chen, Lijuan Wang, Frank K. Soong:
The Paradigm for Creating Multi-lingual Text-To-Speech Voice Databases. 736-747
- Hsi-Chun Hsiao, Hsiu-Min Yu, Yih-Ru Wang, Sin-Horng Chen:
Multilingual Speech Corpora for TTS System Development. 748-759
- Muyun Yang, Hongfei Jiang, Tiejun Zhao, Sheng Li:
Construct Trilingual Parallel Corpus on Demand. 760-767
- Jack Halpern:
The Contribution of Lexical Resources to Natural Language Processing of CJK Languages. 768-780
- Toshiyuki Takezawa:
Multilingual Spoken Language Corpus Development for Communication Research. 781-791
- K. Samudravijaya:
Development of Multi-lingual Spoken Corpora of Indian Languages. 792-801