default search action
27th O-COCOSDA 2024: Hsinchu City, Taiwan
- 27th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2024, Hsinchu City, Taiwan, October 17-19, 2024. IEEE 2024, ISBN 979-8-3315-0603-2
- Mika Enomoto, Yuichi Ishimoto, Yasuharu Den:
Modeling Response Relevance using Dialog Act and Utterance-Design Features: A Corpus-Based Analysis. 1-6 - Iram Fatima, Sahar Rauf:
Acoustic Realization of /S/ Across Accents of Urdu. 1-6 - Maolin Wang, Ying Liu, Han Yu, Ziyu Xiong, Qiguang Lin:
Research on the Temporal Effect of Focus on Trisyllabic Sequences in Leizhou Min. 1-6 - Felix Burkhardt, Bagus Tris Atmaja, Anna Derington, Florian Eyben, Björn W. Schuller:
Check Your Audio Data: Nkululeko for Bias Detection. 1-6 - Li-Wei Chen, Hung-Shin Lee, Chen-Chi Chang:
VoxHakka: A Dialectally Diverse Multi-Speaker Text-to-Speech System for Taiwanese Hakka. 1-6 - Pantarat Vichathai, Puchit Bunpleng, Patharapol Laolakkana, Sasiporn Usanavasin, Phondanai Khanti, Kasorn Galajit, Jessada Karnjana:
Speech Watermarking for Tampering Detection Using Singular Spectrum Analysis With Quantization Index Modulation and Psychoacoustic Model. 1-6 - Ausdang Thangthai:
Developing a Thai Name Pronunciation Dictionary from Road Signs and Naming websites. 1-6 - Yi-Hsiang Hung, Yi-Chin Huang:
A Neural Machine Translation System for the Low-Resource Sixian Hakka Language. 1-6 - Yu-Chun Liu, Yi-Cheng Wang, Li-Ting Pai, Jia-Liang Lu, Berlin Chen:
Gated Adapters with Balanced Activation for Effective Contextual Speech Recognition. 1-6 - Li-Jen Yang, Jen-Tzung Chien:
Continual Gated Adapter for Bilingual Codec Text-to-Speech. 1-6 - Pei-Chung Su, Cheng-Hsiu Cho, Chih-Chung Kuo, Yen-Chun Lai, Yan-Ming Lin, Chao-Shih Huang, Yuan-Fu Liao:
A Preliminary Study On End-to-End Multimodal Subtitle Recognition for Taiwanese TV Programs. 1-6 - Anjali Mathew, Raniya, Harsha Sanjan, Amjith S. B, Reni K. Cherian, Starlet Ben Alex, Priyanka Srivastava, Chiranjeevi Yarra:
InStant-EMDB: A Multi Model Spontaneous English and Malayalam Speech Corpora for Depression Detection. 1-6 - Chao-Yang Chang, Yan-Ming Lin, Chih-Chung Kuo, Yen-Chun Lai, Chao-Shih Huang, Yuan-Fu Liao, Tsun-guan Thiann:
A Preliminary Study on Taiwanese POS Taggers: Leveraging Chinese in the Absence of Taiwanese POS Annotation Datasets. 1-6 - Peppina Po-Lun Lee, Mosi He, Bin Li:
An Investigation of Chinese Speech Under Alcohol Influence: Database Construction and Phonetic Analysis. 1-5 - Sayaka Toma, Tomoki Ariga, Yosuke Higuchi, Ichiju Hayasaka, Rie Shigyo, Tetsuji Ogawa:
Exploring Impact of Prioritizing Intra-Singer Acoustic Variations on Singer Embedding Extractor Construction for Singer Verification. 1-6 - Hsin-Li Chang, Enoch Hsin-Ho Huang, Yi-Ching Wang, Yu Tsao:
Using Automatic Speech Recognition for Speech Comprehension Evaluation in the Cochlear Implant. 1-5 - Jong In Kim, Sunhee Kim, Minhwa Chung:
Developing a Robust Mispronunciation Detection by Data Augmentation Based on Automatic Phone Annotation. 1-5 - Chun-Hsiang Wang, Chung-Che Wang, Jun-You Wang, Jyh-Shing Roger Jang, Yen-Hsun Chu:
Improving Real-Time Music Accompaniment Separation with MMDenseNet. 1-6 - Yuki Sato, Yuya Chiba, Ryuichiro Higashinaka:
Effects of Multiple Japanese Datasets for Training Voice Activity Projection Models. 1-6 - Wenze Ren, Kuo-Hsuan Hung, Rong Chao, You-Jin Li, Hsin-Min Wang, Yu Tsao:
Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments With Advanced Post-Processing. 1-6 - Akash Deep, Puja Bharati, Sabyasachi Chandra, Debolina Pramanik, Korra Siva Naik, Shayamal Kumar Das Mandal:
Enhancing Phoneme Recognition in the Bengali Language Through Fine-Tuning of Multilingual Model. 1-5 - Hsiu-Min Shih, Tzu-Mi Lin, Yu-Wen Tzeng, Jung-Ying Chang, Kuo-Kai Shyu, Lung-Hao Lee:
Chinese Psychological Counseling Corpus Construction for Valence-Arousal Sentiment Intensity Prediction. 1-5 - Hao-Chien Lu, Chung-Chun Wang, Jhen-Ke Lin, Berlin Chen:
DEVELOPMENT OF AN ENGLISH ORAL ASSESSMENT SYSTEM WITH THE GEPT DATASET. 1-6 - Meiyun Chen:
Clapping Hands to Word Stress Improves Children's L2 English Pronunciation Accuracy in a Word Imitation Task: Evidence from a Classroom Study. 1-6 - Pattara Tipakasorn, Oatsada Chatthong, Ren Yonehana, Kwanchiva Thangthai:
Comprehensive Benchmarking and Analysis of Open Pretrained Thai Speech Recognition Models. 1-7 - Xingzi Gao, Yujie Gao, Sichang Gao:
The Effectiveness of Audio-Visual Feedback for L2 Chinese Sentence Stress Perception and Production. 1-6 - Yu-Lan Chuang, Hsiu-Ray Hsu, Di Tam Luu, Yi-Wen Liu, Ching-Ting Hsin:
Computer-Assisted Pronunciation Training System for Atayal, an Indigenous Language in Taiwan. 1-6 - Satoshi Tamura, Aristidis de Jesus Ornai:
2024 Country Report Timor Leste. 1-6 - Chen-Han Wu, Kuan-Yu Chen:
An N-Best List Selection Framework for ASR N-Best Rescoring. 1-6 - Dhiya Dewangga, Dessi Puji Lestari, Ayu Purwarianti, Dipta Tanaya, Kurniawati Azizah, Sakriani Sakti:
An Evaluation of Neural Vocoder-Based Voice Cloning System for Dysphonia Speech Disorder. 1-7 - Yu Tsao, Chi-Chun Lee:
Message from the Program Chair. vii - Yi-Hao Jiang, Jia-Hui Li, Jia-Wei Chen, Yi-Chang Wu, Ying-Hui Lai:
Overcoming The Impact of Different Materials on Optical Microphones For Speech Capture Using Deep Learning. 1-5 - Christa Thomas, Guneesh Vats, Aravind Johnson, Ashin George, Talit Sara George, Reni K. Cherian, Priyanka Srivastava, Chiranjeevi Yarra:
IIITSaint-EmoMDB: Carefully Curated Malayalam Speech Corpus with Emotion and Self-Reported Depression Ratings. 1-6 - Aulia Adila, Dessi Puji Lestari, Ayu Purwarianti, Dipta Tanaya, Kurniawati Azizah, Sakriani Sakti:
Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities. 1-6 - Po-Chuan Chen, Mahdin Rohmatillah, You-Teng Lin, Jen-Tzung Chien:
Convcounsel: A Conversational Dataset for Student Counseling. 1-6 - Yuan Jia, Mingshuai Yin:
Comparative Study on the Phonetic Characteristics of Chinese Vowels Between Kyrgyz and Kirgiz Learners. 1-6 - Chang-Shing Lee, Mei-Hui Wang, Guan-Ying Tseng, Chao-Cyuan Yue, Hao-Chun Hsieh, Marek Z. Reformat:
Cao Robot for Taiwanese/English Knowledge Graph Application. 1-6 - Jiewen Zheng, Tianxin Zheng, Mengxue Cao:
CL-Child Corpus: The Phonological Development of Putonghua in Children from Dialect-Speaking Regions. 1-6 - Yuta Kamiya, Shogo Miwa, Atsuhiko Kai:
A Parameter-Efficient Multi-Step Fine-Tuning of Multilingual and Multi-Task Learning Model for Japanese Dialect Speech Recognition. 1-6 - Wai-Sum Lee:
Age-Related and Gender-Related Differences in Cantonese Vowels. 1-6 - Zhanhang Zhang, Sakriani Sakti:
A Feedback-Driven Self-Improvement Strategy and Emotion-Aware Vocoder for Emotional Voice Conversion. 1-6 - Namita Gokavi, Padala Sri Ramulu, Kandregula Nanda Kishore, Sunil Saumya, Deepak K. T:
A DEEP LEARNING BASED APPROACH WITH DATA AUGMENTATION FOR INFANT CRY SOUND VERIFICATION. 1-6 - Yuan-Fu Liao, Hsin-Min Wang:
Oriental COCOSDA - Country Report 2024 Language Resources Developed in Taiwan. 1-6 - Hong-Hsiang Liu, Yi-Wen Liu:
AGENT-DRIVEN LARGE LANGUAGE MODELS FOR MANDARIN LYRIC GENERATION. 1-6 - Aomin, Dahu Baiyila, Aijun Li:
Exploration of Mongolian Word Stress Research Methods Based on Intonation Synthesis Technology. 1-7 - Xiaoyan Zhang, Aijun Li, Zhiqiang Li:
Right-Prominent Trisyllabic Tone Sandhi in Taifeng Chinese. 1-5 - Lokesh Kumar, Kumar Kaustubh, Shashaank Aswatha Mattur, S. R. Mahadeva Prasanna:
Depression Classification Using Log-Mel Spectrograms: A Comparative Analysis of Window Size-Based Data Augmentation and Deep Learning Models. 1-6 - Myat Aye Aye Aung, Hay Mar Soe Naing, Aye Mya Hlaing, Win Pa Pa, Kasorn Galajit, Candy Olivia Mawalim:
Analysis of Pathological Features for Spoof Detection. 1-8 - Kartik Jagtap, Namita Gokavi, Sunil Saumya:
Infant Cry Verification with Multi-View Self-Attention Vision Transformers. 1-6 - Sunil Kumar Kopparapu, Ashish Panda:
Unified Spoken Language Proficiency Assessment System. 1-6 - Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Kohei Yatabe:
Proposal of Protocols for Speech Materials Acquisition and Presentation Assisted By Tools Based on Structured Test Signals. 1-6 - Sakriani Sakti:
The Message of the O-COCOSDA Convenor. v-vi - Shu-Hua Chen, Wei-Ting Huang, Cheng-Hao Lai, Yu-Lun Lin, Ming-Hsiang Su:
Analysis and Discussion of Feature Extraction Technology for Musical Genre Classification. 1-4 - Yih-Liang Shen, Ya-Ching Lai, Tai-Shih Chi:
Multi-Resolution Singing Voice Separation. 1-6 - Wu-Hao Li, Chen-Yu Chiang:
WikiTND24: A Chinese Text Normalization Database. 1-5 - Nathaniel Oco, Kenichiro Kurusu:
2024 Philippine Country Report. 1-6 - Zhe-Jia Xu, Yeou-Jiunn Chen, Qian-Bei Hong:
Multilingual Speech Translator for Medical Consultation. 1-5 - Ying-Lung Lin, Shao-Ying Lu, Ling-Chih Yu:
Benchmarking Clickbait Detection from News Headlines. 1-5 - Hao-Tian Zheng, Berlin Chen:
Improving Speech Recognition by Enhancing Accent Discrimination. 1-6 - Hsin-Te Hwang, Chia-Hua Wu, Ming-Chi Yen, Yu Tsao, Hsin-Min Wang:
Exemplar-Based Methods for Mandarin Electrolaryngeal Speech Voice Conversion. 1-6 - Iqbal Pahlevi Amin, Haotian Tan, Kurniawati Azizah, Sakriani Sakti:
Chunk Size Scheduling for Optimizing the Quality-Latency Trade-off in Simultaneous Speech Translation. 1-6 - Komal Bharti, Pradip K. Das:
Fusion of Multiple Audio Descriptors for the Recognition of Dysarthric Speech. 1-6 - Do Van Hai, Luong Chi Mai:
Updated Activities on Resources Development for Vietnamese Speech and NLP. 1-6 - Ahmad Alfani Handoyo, Chung Tran, Dessi Puji Lestari, Sakriani Sakti:
Indonesian-English Code-Switching Speech Synthesizer Utilizing Multilingual STEN-TTS and Bert LID. 1-6 - Hsuan-Yu Lin, Xuanjun Chen, Jyh-Shing Roger Jang:
Singer Separation for Karaoke Content Generation. 1-5 - Yuan Jia, Linjiao Pan:
A Study on the Acquisition of Triphthong Vowels by Altaic Chinese Learners Under the 'Belt and Road' Initiative. 1-6 - Sumonmas Thatphithakkul, Kwanchiva Thangthai, Vataya Chunwijitra:
The Development of LOTUS-TRD: A Thai Regional Dialect Speech Corpus. 1-6 - Keisuke Kadota, Seima Oyama, Yasuharu Den:
Annotation of Addressing Behavior in Multi-Party Conversation. 1-6 - Anindita Mondal, Anil Kumar Vuppala, Chiranjeevi Yarra:
IIIT-Speech Twins 1.0: An English-Hindi Parallel Speech Corpora for Speech-to-Speech Machine Translation and Automatic Dubbing. 1-6 - Bryan Gautama Ngo, Mahdin Rohmatillah, Jen-Tzung Chien:
Learning Contrastive Emotional Nuances in Speech Synthesis. 1-6 - Yen-Chun Lai, Yi-Jun Zheng, Wen-Han Hsu, Yan-Ming Lin, Cheng-Hsiu Cho, Chlh-Chung Kuo, Chao-Shih Huang, Yuan-Fu Liao:
Construction of Large Language Models for Taigi and Hakka Using Transfer Learning. 1-6 - Mikey Elmers, Koji Inoue, Divesh Lala, Keiko Ochi, Tatsuya Kawahara:
Analysis and Detection of Differences in Spoken User Behaviors Between Autonomous and Wizard-of-Oz Systems. 1-6 - Hay Mar Soe Naing, Win Pa Pa, Aye Mya Hlaing, Myat Aye Aye Aung, Kasorn Galajit, Candy Olivia Mawalim:
UCSYSpoof: A Myanmar Language Dataset for Voice Spoofing Detection. 1-5 - Chen-Chi Chang, Ching-Yuan Chen, Hung-Shin Lee, Chih-Cheng Lee:
Benchmarking Cognitive Domains for LLMS: Insights from Taiwanese Hakka Culture. 1-6 - Bagus Tris Atmaja, Akira Sasou, Felix Burkhardt:
Uncertainty-Based Ensemble Learning for Speech Classification. 1-6 - Geoffrey Tyndall, Kurniawati Azizah, Dipta Tanaya, Ayu Purwarianti, Dessi Puji Lestari, Sakriani Sakti:
Continual Learning in Machine Speech Chain Using Gradient Episodic Memory. 1-6 - Pei-Ying Lee, HauYun Guo, Tien-Hong Lo, Berlin Chen:
Exploring Branchformer-Based End-to-End Speaker Diarization with Speaker-Wise VAD Loss. 1-6
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.