default search action
21st SPECOM 2019: Istanbul, Turkey
- Albert Ali Salah, Alexey Karpov, Rodmonga Potapova:
Speech and Computer - 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20-25, 2019, Proceedings. Lecture Notes in Computer Science 11658, Springer 2019, ISBN 978-3-030-26060-6 - Odette Scharenborg:
The Representation of Speech and Its Processing in the Human Brain and Deep Neural Networks. 1-8 - Arda Akdemir, Tunga Güngör:
A Detailed Analysis and Improvement of Feature-Based Named Entity Recognition for Turkish. 9-19 - Oleg Akhtiamov, Dmitrii Fedotov, Wolfgang Minker:
A Comparative Study of Classical and Deep Classifiers for Textual Addressee Detection in Human-Human-Machine Conversations. 20-30 - Sergei Astapov, Gleb Svirskiy, Aleksandr Lavrentyev, Tatyana Prisyach, Dmitriy Popov, Dmitriy Ubskiy, Vladimir Kabarov:
Acoustic Event Mixing to Multichannel AMI Data for Distant Speech Recognition and Acoustic Event Classification Benchmarking. 31-42 - Mohammad A. Ateeq, Abualsoud Hanani:
Speech-Based L2 Call System for English Foreign Speakers. 43-53 - Umut Avci, Gamze Akkurt, Devrim Ünay:
A Pattern Mining Approach in Feature Extraction for Emotion Recognition from Speech. 54-63 - Johanna Dobbriner, Oliver Jokisch:
Towards a Dialect Classification in German Speech Samples. 64-74 - Ghania Droua-Hamdani:
Classification of Regional Accent Using Speech Rhythm Metrics. 75-81 - Kamil Ekstein:
PocketEAR: An Assistive Sound Classification System for Hearing-Impaired. 82-92 - Dmitrii Fedotov, Bobae Kim, Alexey Karpov, Wolfgang Minker:
Time-Continuous Emotion Recognition Using Spectrogram Based CNN-RNN Modelling. 93-102 - Olga V. Frolova, Viktor Gorodnyi, Aleksandr Nikolaev, Aleksey Grigorev, Severin Grechanyi, Elena E. Lyakso:
Developmental Disorders Manifestation in the Characteristics of the Child's Voice and Speech: Perceptual and Acoustic Study. 103-112 - Lenar Gabdrakhmanov, Rustem Garaev, Evgenii Razinkov:
RUSLAN: Russian Spoken Language Corpus for Speech Synthesis. 113-121 - Gábor Gosztolya, András Beke, Tilda Neuberger:
Differentiating Laughter Types via HMM/DNN and Probabilistic Sampling. 122-132 - Fernando García-Granada, Emilio Sanchis, María José Castro Bleda, José-Ángel González, Lluís-F. Hurtado:
Word Discovering in Low-Resources Languages Through Cross-Lingual Phonemes. 133-141 - Ivan Gruber, Miroslav Hlavác, Marek Hrúz, Milos Zelezný:
Semantic Segmentation of Historical Documents via Fully-Convolutional Neural Network. 142-149 - Mahfoud Hamidia, Abderrahmane Amrouche:
A New Approach of Adaptive Filtering Updating for Acoustic Echo Cancellation. 150-159 - Injy Hamed, Moritz Zhu, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu:
Code-Switching Language Modeling with Bilingual Word Embeddings: A Case Study for Egyptian Arabic-English. 160-170 - Marek Hrúz, Petr Salajka, Ivan Gruber, Miroslav Hlavác:
Identity Extraction from Clusters of Multi-modal Observations. 171-179 - Oliver Jokisch, Ingo Siegert, Michael Maruschke, Tilo Strutz, Andrey Ronzhin:
Don't Talk to Noisy Drones - Acoustic Interaction with Unmanned Aerial Vehicles. 180-190 - Ildar Kagirov, Dmitry Ryumin, Alexandr Axyonov:
Method for Multimodal Recognition of One-Handed Sign Language Gestures Through 3D Convolution and LSTM Neural Networks. 191-200 - Arman Kaliyev:
LSTM-Based Kazakh Speech Synthesis. 201-208 - Jakub Kanis, Zdenek Krnoul, Marek Hrúz:
Combination of Positions and Angles for Hand Pose Estimation. 209-218 - Irina S. Kipyatkova:
LSTM-Based Language Models for Very Large Vocabulary Continuous Russian Speech Recognition System. 219-226 - Uliana E. Kochetkova:
Svarabhakti Vowel Occurrence and Duration in Rhotic Clusters in French Lyric Singing. 227-236 - Evgeny Kostuchenko, Dariya Novokhrestova, Marina Tirskaya, Alexander Alexandrovich Shelupanov, Mikhail Nemirovich-Danchenko, Evgeny L. Choynzonov, Lidiya N. Balatskaya:
The Evaluation Process Automation of Phrase and Word Intelligibility Using Speech Recognition Systems. 237-246 - Marie Kunesová, Marek Hrúz, Zbynek Zajíc, Vlasta Radová:
Detection of Overlapping Speech for the Purposes of Speaker Diarization. 247-257 - Ludwig Kürzinger, Tobias Watzel, Lujun Li, Robert Baumgartner, Gerhard Rigoll:
Exploring Hybrid CTC/Attention End-to-End Speech Recognition with Gaussian Processes. 258-269 - Dmitriy Levonevskiy, Dmitrii Malov, Irina V. Vatamaniuk:
Estimating Aggressiveness of Russian Texts by Means of Machine Learning. 270-279 - Boris Lobanov, Vladimir Zhitko:
Software Subsystem Analysis of Prosodic Signs of Emotional Intonation. 280-288 - José Vicente Egas López, László Tóth, Ildikó Hoffmann, János Kálmán, Magdolna Pákáski, Gábor Gosztolya:
Assessing Alzheimer's Disease from Speech Using the i-vector Approach. 289-298 - Elena E. Lyakso, Olga V. Frolova, Arman Kaliyev, Viktor Gorodnyi, Aleksey Grigorev, Yuri N. Matveev:
AD-Child.Ru: Speech Corpus for Russian Children with Atypical Development. 299-308 - Lyes Demri, Leila Falek, Hocine Teffahi:
Building a Pronunciation Dictionary for the Kabyle Language. 309-316 - Eman Mansour, Rand Sandouka, Dima Jaber, Abualsoud Hanani:
Speech-Based Automatic Assessment of Question Making Skill in L2 Language. 317-326 - Maxim Markitantov, Oxana Verkholyak:
Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks. 327-336 - Nikita Markovnikov, Irina S. Kipyatkova:
Investigating Joint CTC-Attention Models for End-to-End Russian Speech Recognition. 337-347 - Polina Panicheva, Olga Litvinova, Tatiana Litvinova:
Author Clustering with and Without Topical Features. 348-358 - Evgeny Kostuchenko, Dariya Novokhrestova, Svetlana Pekarskikh, Alexander Alexandrovich Shelupanov, Mikhail Nemirovich-Danchenko, Evgeny L. Choynzonov, Lidiya N. Balatskaya:
Assessment of Syllable Intelligibility Based on Convolutional Neural Networks for Speech Rehabilitation After Speech Organs Surgical Interventions. 359-369 - Velka Popova, Dimitar Popov:
Corpus Study of Early Bulgarian Onomatopoeias in the Terms of CHILDES. 370-380 - Rodmonga Potapova, Vsevolod Potapov, Nataliya Lebedeva, Ekaterina Karimova, Nikolay Bobrov:
EEG Investigation of Brain Bioelectrical Activity (Regarding Perception of Multimodal Polycode Internet Discourse). 381-391 - Rodmonga Potapova, Vsevolod Potapov, Liliya Komalova, Andrey Dzhunkovskiy:
Some Peculiarities of Internet Multimodal Polycode Corpora Annotation. 392-400 - Daria Pozdeeva, Tatiana Y. Shevchenko, Alexey Abyzov:
New Perspectives on Canadian English Digital Identity Based on Word Stress Patterns in Lexicon and Spoken Corpus. 401-413 - Nuzhah Gooda Sahib-Kaudeer, Baby Gobin-Rahimbux, Bibi Saamiyah Bahsu, Maryam Farheen Aasiyah Maghoo:
Automatic Speech Recognition for Kreol Morisien: A Case Study for the Health Domain. 414-422 - Meysam Shamsi, Damien Lolive, Nelly Barbot, Jonathan Chevelu:
Script Selection Using Convolutional Auto-encoder for TTS Speech Corpus. 423-432 - Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova, Olga Blinova, Gregory Y. Martynenko:
Pragmatic Markers Distribution in Russian Everyday Speech: Frequency Lists and Other Statistics for Discourse Modeling. 433-443 - Jakub Sido, Miloslav Konopík:
Curriculum Learning in Sentiment Analysis. 444-450 - Tatiana Shevchenko, Tatiana Sokoreva:
First Minute Timing in American Telephone Talks: A Cognitive Approach. 451-458 - Anton Stepikhov, Anastassia Loukina, Natella Stepikhova:
Syntactic Segmentation of Spontaneous Speech: Psychological and Cognitive Aspects. 459-470 - Mikhail Stolbov, Quan Trong The:
Dual-Microphone Speech Enhancement System Attenuating both Coherent and Diffuse Background Noise. 471-480 - László Tóth, Gábor Gosztolya:
Reducing the Inter-speaker Variance of CNN Acoustic Models Using Unsupervised Adversarial Multi-task Training. 481-490 - Teruki Toya, Peter Birkholz, Masashi Unoki:
Estimates of Transmission Characteristics Related to Perception of Bone-Conducted Speech Using Real Utterances and Transcutaneous Vibration on Larynx. 491-500 - Liliya Tsirulnik, Shlomo Dubnov:
Singing Voice Database. 501-509 - Vasilisa Verkhodanova, Sanne Timmermans, Matt Coler, Roel Jonkers, Bauke M. de Jong, Wander Lowie:
How Dysarthric Prosody Impacts Naïve Listeners' Recognition. 510-519 - Marina Volkova, Andzhukaev Tseren, Galina Lavrentyeva, Sergey Novoselov, Alexander Kozlov:
Light CNN Architecture Enhancement for Different Types Spoofing Attack Detection. 520-529 - Tobias Watzel, Lujun Li, Ludwig Kürzinger, Gerhard Rigoll:
Deep Neural Network Quantizers Outperforming Continuous Speech Recognition Systems. 530-539 - Jianguo Yu, Konstantin Markov, Alexey Karpov:
Speaking Style Based Apparent Personality Recognition. 540-548 - Zbynek Zajíc, Josef V. Psutka, Lucie Zajícová, Ludek Müller, Petr Salajka:
Diarization of the Language Consulting Center Telephone Calls. 549-558 - Jan Zelinka, Jakub Kanis, Petr Salajka:
NN-Based Czech Sign Language Synthesis. 559-568 - Aleksandar Zivanovic, Vlado Delic, Sinisa Suzic, Ivana Sokolovac, Maja Markovic:
Re-evaluation of Words Used in Speech Audiometry. 569-577
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.