Search dblp
Full-text search
- case-insensitive prefix search: default
  e.g., sig matches "SIGIR" as well as "signal"
- exact word search: append dollar sign ($) to word
  e.g., graph$ matches "graph", but not "graphics"
- boolean and: separate words by space
  e.g., codd model
- boolean or: connect words by pipe symbol (|)
  e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search results, and search terms preceded by a minus sign will be interpreted as regular (positive) search terms.
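The query rules listed above can be illustrated with a minimal sketch. It assumes a record is matched against the words of a single text field such as its title. The function names (tokenize, term_matches, query_matches) are hypothetical, and this is not dblp's actual implementation. It covers only the prefix, exact word ($), and, and or (|) operators described above.

```python
import re


def tokenize(text):
    """Split text into lowercase words made of letters and digits."""
    return re.findall(r"[a-z0-9]+", text.lower())


def term_matches(term, words):
    """Check one query term against a list of lowercase words."""
    term = term.lower()
    if term.endswith("$"):
        # Exact word search: the term without "$" must equal a whole word.
        return term[:-1] in words
    # Default: case-insensitive prefix search.
    return any(word.startswith(term) for word in words)


def query_matches(query, text):
    """Return True if text satisfies the query (hypothetical helper)."""
    words = tokenize(text)
    # Space-separated groups are combined with boolean and;
    # alternatives joined by "|" inside a group are combined with boolean or.
    return all(
        any(term_matches(alt, words) for alt in group.split("|") if alt)
        for group in query.split()
    )
```

For example, under these assumptions query_matches("graph$", "Computer graphics") is false because no whole word equals "graph", while query_matches("graph|network", "Neural networks") is true because "networks" starts with "network".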
Publication search results
found 22,741 matches
- 2023
- Ying Shi, Dong Wang, Lantian Li, Jiqing Han, Shi Yin:
  Spot Keywords From Very Noisy and Mixed Speech. INTERSPEECH 2023: 1488-1492
- Anusha Prakash, Arun Kumar A, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K. V. Vikram, Mano Ranjith Kumar M., Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda N. Sukhadia, Dipti Misra Sharma, Hema A. Murthy, Pushpak Bhattacharyya, Srinivasan Umesh, Rajeev Sangal:
  Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages. INTERSPEECH 2023: 3683-3684
- Roshan Sharma, Siddhant Arora, Kenneth Zheng, Shinji Watanabe, Rita Singh, Bhiksha Raj:
  BASS: Block-wise Adaptation for Speech Summarization. INTERSPEECH 2023: 1454-1458
- Jesús Villalba, Jonas Borgstrom, Maliha Jahan, Saurabh Kataria, Leibny Paola García, Pedro A. Torres-Carrasquillo, Najim Dehak:
  Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22. INTERSPEECH 2023: 521-525
- Mutian He, Philip N. Garner:
  Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding. INTERSPEECH 2023: 1109-1113
- Ziping Zhao, Tian Gao, Haishuai Wang, Björn W. Schuller:
  SWRR: Feature Map Classifier Based on Sliding Window Attention and High-Response Feature Reuse for Multimodal Emotion Recognition. INTERSPEECH 2023: 2433-2437
- Sun-Kyung Lee, Jong-Hwan Kim:
  Video Multimodal Emotion Recognition System for Real World Applications. INTERSPEECH 2023: 668-669
- Yuan Gong, Sameer Khurana, Leonid Karlinsky, James R. Glass:
  Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers. INTERSPEECH 2023: 2798-2802
- Ruilin Xu, Gurunandan Krishnan, Changxi Zheng, Shree K. Nayar:
  Personalized Dereverberation of Speech. INTERSPEECH 2023: 3859-3863
- Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic:
  Streaming Audio-Visual Speech Recognition with Alignment Regularization. INTERSPEECH 2023: 1598-1602
- László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya, Tamás Gábor Csapó:
  Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks. INTERSPEECH 2023: 1169-1173
- Hongfu Liu, Mingqian Shi, Ye Wang:
  Zero-Shot Automatic Pronunciation Assessment. INTERSPEECH 2023: 1009-1013
- Minh Tran, Yufeng Yin, Mohammad Soleymani:
  Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition. INTERSPEECH 2023: 636-640
- Minh Tran, Mohammad Soleymani:
  Privacy-preserving Representation Learning for Speech Understanding. INTERSPEECH 2023: 2858-2862
- Ashutosh Pandey, Ke Tan, Buye Xu:
  A Simple RNN Model for Lightweight, Low-compute and Low-latency Multichannel Speech Enhancement in the Time Domain. INTERSPEECH 2023: 2478-2482
- Yong Xu, Vinay Kothapally, Meng Yu, Shixiong Zhang, Dong Yu:
  Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation. INTERSPEECH 2023: 5117-5121
- Yi Luo, Jianwei Yu:
  FRA-RIR: Fast Random Approximation of the Image-source Method. INTERSPEECH 2023: 3884-3888
- Cheng Lu, Hailun Lian, Wenming Zheng, Yuan Zong, Yan Zhao, Sunan Li:
  Learning Local to Global Feature Aggregation for Speech Emotion Recognition. INTERSPEECH 2023: 1908-1912
- Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Tai-Shih Chi, Hsin-Min Wang, Yu Tsao:
  Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features. INTERSPEECH 2023: 5018-5022
- Zheng Yuan, Aldo Pastore, Dorina De Jong, Hao Xu, Luciano Fadiga, Alessandro D'Ausilio:
  The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN. INTERSPEECH 2023: 132-136
- Ankit Gupta, Abhijeet Bishnu, Mandar Gogate, Kia Dashtipour, Tughrul Arslan, Ahsan Adeel, Amir Hussain, Tharmalingam Ratnarajah, Mathini Sellathurai:
  5G-IoT Cloud based Demonstration of Real-Time Audio-Visual Speech Enhancement for Multimodal Hearing-aids. INTERSPEECH 2023: 686-687
- Rui Liu, Jinhua Zhang, Guanglai Gao, Haizhou Li:
  Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion. INTERSPEECH 2023: 3999-4003
- Rui Liu, Haolin Zuo, De Hu, Guanglai Gao, Haizhou Li:
  Explicit Intensity Control for Accented Text-to-speech. INTERSPEECH 2023: 22-26
- Zihan Wu, Neil Scheidwasser-Clow, Karl El Hajal, Milos Cernak:
  Speaker Embeddings as Individuality Proxy for Voice Stress Detection. INTERSPEECH 2023: 1838-1842
- Zhe Liu, Fuchun Peng:
  Modeling Dependent Structure for Utterances in ASR Evaluation. INTERSPEECH 2023: 3237-3241
- Jun Chen, Wei Rao, Zilin Wang, Jiuxin Lin, Yukai Ju, Shulin He, Yannan Wang, Zhiyong Wu:
  MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation. INTERSPEECH 2023: 4034-4038
- Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, Lei Xie:
  Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification. INTERSPEECH 2023: 3994-3998
- Jie Zhang, Qing-Tian Xu, Qiu-Shi Zhu, Zhen-Hua Ling:
  BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions. INTERSPEECH 2023: 3117-3121
- Wei Zhou, Eugen Beck, Simon Berger, Ralf Schlüter, Hermann Ney:
  RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition. INTERSPEECH 2023: 4094-4098
- Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
  A Neural State-Space Modeling Approach to Efficient Speech Separation. INTERSPEECH 2023: 3784-3788
skipping 22,711 more matches
retrieved on 2024-10-06 10:05 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license