default search action

combined dblp search
author search
venue search
publication search

ask others

Search dblp

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

> Home

Publication search results

found 22,741 matches

2023
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/00010L0Y23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/00010L0Y23
Ying Shi, Dong Wang, Lantian Li, Jiqing Han, Shi Yin:
Spot Keywords From Very Noisy and Mixed Speech. INTERSPEECH 2023: 1488-1492
- view
  - electronic edition @ isca-archive.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/interspeech/0001ASMGKFVMMWN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001ASMGKFVMMWN23
Anusha Prakash, Arun Kumar A, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K. V. Vikram, Mano Ranjith Kumar M., Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda N. Sukhadia, Dipti Misra Sharma, Hema A. Murthy, Pushpak Bhattacharyya, Srinivasan Umesh, Rajeev Sangal:
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages. INTERSPEECH 2023: 3683-3684
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001AZ0SR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001AZ0SR23
Roshan Sharma, Siddhant Arora, Kenneth Zheng, Shinji Watanabe, Rita Singh, Bhiksha Raj:
BASS: Block-wise Adaptation for Speech Summarization. INTERSPEECH 2023: 1454-1458
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001BJKGTD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001BJKGTD23
Jesús Villalba, Jonas Borgstrom, Maliha Jahan, Saurabh Kataria, Leibny Paola García, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22. INTERSPEECH 2023: 521-525
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001G23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001G23
Mutian He, Philip N. Garner:
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding. INTERSPEECH 2023: 1109-1113
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001GWS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001GWS23
Ziping Zhao, Tian Gao, Haishuai Wang, Björn W. Schuller:
SWRR: Feature Map Classifier Based on Sliding Window Attention and High-Response Feature Reuse for Multimodal Emotion Recognition. INTERSPEECH 2023: 2433-2437
- view
  - electronic edition @ isca-archive.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/interspeech/0001K23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001K23
Sun-Kyung Lee, Jong-Hwan Kim:
Video Multimodal Emotion Recognition System for Real World Applications. INTERSPEECH 2023: 668-669
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001KKG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001KKG23
Yuan Gong, Sameer Khurana, Leonid Karlinsky, James R. Glass:
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers. INTERSPEECH 2023: 2798-2802
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001KZN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001KZN23
Ruilin Xu, Gurunandan Krishnan, Changxi Zheng, Shree K. Nayar:
Personalized Dereverberation of Speech. INTERSPEECH 2023: 3859-3863
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001MPFP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001MPFP23
Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic:
Streaming Audio-Visual Speech Recognition with Alignment Regularization. INTERSPEECH 2023: 1598-1602
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001SGC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001SGC23
László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya, Tamás Gábor Csapó:
Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks. INTERSPEECH 2023: 1169-1173
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0002S023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0002S023
Hongfu Liu, Mingqian Shi, Ye Wang:
Zero-Shot Automatic Pronunciation Assessment. INTERSPEECH 2023: 1009-1013
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/00040023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/00040023
Minh Tran, Yufeng Yin, Mohammad Soleymani:
Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition. INTERSPEECH 2023: 636-640
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0004023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0004023
Minh Tran, Mohammad Soleymani:
Privacy-preserving Representation Learning for Speech Understanding. INTERSPEECH 2023: 2858-2862
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/00040X23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/00040X23
Ashutosh Pandey, Ke Tan, Buye Xu:
A Simple RNN Model for Lightweight, Low-compute and Low-latency Multichannel Speech Enhancement in the Time Domain. INTERSPEECH 2023: 2478-2482
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0004K0Z023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0004K0Z023
Yong Xu, Vinay Kothapally, Meng Yu, Shixiong Zhang, Dong Yu:
Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation. INTERSPEECH 2023: 5117-5121
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0004Y23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0004Y23
Yi Luo, Jianwei Yu:
FRA-RIR: Fast Random Approximation of the Image-source Method. INTERSPEECH 2023: 3884-3888
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0005LZZZL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0005LZZZL23
Cheng Lu, Hailun Lian, Wenming Zheng, Yuan Zong, Yan Zhao, Sunan Li:
Learning Local to Global Feature Aggregation for Speech Emotion Recognition. INTERSPEECH 2023: 1908-1912
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0006CYTCW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0006CYTCW023
Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Tai-Shih Chi, Hsin-Min Wang, Yu Tsao:
Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features. INTERSPEECH 2023: 5018-5022
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0006PJXFD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0006PJXFD23
Zheng Yuan, Aldo Pastore, Dorina De Jong, Hao Xu, Luciano Fadiga, Alessandro D'Ausilio:
The ART of Conversation: Measuring Phonetic Convergence and Deliberate Imitation in L2-Speech with a Siamese RNN. INTERSPEECH 2023: 132-136
- view
  - electronic edition @ isca-archive.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/interspeech/0008BGDAA0RS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008BGDAA0RS23
Ankit Gupta, Abhijeet Bishnu, Mandar Gogate, Kia Dashtipour, Tughrul Arslan, Ahsan Adeel, Amir Hussain, Tharmalingam Ratnarajah, Mathini Sellathurai:
5G-IoT Cloud based Demonstration of Real-Time Audio-Visual Speech Enhancement for Multimodal Hearing-aids. INTERSPEECH 2023: 686-687
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008ZG023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008ZG023
Rui Liu, Jinhua Zhang, Guanglai Gao, Haizhou Li:
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion. INTERSPEECH 2023: 3999-4003
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0008ZHG023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0008ZHG023
Rui Liu, Haolin Zuo, De Hu, Guanglai Gao, Haizhou Li:
Explicit Intensity Control for Accented Text-to-speech. INTERSPEECH 2023: 22-26
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0009SHC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0009SHC23
Zihan Wu, Neil Scheidwasser-Clow, Karl El Hajal, Milos Cernak:
Speaker Embeddings as Individuality Proxy for Voice Stress Detection. INTERSPEECH 2023: 1838-1842
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0011P23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0011P23
Zhe Liu, Fuchun Peng:
Modeling Dependent Structure for Utterances in ASR Evaluation. INTERSPEECH 2023: 3237-3241
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0024RWLJHW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0024RWLJHW023
Jun Chen, Wei Rao, Zilin Wang, Jiuxin Lin, Yukai Ju, Shulin He, Yannan Wang, Zhiyong Wu:
MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation. INTERSPEECH 2023: 4034-4038
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0039YWG023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0039YWG023
Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, Lei Xie:
Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification. INTERSPEECH 2023: 3994-3998
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0042XZL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0042XZL23
Jie Zhang, Qing-Tian Xu, Qiu-Shi Zhu, Zhen-Hua Ling:
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions. INTERSPEECH 2023: 3117-3121
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0043BBSN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0043BBSN23
Wei Zhou, Eugen Beck, Simon Berger, Ralf Schlüter, Hermann Ney:
RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition. INTERSPEECH 2023: 4094-4098
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0075YLHKC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0075YLHKC23
Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Modeling Approach to Efficient Speech Separation. INTERSPEECH 2023: 3784-3788

skipping 22,711 more matches

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.

Search dblp

Full-text search

Please enter a search query

Author search results

Venue search results

Refine list

Publication search results