Rita Singh
2020 – today
- 2024
- [j8]Fan Yang, Muqiao Yang, Xiang Li, Yuxuan Wu, Zhiyuan Zhao, Bhiksha Raj, Rita Singh:
A closer look at reinforcement learning-based automatic speech recognition. Comput. Speech Lang. 87: 101641 (2024) - [c102]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj:
QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition. CVPR 2024: 3402-3413 - [c101]Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazaki, Hao Chen, Xiaonan Huang, Bhiksha Raj:
R2-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations. ECCV (9) 2024: 211-230 - [c100]Jiayi Zhang, Rita Singh:
Vocal Fold Dynamics for Automatic Detection of Amyotrophic Lateral Sclerosis from Voice. ICASSP 2024: 311-315 - [c99]Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. ICASSP 2024: 371-375 - [c98]Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj:
Importance of Negative Sampling in Weak Label Learning. ICASSP 2024: 7530-7534 - [c97]Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Prompting Audios Using Acoustic Properties for Emotion Representation. ICASSP 2024: 11936-11940 - [c96]Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
A General Framework for Learning from Weak Supervision. ICML 2024 - [c95]Xiang Li, Yinpeng Chen, Chung-Ching Lin, Hao Chen, Kai Hu, Rita Singh, Bhiksha Raj, Lijuan Wang, Zicheng Liu:
Completing Visual Objects via Bridging Generation and Segmentation. ICML 2024 - [c94]Roshan Sharma, Ruchira Sharma, Hira Dhamyal, Rita Singh, Bhiksha Raj:
R-BASS: Relevance-aided Block-wise Adaptation for Speech Summarization. NAACL-HLT (Findings) 2024: 848-857 - [i60]Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang:
PAM: Prompting Audio-Language Models for Audio Quality Assessment. CoRR abs/2402.00282 (2024) - [i59]Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
A General Framework for Learning from Weak Supervision. CoRR abs/2402.01922 (2024) - [i58]Soham Deshmukh, Rita Singh, Bhiksha Raj:
Domain Adaptation for Contrastive Audio-Language Models. CoRR abs/2402.09585 (2024) - [i57]Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazaki, Hao Chen, Xiaonan Huang, Bhiksha Raj:
R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations. CoRR abs/2403.04924 (2024) - [i56]Xiang Li, Kai Qiu, Hao Chen, Jason Kuen, Zhe Lin, Rita Singh, Bhiksha Raj:
ControlVAR: Exploring Controllable Visual Autoregressive Modeling. CoRR abs/2406.09750 (2024) - [i55]Ying Song, Rita Singh, Balaji Palanisamy:
Krait: A Backdoor Attack Against Graph Prompt Tuning. CoRR abs/2407.13068 (2024) - [i54]Hazim T. Bukhari, Soham Deshmukh, Hira Dhamyal, Bhiksha Raj, Rita Singh:
SELM: Enhancing Speech Emotion Recognition for Out-of-Domain Scenarios. CoRR abs/2407.15300 (2024) - [i53]Soham Deshmukh, Shuo Han, Hazim T. Bukhari, Benjamin Elizalde, Hannes Gamper, Rita Singh, Bhiksha Raj:
Audio Entailment: Assessing Deductive Reasoning for Audio Understanding. CoRR abs/2407.18062 (2024) - [i52]Roshan S. Sharma, Suwon Shon, Mark Lindsey, Hira Dhamyal, Rita Singh, Bhiksha Raj:
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? CoRR abs/2408.07277 (2024) - [i51]Massa Baali, Abdulhamid Aldoobi, Hira Dhamyal, Rita Singh, Bhiksha Raj:
PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification. CoRR abs/2409.05799 (2024) - [i50]Ksheeraja Raghavan, Samiran Gode, Ankit Shah, Surabhi Raghavan, Wolfram Burgard, Bhiksha Raj, Rita Singh:
Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection. CoRR abs/2410.03904 (2024) - [i49]Satvik Dixit, Massa Baali, Rita Singh, Bhiksha Raj:
Improving Speaker Representations Using Contrastive Losses on Multi-scale Features. CoRR abs/2410.05037 (2024) - [i48]Hira Dhamyal, Rita Singh:
Objective Measurements of Voice Quality. CoRR abs/2410.09578 (2024) - [i47]Abdul Waheed, Hanin Atwany, Bhiksha Raj, Rita Singh:
What Do Speech Foundation Models Not Learn About Speech? CoRR abs/2410.12948 (2024) - 2023
- [j7]Rita Singh:
A Gene-Based Algorithm for Identifying Factors That May Affect a Speaker's Voice. Entropy 25(6): 897 (2023) - [j6]Wayne Zhao, Rita Singh:
Deriving Vocal Fold Oscillation Information from Recorded Voice Signals Using Models of Phonation. Entropy 25(7): 1039 (2023) - [j5]Weiyang Liu, Yandong Wen, Bhiksha Raj, Rita Singh, Adrian Weller:
SphereFace Revived: Unifying Hyperspherical Face Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 2458-2474 (2023) - [c93]Roshan S. Sharma, William Chen, Takatomo Kano, Ruchira Sharma, Siddhant Arora, Shinji Watanabe, Atsunori Ogawa, Marc Delcroix, Rita Singh, Bhiksha Raj:
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems. ASRU 2023: 1-8 - [c92]Xiang Li, Jinglu Wang, Xiaohao Xu, Muqiao Yang, Fan Yang, Yizhou Zhao, Rita Singh, Bhiksha Raj:
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text. EMNLP 2023: 2283-2296 - [c91]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
Token Prediction as Implicit Classification to Identify LLM-Generated Text. EMNLP 2023: 13112-13120 - [c90]Yandong Wen, Weiyang Liu, Yao Feng, Bhiksha Raj, Rita Singh, Adrian Weller, Michael J. Black, Bernhard Schölkopf:
Pairwise Similarity Learning is SimPLE. ICCV 2023: 5285-5295 - [c89]Roshan Sharma, Siddhant Arora, Kenneth Zheng, Shinji Watanabe, Rita Singh, Bhiksha Raj:
BASS: Block-wise Adaptation for Speech Summarization. INTERSPEECH 2023: 1454-1458 - [c88]Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj:
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features. INTERSPEECH 2023: 2578-2582 - [c87]Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj:
Rethinking Voice-Face Correlation: A Geometry View. ACM Multimedia 2023: 2458-2467 - [c86]Soham Deshmukh, Benjamin Elizalde, Rita Singh, Huaming Wang:
Pengi: An Audio Language Model for Audio Tasks. NeurIPS 2023 - [c85]Xiang Li, Chung-Ching Lin, Yinpeng Chen, Zicheng Liu, Jinglu Wang, Rita Singh, Bhiksha Raj:
PaintSeg: Painting Pixels for Training-free Segmentation. NeurIPS 2023 - [i46]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content. CoRR abs/2305.07969 (2023) - [i45]Soham Deshmukh, Benjamin Elizalde, Rita Singh, Huaming Wang:
Pengi: An Audio Language Model for Audio Tasks. CoRR abs/2305.11834 (2023) - [i44]Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations. CoRR abs/2305.12715 (2023) - [i43]Roshan S. Sharma, Kenneth Zheng, Siddhant Arora, Shinji Watanabe, Rita Singh, Bhiksha Raj:
BASS: Block-wise Adaptation for Speech Summarization. CoRR abs/2307.08217 (2023) - [i42]Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj:
Rethinking Voice-Face Correlation: A Geometry View. CoRR abs/2307.13948 (2023) - [i41]Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj:
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features. CoRR abs/2307.13953 (2023) - [i40]Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. CoRR abs/2309.07372 (2023) - [i39]Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj:
Importance of negative sampling in weak label learning. CoRR abs/2309.13227 (2023) - [i38]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj:
Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition. CoRR abs/2310.00132 (2023) - [i37]Dareen Alharthi, Roshan Sharma, Hira Dhamyal, Soumi Maiti, Bhiksha Raj, Rita Singh:
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech. CoRR abs/2310.00706 (2023) - [i36]Xiang Li, Yinpeng Chen, Chung-Ching Lin, Rita Singh, Bhiksha Raj, Zicheng Liu:
Completing Visual Objects via Bridging Generation and Segmentation. CoRR abs/2310.00808 (2023) - [i35]Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Prompting Audios Using Acoustic Properties For Emotion Representation. CoRR abs/2310.02298 (2023) - [i34]Muhammad Ahmed Shah, Roshan Sharma, Hira Dhamyal, Raphaël Olivier, Ankit Shah, Joseph Konan, Dareen Alharthi, Hazim T. Bukhari, Massa Baali, Soham Deshmukh, Michael Kuhlmann, Bhiksha Raj, Rita Singh:
LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model. CoRR abs/2310.04445 (2023) - [i33]Yandong Wen, Weiyang Liu, Yao Feng, Bhiksha Raj, Rita Singh, Adrian Weller, Michael J. Black, Bernhard Schölkopf:
Pairwise Similarity Learning is SimPLE. CoRR abs/2310.09449 (2023) - [i32]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
Token Prediction as Implicit Classification to Identify LLM-Generated Text. CoRR abs/2311.08723 (2023) - 2022
- [c84]Yandong Wen, Weiyang Liu, Adrian Weller, Bhiksha Raj, Rita Singh:
SphereFace2: Binary Classification is All You Need for Deep Face Recognition. ICLR 2022 - [c83]Hira Dhamyal, Bhiksha Raj, Rita Singh:
Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection. INTERSPEECH 2022: 166-170 - [i31]Ankit Shah, Hira Dhamyal, Yang Gao, Rita Singh, Bhiksha Raj:
On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice. CoRR abs/2204.04802 (2022) - [i30]Roshan Sharma, Tyler Vuong, Mark Lindsey, Hira Dhamyal, Rita Singh, Bhiksha Raj:
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction. CoRR abs/2206.12568 (2022) - [i29]Roshan Sharma, Hira Dhamyal, Bhiksha Raj, Rita Singh:
Unifying the Discrete and Continuous Emotion labels for Speech Emotion Recognition. CoRR abs/2210.16642 (2022) - [i28]Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Describing emotions with acoustic property prompts for speech emotion recognition. CoRR abs/2211.07737 (2022) - 2021
- [c82]Mahmoud Al Ismail, Soham Deshmukh, Rita Singh:
Detection of Covid-19 Through the Analysis of Vocal Fold Oscillations. ICASSP 2021: 1035-1039 - [c81]Soham Deshmukh, Mahmoud Al Ismail, Rita Singh:
Interpreting Glottal Flow Dynamics for Detecting Covid-19 From Voice. ICASSP 2021: 1055-1059 - [c80]Yandong Wen, Weiyang Liu, Bhiksha Raj, Rita Singh:
Self-Supervised 3D Face Reconstruction via Conditional Estimation. ICCV 2021: 13269-13278 - [c79]Soham Deshmukh, Bhiksha Raj, Rita Singh:
Improving Weakly Supervised Sound Event Detection with Self-Supervised Auxiliary Tasks. Interspeech 2021: 596-600 - [c78]Yang Gao, Tyler Vuong, Mahsa Elyasi, Gaurav Bharaj, Rita Singh:
Generalized Spoofing Detection Inspired from Audio Generation Artifacts. Interspeech 2021: 4184-4188 - [c77]Jiachen Lian, Aiswarya Vinod Kumar, Hira Dhamyal, Bhiksha Raj, Rita Singh:
Masked Proxy Loss for Text-Independent Speaker Verification. Interspeech 2021: 4638-4642 - [c76]Yang Gao, Jiachen Lian, Bhiksha Raj, Rita Singh:
Detection and Evaluation of Human and Machine Generated Speech in Spoofing Attacks on Automatic Speaker Verification Systems. SLT 2021: 544-551 - [i27]Yang Gao, Tyler Vuong, Mahsa Elyasi, Gaurav Bharaj, Rita Singh:
Generalized Spoofing Detection Inspired from Audio Generation Artifacts. CoRR abs/2104.04111 (2021) - [i26]Soham Deshmukh, Bhiksha Raj, Rita Singh:
Improving weakly supervised sound event detection with self-supervised auxiliary tasks. CoRR abs/2106.06858 (2021) - [i25]Hao Liang, Lulan Yu, Guikang Xu, Bhiksha Raj, Rita Singh:
Controlled AutoEncoders to Generate Faces from Voices. CoRR abs/2107.07988 (2021) - [i24]Yandong Wen, Weiyang Liu, Adrian Weller, Bhiksha Raj, Rita Singh:
SphereFace2: Binary Classification is All You Need for Deep Face Recognition. CoRR abs/2108.01513 (2021) - [i23]Weiyang Liu, Yandong Wen, Bhiksha Raj, Rita Singh, Adrian Weller:
SphereFace Revived: Unifying Hyperspherical Face Recognition. CoRR abs/2109.05565 (2021) - [i22]Rita Singh, Ankit Shah, Hira Dhamyal:
An Overview of Techniques for Biomarker Discovery in Voice Signal. CoRR abs/2110.04678 (2021) - [i21]Yandong Wen, Weiyang Liu, Bhiksha Raj, Rita Singh:
Self-Supervised 3D Face Reconstruction via Conditional Estimation. CoRR abs/2110.04800 (2021) - 2020
- [c75]Wenbo Zhao, Rita Singh:
Speech-Based Parameter Estimation of an Asymmetric Vocal Fold Oscillation Model and its Application in Discriminating Vocal Fold Pathologies. ICASSP 2020: 7344-7348 - [c74]Rowland Chen, Roger B. Dannenberg, Bhiksha Raj, Rita Singh:
Artificial Creative Intelligence: Breaking the Imitation Barrier. ICCC 2020: 319-325 - [c73]Wenbo Zhao, Yang Gao, Shahan Ali Memon, Bhiksha Raj, Rita Singh:
Hierarchical Routing Mixture of Experts. ICPR 2020: 7900-7906 - [c72]Hira Dhamyal, Shahan Ali Memon, Bhiksha Raj, Rita Singh:
The Phonetic Bases of Vocal Expressed Emotion: Natural versus Acted. INTERSPEECH 2020: 3451-3455 - [c71]Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet:
Hide and Speak: Towards Deep Neural Networks for Speech Steganography. INTERSPEECH 2020: 4656-4660 - [c70]Hao Liang, Lulan Yu, Guikang Xu, Bhiksha Raj, Rita Singh:
Controlled AutoEncoders to Generate Faces from Voices. ISVC (1) 2020: 476-487 - [i20]Soham Deshmukh, Bhiksha Raj, Rita Singh:
Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection. CoRR abs/2008.07085 (2020) - [i19]Mahmoud Al Ismail, Soham Deshmukh, Rita Singh:
Detection of COVID-19 through the analysis of vocal fold oscillations. CoRR abs/2010.10707 (2020) - [i18]Soham Deshmukh, Mahmoud Al Ismail, Rita Singh:
Interpreting glottal flow dynamics for detecting COVID-19 from voice. CoRR abs/2010.16318 (2020) - [i17]Yang Gao, Jiachen Lian, Bhiksha Raj, Rita Singh:
Detection and Evaluation of human and machine generated speech in spoofing attacks on automatic speaker verification systems. CoRR abs/2011.03689 (2020) - [i16]Jiachen Lian, Aiswarya Vinod Kumar, Hira Dhamyal, Bhiksha Raj, Rita Singh:
Mask Proxy Loss for Text-Independent Speaker Recognition. CoRR abs/2011.04491 (2020)
2010 – 2019
- 2019
- [c69]Hira Dhamyal, Tianyan Zhou, Bhiksha Raj, Rita Singh:
Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification. ASRU 2019: 742-748 - [c68]Daanish Ali Khan, Saquib Razak, Bhiksha Raj, Rita Singh:
Human Behaviour Recognition Using Wifi Channel State Information. ICASSP 2019: 7625-7629 - [c67]Yandong Wen, Mahmoud Al Ismail, Weiyang Liu, Bhiksha Raj, Rita Singh:
Disjoint Mapping Network for Cross-modal Matching of Voices and Faces. ICLR (Poster) 2019 - [c66]Shahan Ali Memon, Wenbo Zhao, Bhiksha Raj, Rita Singh:
Neural Regression Trees. IJCNN 2019: 1-8 - [c65]Yandong Wen, Bhiksha Raj, Rita Singh:
Face Reconstruction from Voice using Generative Adversarial Networks. NeurIPS 2019: 5266-5275 - [i15]Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet:
Hide and Speak: Deep Neural Networks for Speech Steganography. CoRR abs/1902.03083 (2019) - [i14]Wenbo Zhao, Yang Gao, Shahan Ali Memon, Bhiksha Raj, Rita Singh:
Hierarchical Routing Mixture of Experts. CoRR abs/1903.07756 (2019) - [i13]Yandong Wen, Rita Singh, Bhiksha Raj:
Reconstructing faces from voices. CoRR abs/1905.10604 (2019) - [i12]Daanish Ali Khan, Linhong Li, Ninghao Sha, Zhuoran Liu, Abelino Jimenez, Bhiksha Raj, Rita Singh:
Non-Determinism in Neural Networks for Adversarial Robustness. CoRR abs/1905.10906 (2019) - [i11]Wenbo Zhao, Rita Singh:
Speech-Based Parameter Estimation of an Asymmetric Vocal Fold Oscillation Model and Its Application in Discriminating Vocal Fold Pathologies. CoRR abs/1910.08886 (2019) - [i10]Shahan Ali Memon, Hira Dhamyal, Oren Wright, Daniel Justice, Vijaykumar Palat, William Boler, Yandong Wen, Bhiksha Raj, Rita Singh:
Detecting gender differences in perception of emotion in crowdsourced data. CoRR abs/1910.11386 (2019) - [i9]Hira Dhamyal, Shahan Ali Memon, Bhiksha Raj, Rita Singh:
The phonetic bases of vocal expressed emotion: natural versus acted. CoRR abs/1911.05733 (2019) - 2018
- [c64]Yang Gao, Rita Singh, Bhiksha Raj:
Voice Impersonation Using Generative Adversarial Networks. ICASSP 2018: 2506-2510 - [c63]Yandong Wen, Tianyan Zhou, Rita Singh, Bhiksha Raj:
A Corrective Learning Approach for Text-Independent Speaker Verification. ICASSP 2018: 4894-4898 - [i8]Yang Gao, Rita Singh, Bhiksha Raj:
Voice Impersonation using Generative Adversarial Networks. CoRR abs/1802.06840 (2018) - [i7]Yandong Wen, Mahmoud Al Ismail, Bhiksha Raj, Rita Singh:
Optimal Strategies for Matching and Retrieval Problems by Comparing Covariates. CoRR abs/1807.04834 (2018) - [i6]Yandong Wen, Mahmoud Al Ismail, Weiyang Liu, Bhiksha Raj, Rita Singh:
Disjoint Mapping Network for Cross-modal Matching of Voices and Faces. CoRR abs/1807.04836 (2018) - [i5]Shahan Ali Memon, Wenbo Zhao, Bhiksha Raj, Rita Singh:
Neural Regression Trees. CoRR abs/1810.00974 (2018) - 2017
- [j4]Rita Singh, Abelino Jiménez, Anders Øland:
Voice disguise by mimicry: deriving statistical articulometric evidence to evaluate claimed impersonation. IET Biom. 6(4): 282-289 (2017) - [c62]Keiichi Osako, Yuki Mitsufuji, Rita Singh, Bhiksha Raj:
Supervised monaural source separation based on autoencoders. ICASSP 2017: 11-15 - [i4]Rita Singh, Justin T. Baker, Luciana Pennant, Louis-Philippe Morency:
Deducing the severity of psychiatric symptoms from the human voice. CoRR abs/1703.05344 (2017) - [i3]Wenbo Zhao, Yang Gao, Rita Singh:
Speaker identification from the sound of the human breath. CoRR abs/1712.00171 (2017) - 2016
- [c61]Rita Singh, Joseph Keshet, Deniz Gençaga, Bhiksha Raj:
The relationship of voice onset time and Voice Offset Time to physical age. ICASSP 2016: 5390-5394 - [c60]Jill Fain Lehman, Rita Singh:
Estimation of Children's Physical Characteristics from Their Voices. INTERSPEECH 2016: 1417-1421 - [c59]Rita Singh, Deniz Gençaga, Bhiksha Raj:
Formant manipulations in voice disguise by mimicry. IWBF 2016: 1-6 - [c58]Rita Singh, Bhiksha Raj, James Baker:
Short-term analysis for estimating physical parameters of speakers. IWBF 2016: 1-6 - [c57]Rita Singh, Bhiksha Raj, Deniz Gençaga:
Forensic anthropometry from voice: An articulatory-phonetic approach. MIPRO 2016: 1375-1380 - [c56]Shareef Babu Kalluri, Ashwin Vijayakumar, Deepu Vijayasenan, Rita Singh:
Estimating multiple physical parameters from speech data. MLSP 2016: 1-5 - [c55]Rita Singh:
Mereological algebras as mechanisms for reasoning about sounds. MLSP 2016: 1-6 - [p5]Rita Singh:
Minimizing Free Energy of Stochastic Functions of Markov Chains. Recent Advances in Nonlinear Speech Processing 2016: 227-233 - [i2]Rahul Radhakrishnan Iyer, Sanjeel Parekh, Vikas Mohandoss, Anush Ramsurat, Bhiksha Raj, Rita Singh:
Content-based Video Indexing and Retrieval Using Corr-LDA. CoRR abs/1602.08581 (2016) - 2015
- [c54]Rita Singh, Ken'ichi Kumatani:
Free energy for speech recognition. ICASSP 2015: 4515-4519 - [c53]Sundar Harshavardhan, Jill Fain Lehman, Rita Singh:
Keyword spotting in multi-player voice driven games for children. INTERSPEECH 2015: 1660-1664 - [c52]Shoou-I Yu, Lu Jiang, Zhongwen Xu, Zhenzhong Lan, Shicheng Xu, Xiaojun Chang, Xuanchong Li, Zexi Mao, Chuang Gan, Yajie Miao, Xingzhong Du, Yang Cai, Lara J. Martin, Nikolas Wolfe, Anurag Kumar, Huan Li, Ming Lin, Zhigang Ma, Yi Yang, Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard M. Stern, Alexander G. Hauptmann:
CMU Informedia@TRECVID 2015: MED/SIN/LNK/SED. TRECVID 2015 - [c51]Keiichi Osako, Rita Singh, Bhiksha Raj:
Complex recurrent neural networks for denoising speech signals. WASPAA 2015: 1-5 - [i1]Soham De, Indradyumna Roy, Tarunima Prabhakar, Kriti Suneja, Sourish Chaudhuri, Rita Singh, Bhiksha Raj:
Plagiarism Detection in Polyphonic Music using Monaural Signal Separation. CoRR abs/1503.00022 (2015) - 2014
- [c50]Anurag Kumar, Rita Singh, Bhiksha Raj:
Detecting sound objects in audio recordings. EUSIPCO 2014: 905-909 - [c49]Rita Singh:
Audio Classification with Thermodynamic Criteria. IC2E 2014: 526-533 - [c48]Pallavi Baljekar, Jill Fain Lehman, Rita Singh:
Online word-spotting in continuous speech with recurrent neural networks. SLT 2014: 536-541 - [c47]Shoou-I Yu, Lu Jiang, Zhongwen Xu, Zhenzhong Lan, Shicheng Xu, Xiaojun Chang, Xuanchong Li, Zexi Mao, Chuang Gan, Yajie Miao, Xingzhong Du, Yang Cai, Lara J. Martin, Nikolas Wolfe, Anurag Kumar, Huan Li, Ming Lin, Zhigang Ma, Yi Yang, Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard M. Stern, Alexander G. Hauptmann, Anil Armagan, Yicheng Zhao:
Informedia @ TRECVID 2014. TRECVID 2014 - 2013
- [c46]Anurag Kumar, Rajesh M. Hegde, Rita Singh, Bhiksha Raj:
Event detection in short duration audio using Gaussian Mixture Model and Random Forest Classifier. EUSIPCO 2013: 1-5 - [c45]Ken'ichi Kumatani, Rita Singh, Friedrich Faubel, John W. McDonough, Youssef Oualil:
Joint constrained maximum likelihood regression for overlapping speech recognition. ICASSP 2013: 121-125 - [c44]Shubhranshu Barnwal, Rohit Barnwal, Rajesh M. Hegde, Rita Singh, Bhiksha Raj:
Doppler based speed estimation of vehicles using passive sensor. ICME Workshops 2013: 1-4 - [c43]Benjamin Lambert, Bhiksha Raj, Rita Singh:
Discriminatively trained dependency language modeling for conversational speech recognition. INTERSPEECH 2013: 3414-3418 - [c42]Zhenzhong Lan, Lu Jiang, Shoou-I Yu, Chenqiang Gao, Shourabh Rawat, Yang Cai, Shicheng Xu, Haoquan Shen, Xuanchong Li, Yipei Wang, Waito Sze, Yan Yan, Zhigang Ma, Nicolas Ballas, Deyu Meng, Wei Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Richard M. Stern, Teruko Mitamura, Eric Nyberg, Alexander G. Hauptmann:
Informedia@TRECVID 2013. TRECVID 2013 - 2012
- [c41]Ken'ichi Kumatani, Takayuki Arakawa, Kazumasa Yamamoto, John W. McDonough, Bhiksha Raj, Rita Singh, Ivan Tashev:
Microphone array processing for distant speech recognition: Towards real-world deployment. APSIPA 2012: 1-10 - [c40]Anurag Kumar, Pranay Dighe, Rita Singh, Sourish Chaudhuri, Bhiksha Raj:
Audio event detection from acoustic unit occurrence patterns. ICASSP 2012: 489-492 - [c39]Rita Singh:
Compensating for denoising artifacts. ICASSP 2012: 4661-4664 - [c38]Shubhranshu Barnwal, Kamal Sahni, Rita Singh, Bhiksha Raj:
Spectrographic seam patterns for discriminative word spotting. ICASSP 2012: 4725-4728 - [c37]Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj:
Language identification using spectro-temporal patch features. SAPA@INTERSPEECH 2012: 110-113 - [c36]Ken'ichi Kumatani, Bhiksha Raj, Rita Singh, John W. McDonough:
Microphone Array Post-filter based on Spatially-Correlated Noise Measurements for Distant Speech Recognition. INTERSPEECH 2012: 298-301 - [c35]Sourish Chaudhuri, Rita Singh, Bhiksha Raj:
Exploiting Temporal Sequence Structure for Semantic Analysis of Multimedia. INTERSPEECH 2012: 1728-1731 - [c34]Soham De, Indradyumna Roy, Tarunima Prabhakar, Kriti Suneja, Sourish Chaudhuri, Rita Singh, Bhiksha Raj:
Plagiarism Detection in Polyphonic Music using Monaural Signal Separation. INTERSPEECH 2012: 1744-1747 - [c33]Rita Singh, Ken'ichi Kumatani, John W. McDonough, Chen Liu:
A signal-separation-based array postfilter for distant speech recognition. INTERSPEECH 2012: 1934-1937 - [c32]Shoou-I Yu, Zhongwen Xu, Duo Ding, Waito Sze, Francisco Vicente, Zhenzhong Lan, Yang Cai, Shourabh Rawat, Peter F. Schulam, Nisarga Markandaiah, Sohail Bahmani, Antonio Juárez, Wei Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Richard M. Stern, Teruko Mitamura, Eric Nyberg, Lu Jiang, Qiang Chen, Lisa M. Brown, Ankur Datta, Quanfu Fan, Rogério Schmidt Feris, Shuicheng Yan, Alexander G. Hauptmann, Sharath Pankanti:
Informedia @TRECVID 2012. TRECVID 2012 - [p4]Tuomas Virtanen, Rita Singh, Bhiksha Raj:
Introduction. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 1-5 - [p3]Rita Singh, Bhiksha Raj, Tuomas Virtanen:
The Basics of Automatic Speech Recognition. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 7-30 - [p2]Bhiksha Raj, Tuomas Virtanen, Rita Singh:
The Problem of Robustness in Automatic Speech Recognition. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 31-50 - [e1]Tuomas Virtanen, Rita Singh, Bhiksha Raj:
Techniques for Noise Robustness in Automatic Speech Recognition. Wiley 2012, ISBN 978-1-119-97088-0 - 2011
- [c31]Kshitiz Kumar, Rita Singh, Bhiksha Raj, Richard M. Stern:
Gammatone sub-band magnitude-domain dereverberation for ASR. ICASSP 2011: 4604-4607 - [c30]Kshitiz Kumar, Bhiksha Raj, Rita Singh, Richard M. Stern:
An iterative least-squares technique for dereverberation. ICASSP 2011: 5488-5491 - [c29]Bhiksha Raj, Rita Singh, James Baker:
A paired test for recognizer selection with untranscribed data. ICASSP 2011: 5676-5679 - [c28]Bhiksha Raj, Rita Singh, Tuomas Virtanen:
Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures. INTERSPEECH 2011: 1217-1220 - [p1]Bhiksha Raj, Rita Singh:
Reconstructing Noise-Corrupted Spectrographic Components for Robust Speech Recognition. Robust Speech Recognition of Uncertain or Missing Data 2011: 127-156 - 2010
- [c27]Rita Singh, Bhiksha Raj, Paris Smaragdis:
Latent-variable decomposition based dereverberation of monaural and multi-channel signals. ICASSP 2010: 1914-1917 - [c26]Bhiksha Raj, Tuomas Virtanen, Sourish Chaudhuri, Rita Singh:
Non-negative matrix factorization based compensation of music for automatic speech recognition. INTERSPEECH 2010: 717-720 - [c25]Benjamin Lambert, Rita Singh, Bhiksha Raj:
Creating a linguistic plausibility dataset with non-expert annotators. INTERSPEECH 2010: 1906-1909 - [c24]Rita Singh, Benjamin Lambert, Bhiksha Raj:
The use of sense in unsupervised training of acoustic models for ASR systems. INTERSPEECH 2010: 2938-2941
2000 – 2009
- 2009
- [c23]Dhananjay Bansal, Nishanth Ulhas Nair, Rita Singh, Bhiksha Raj:
A joint decoding algorithm for multiple-example-based addition of words to a pronunciation lexicon. ICASSP 2009: 4293-4296 - 2007
- [c22]Bhiksha Raj, Rita Singh, Madhusudana V. S. Shashanka, Paris Smaragdis:
Bandwidth Expansion with a Pólya Urn Model. ICASSP (4) 2007: 597-600 - [c21]Rita Singh, Evandro B. Gouvêa, Bhiksha Raj:
Probabilistic deduction of symbol mappings for extension of lexicons. INTERSPEECH 2007: 1789-1792 - 2005
- [c20]Bhiksha Raj, Rita Singh:
Feature compensation with secondary sensor measurements for robust speech recognition. EUSIPCO 2005: 1-4 - [c19]Bhiksha Raj, Rita Singh, Paris Smaragdis:
Recognizing speech from simultaneous speakers. INTERSPEECH 2005: 3317-3320 - [c18]Chiman Kwan, Xiaokun Li, Debang Lao, Yunbin Deng, Zhubing Ren, Bhiksha Raj, Rita Singh, Richard M. Stern:
Voice driven applications in non-stationary and chaotic environment. ROBIO 2005: 127-132 - 2004
- [j3]Rita Singh, Bhiksha Raj:
Classification in Likelihood Spaces. Technometrics 46(3): 318-329 (2004) - [c17]Bhiksha Raj, Rita Singh, Richard M. Stern:
On tracking noise with linear dynamical system models. ICASSP (1) 2004: 965-968 - [c16]Antoine Raux, Rita Singh:
Maximum-likelihood adaptation of semi-continuous HMMs by latent variable decomposition of state distributions. INTERSPEECH 2004: 5-8 - 2003
- [j2]Bhiksha Raj, Rita Singh:
Classifier-based non-linear projection for adaptive endpointing of continuous speech. Comput. Speech Lang. 17(1): 5-26 (2003) - [c15]Rita Singh, Bhiksha Raj:
Tracking noise via dynamical systems with a continuum of states. ICASSP (1) 2003: 396-399 - [c14]Paul Lamere, Philip Kwok, William Walker, Evandro B. Gouvêa, Rita Singh, Bhiksha Raj, Peter Wolf:
Design of the CMU sphinx-4 decoder. INTERSPEECH 2003: 1181-1184 - [c13]Rita Singh, Manfred K. Warmuth, Bhiksha Raj, Paul Lamere:
Classification with free energy at raised temperatures. INTERSPEECH 2003: 1773-1776 - 2002
- [j1]Rita Singh, Bhiksha Raj, Richard M. Stern:
Automatic generation of subword units for speech recognition systems. IEEE Trans. Speech Audio Process. 10(2): 89-99 (2002) - [c12]Xiang Li, Rita Singh, Richard M. Stern:
Combining search spaces of heterogeneous recognizers for improved speech recognition. INTERSPEECH 2002: 405-408 - [c11]Alan W. Black, Ralf D. Brown, Robert E. Frederking, Kevin A. Lenzo, John Moody, Alexander I. Rudnicky, Rita Singh, Eric Steinbrecher:
Rapid development of speech-to-speech translation systems. INTERSPEECH 2002: 1709-1712 - 2001
- [c10]Rita Singh, Michael L. Seltzer, Bhiksha Raj, Richard M. Stern:
Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination. ICASSP 2001: 273-276 - [c9]Daniel P. W. Ellis, Rita Singh, Sunil Sivadas:
Tandem acoustic modeling in large-vocabulary recognition. ICASSP 2001: 517-520 - 2000
- [c8]Rita Singh, Bhiksha Raj, Richard M. Stern:
Automatic generation of phone sets and lexical transcriptions. ICASSP 2000: 1691-1694 - [c7]Alexander I. Rudnicky, Christina L. Bennett, Alan W. Black, Ananlada Chotimongkol, Kevin A. Lenzo, Alice Oh, Rita Singh:
Task and domain specific modelling in the Carnegie Mellon communicator system. INTERSPEECH 2000: 130-134 - [c6]Rita Singh, Bhiksha Raj, Richard M. Stern:
Structured redefinition of sound units by merging and splitting for improved speech recognition. INTERSPEECH 2000: 151-154 - [c5]Jon P. Nedel, Rita Singh, Richard M. Stern:
Phone transition acoustic modeling: application to speaker independent and spontaneous speech systems. INTERSPEECH 2000: 572-575 - [c4]Jon P. Nedel, Rita Singh, Richard M. Stern:
Automatic subword unit refinement for spontaneous speech recognition via phone splitting. INTERSPEECH 2000: 588-591
1990 – 1999
- 1999
- [c3]Rita Singh, Bhiksha Raj, Richard M. Stern:
Automatic clustering and generation of contextual questions for tied states in hidden Markov models. ICASSP 1999: 117-120 - [c2]Rita Singh, Bhiksha Raj, Richard M. Stern:
Domain adduced state tying for cross-domain acoustic modelling. EUROSPEECH 1999: 1707-1710 - 1998
- [c1]Bhiksha Raj, Rita Singh, Richard M. Stern:
Inference of missing spectrographic features for robust speech recognition. ICSLP 1998
last updated on 2024-11-25 23:41 CET by the dblp team
all metadata released as open data under CC0 1.0 license