default search action

combined dblp search
author search
venue search
publication search

ask others

Pranay Dighe

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KrishnaDRDAAT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KrishnaDRDAAT24
Gautam Krishna, Sameer Dharur, Oggi Rudovic, Pranay Dighe, Saurabh Adya, Ahmed Hussen Abdelaziz, Ahmed H. Tewfik:
Modality Drop-Out for Multimodal Device Directed Speech Detection Using Verbal and Non-Verbal Features. ICASSP 2024: 8240-8244
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DigheSZLGNT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DigheSZLGNT24
Pranay Dighe, Yi Su, Shangshang Zheng, Yunshu Liu, Vineet Garg, Xiaochuan Niu, Ahmed H. Tewfik:
Leveraging Large Language Models for Exploiting ASR Uncertainty. ICASSP 2024: 12231-12235
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-21075
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-21075
Tom Gunter, Zirui Wang, Chong Wang, Ruoming Pang, Andy Narayanan, Aonan Zhang, Bowen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek, Sam Wiseman, Syd Evans, Tao Lei, Vivek Rathod, Xiang Kong, Xianzhi Du, Yanghao Li, Yongqiang Wang, Yuan Gao, Zaid Ahmed, Zhaoyang Xu, Zhiyun Lu, Al Rashid, Albin Madappally Jose, Alec Doane, Alfredo Bencomo, Allison Vanderby, Andrew Hansen, Ankur Jain, Anupama Mann Anupama, Areeba Kamal, Bugu Wu, Carolina Brum, Charlie Maalouf, Chinguun Erdenebileg, Chris Dulhanty, Dominik Moritz, Doug Kang, Eduardo Jimenez, Evan Ladd, Fangping Shi, Felix Bai, Frank Chu, Fred Hohman, Hadas Kotek, Hannah Gillis Coleman, Jane Li, Jeffrey P. Bigham, Jeffery Cao, Jeff Lai, Jessica Cheung, Jiulong Shan, Joe Zhou, John Li, Jun Qin, Karanjeet Singh, Karla Vega, Kelvin Zou, Laura Heckman, Lauren Gardiner, Margit Bowler, Maria Cordell, Meng Cao, Nicole Hay, Nilesh Shahdadpuri, Otto Godwin, Pranay Dighe, Pushyami Rachapudi, Ramsey Tantawi, Roman Frigg, Sam Davarnia, Sanskruti Shah, Saptarshi Guha, Sasha Sirovica, Shen Ma, Shuang Ma, Simon Wang, Sulgi Kim, Suma Jayaram, Vaishaal Shankar, Varsha Paidi, Vivek Kumar, Xin Wang, Xin Zheng, Walker Cheng:
Apple Intelligence Foundation Language Models. CoRR abs/2407.21075 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-00023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-00023
Ognjen Rudovic, Pranay Dighe, Yi Su, Vineet Garg, Sameer Dharur, Xiaochuan Niu, Ahmed Hussen Abdelaziz, Saurabh Adya, Ahmed H. Tewfik:
Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models. CoRR abs/2411.00023 (2024)
2023
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DigheNRMNT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DigheNRMNT23
Pranay Dighe, Prateeth Nayak, Oggi Rudovic, Erik Marchi, Xiaochuan Niu, Ahmed H. Tewfik:
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR. ICASSP 2023: 1-5
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RudovicCGDSBAKMA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RudovicCGDSBAKMA23
Oggi Rudovic, Wonil Chang, Vineet Garg, Pranay Dighe, Pramod Simha, Jack Berkowitz, Ahmed Hussen Abdelaziz, Sachin Kajarekar, Erik Marchi, Saurabh Adya:
Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types. ICASSP 2023: 1-5
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-04842
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-04842
Pranay Dighe, Yi Su, Shangshang Zheng, Yunshu Liu, Vineet Garg, Xiaochuan Niu, Ahmed H. Tewfik:
Leveraging Large Language Models for Exploiting ASR Uncertainty. CoRR abs/2309.04842 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-15261
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-15261
Gautam Krishna, Sameer Dharur, Oggi Rudovic, Pranay Dighe, Saurabh Adya, Ahmed Hussen Abdelaziz, Ahmed H. Tewfik:
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features. CoRR abs/2310.15261 (2023)
2022
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RudovicBGSDK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RudovicBGSDK22
Ognjen (Oggi) Rudovic, Akanksha Bindal, Vineet Garg, Pramod Simha, Pranay Dighe, Sachin Kajarekar:
Streaming on-Device Detection of Device Directed Speech from Voice and Touch-Based Invocation. ICASSP 2022: 491-495
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GargRDAMADT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GargRDAMADT22
Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed Hussen Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models. INTERSPEECH 2022: 1258-1262
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15975
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15975
Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed Hussen Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models. CoRR abs/2203.15975 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-12134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-12134
Pranay Dighe, Prateeth Nayak, Oggi Rudovic, Erik Marchi, Xiaochuan Niu, Ahmed H. Tewfik:
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR. CoRR abs/2210.12134 (2022)
2021
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DigheMVKN21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DigheMVKN21
Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik:
Knowledge Transfer for Efficient on-Device False Trigger Mitigation. ICASSP 2021: 6838-6842
[c16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GargCSASDD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GargCSASDD21
Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir:
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation. Interspeech 2021: 4209-4213
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-06598
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-06598
Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir:
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation. CoRR abs/2105.06598 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04656
Ognjen Rudovic, Akanksha Bindal, Vineet Garg, Pramod Simha, Pranay Dighe, Sachin Kajarekar:
Streaming on-device detection of device directed speech from voice and touch-based invocation. CoRR abs/2110.04656 (2021)
2020
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/DigheAB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/DigheAB20
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
On quantifying the quality of acoustic models in hybrid DNN-HMM ASR. Speech Commun. 119: 24-35 (2020)
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DigheALVNSMPW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DigheALVNSMPW20
Pranay Dighe, Saurabh Adya, Nuoyu Li, Srikanth Vishnubhotla, Devang Naik, Adithya Sagar, Ying Ma, Stephen Pulman, Jason D. Williams:
Lattice-Based Improvements for Voice Triggering Using Graph Neural Networks. ICASSP 2020: 7459-7463
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AgarwalNDVBN20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AgarwalNDVBN20
Rishika Agarwal, Xiaochuan Niu, Pranay Dighe, Srikanth Vishnubhotla, Sameer Badaskar, Devang Naik:
Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation. INTERSPEECH 2020: 4288-4292
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-10822
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-10822
Pranay Dighe, Saurabh Adya, Nuoyu Li, Srikanth Vishnubhotla, Devang Naik, Adithya Sagar, Ying Ma, Stephen Pulman, Jason D. Williams:
Lattice-based Improvements for Voice Triggering Using Graph Neural Networks. CoRR abs/2001.10822 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-08113
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-08113
Rishika Agarwal, Xiaochuan Niu, Pranay Dighe, Srikanth Vishnubhotla, Sameer Badaskar, Devang Naik:
Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation. CoRR abs/2008.08113 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-10591
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-10591
Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik:
Knowledge Transfer for Efficient On-device False Trigger Mitigation. CoRR abs/2010.10591 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/DigheAB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/DigheAB19
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Low-rank and sparse subspace modeling of speech for DNN based acoustic modeling. Speech Commun. 109: 34-45 (2019)
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VyasDTB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VyasDTB19
Apoorv Vyas, Pranay Dighe, Sibo Tong, Hervé Bourlard:
Analyzing Uncertainties in Speech Recognition Using Dropout. ICASSP 2019: 6730-6734
2018
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/DigheAB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/DigheAB18
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Far-Field ASR Using Low-Rank and Sparse Soft Targets from Parallel Data. SLT 2018: 581-587
2017
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DigheAB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DigheAB17
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Low-rank and sparse soft targets to learn better DNN acoustic models. ICASSP 2017: 5265-5269
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DigheAB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DigheAB17
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Exploiting Eigenposteriors for Semi-Supervised Training of DNN Acoustic Models with Sequence Discrimination. INTERSPEECH 2017: 3552-3556
2016
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/DigheAB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/DigheAB16
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Sparse modeling of neural network posterior probabilities for exemplar-based speech recognition. Speech Commun. 76: 230-244 (2016)
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DigheLAB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DigheLAB16
Pranay Dighe, Gil Luyet, Afsaneh Asaei, Hervé Bourlard:
Exploiting low-dimensional structures to enhance DNN based acoustic modeling in speech recognition. ICASSP 2016: 5690-5694
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuyetDAB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuyetDAB16
Gil Luyet, Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Low-Rank Representation of Nearest Neighbor Posterior Probabilities to Enhance DNN Based Acoustic Modeling. INTERSPEECH 2016: 3449-3453
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/DigheLAB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/DigheLAB16
Pranay Dighe, Gil Luyet, Afsaneh Asaei, Hervé Bourlard:
Exploiting Low-dimensional Structures to Enhance DNN Based Acoustic Modeling in Speech Recognition. CoRR abs/1601.05936 (2016)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/DigheAB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/DigheAB16
Pranay Dighe, Afsaneh Asaei, Hervé Bourlard:
Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models. CoRR abs/1610.05688 (2016)
2015
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RamADB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RamADB15
Dhananjay Ram, Afsaneh Asaei, Pranay Dighe, Hervé Bourlard:
Sparse modeling of posterior exemplars for keyword detection. INTERSPEECH 2015: 3690-3694
2014
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DigheFB14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DigheFB14
Pranay Dighe, Marc Ferras, Hervé Bourlard:
Detecting and labeling speakers on overlapping speech using vector taylor series. INTERSPEECH 2014: 592-596
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/DigheFB14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/DigheFB14
Pranay Dighe, Marc Ferras, Hervé Bourlard:
Modeling Overlapping Speech using Vector Taylor Series. Odyssey 2014: 194-199
2013
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/DigheAKTR13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/DigheAKTR13
Pranay Dighe, Parul Agarwal, Harish Karnick, Siddartha Thota, Bhiksha Raj:
Scale independent raga identification using chromagram patterns and swara based features. ICME Workshops 2013: 1-4
[c3]
- view
  - electronic edition @ pucpr.br
  - details & citations
- export record
  dblp key:
  - conf/ismir/DigheKR13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/DigheKR13
Pranay Dighe, Harish Karnick, Bhiksha Raj:
Swara Histogram Based Structural Analysis And Identification Of Indian Classical Ragas. ISMIR 2013: 35-40
2012
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KumarDSCR12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarDSCR12
Anurag Kumar, Pranay Dighe, Rita Singh, Sourish Chaudhuri, Bhiksha Raj:
Audio event detection from acoustic unit occurrence patterns. ICASSP 2012: 489-492
[c1]
- view
  - electronic edition @ isca-archive.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/interspeech/SahniDSR12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SahniDSR12
Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj:
Language identification using spectro-temporal patch features. SAPA@INTERSPEECH 2012: 110-113

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.