Nirmesh J. Shah

Nirmesh Shah

Person information

affiliation: Sony Research India

2020 – today

2024
[c32]
Shivam Mhaskar, Nirmesh Shah, Mohammadi Zaki, Ashishkumar P. Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah:
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning. NAACL-HLT (Findings) 2024: 3966-3976
[i6]
Shivam Ratnakant Mhaskar, Nirmesh J. Shah, Mohammadi Zaki, Ashishkumar P. Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah:
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning. CoRR abs/2403.15469 (2024)
[i5]
Ashishkumar P. Gudmalwar, Nirmesh Shah, Sai Akarsh, Pankaj Wasnik, Rajiv Ratn Shah:
VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech. CoRR abs/2406.08076 (2024)
[i4]
Neha Sahipjohn, Ashishkumar P. Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Rajiv Ratn Shah:
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing. CoRR abs/2406.08802 (2024)
[i3]
Ashishkumar Gudmalwar, Ishan D. Biyani, Nirmesh Shah, Pankaj Wasnik, Rajiv Ratn Shah:
EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion. CoRR abs/2412.20359 (2024)
2023
[c31]
Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Nonparallel Emotional Voice Conversion for Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing. ICASSP 2023: 1-5
[i2]
Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing. CoRR abs/2302.10536 (2023)
2022
[c30]
Vishal M. Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki Onoe:
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation. CVPR Workshops 2022: 4651-4660
[c29]
Tarun Sai Bandarupalli, Shakti Rath, Nirmesh Shah, Naoyuki Onoe, Sriram Ganapathy:
Semi-supervised Acoustic and Language Modeling for Hindi ASR. INTERSPEECH 2022: 3528-3532
[i1]
Vishal M. Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki Onoe:
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation. CoRR abs/2206.02187 (2022)
2021
[c28]
Nirmesh J. Shah, M. Ali Basha Shaik, P. Periyasamy, Hemant A. Patil, Vikram Vij:
Exploiting Phase-based Features for Whisper vs. Speech Classification. EUSIPCO 2021: 21-25
2020
[c27]
Neil Shah, Sreeraj R, Maulik C. Madhavi, Nirmesh J. Shah, Hemant A. Patil:
Query-By-Example Spoken Term Detection Using Generative Adversarial Network. APSIPA 2020: 644-648
[c26]
Mirali Purohit, Maitreya Patel, Harshit Malaviya, Ankur T. Patil, Mihir Parmar, Nirmesh J. Shah, Savan Doshi, Hemant A. Patil:
Intelligibility Improvement of Dysarthric Speech using MMSE DiscoGAN. SPCOM 2020: 1-5

2010 – 2019

2019
[j1]
Nirmesh J. Shah, Hemant A. Patil:
A novel approach to remove outliers for parallel voice conversion. Comput. Speech Lang. 58: 127-152 (2019)
[c25]
Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Hemant A. Patil:
Novel Adaptive Generative Adversarial Network for Voice Conversion. APSIPA 2019: 1273-1281
[c24]
Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Maitreya Patel, Hemant A. Patil:
Effectiveness of Cross-Domain Architectures for Whisper-to-Normal Speech Conversion. EUSIPCO 2019: 1-5
[c23]
Nirmesh J. Shah, Hemant A. Patil:
Novel Metric Learning for Non-parallel Voice Conversion. ICASSP 2019: 3722-3726
[c22]
Nirmesh J. Shah, Hemant A. Patil:
Phone Aware Nearest Neighbor Technique Using Spectral Transition Measure for Non-Parallel Voice Conversion. INTERSPEECH 2019: 639-643
[c21]
Nirmesh J. Shah, Hardik B. Sailor, Hemant A. Patil:
Whether to Pretrain DNN or not?: An Empirical Analysis for Voice Conversion. INTERSPEECH 2019: 1586-1590
[c20]
Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh Shah, Hemant A. Patil:
Novel Inception-GAN for Whispered-to-Normal Speech Conversion. SSW 2019: 87-92
2018
[c19]
Nirmesh J. Shah, R. Sreeraj, Neil Shah, Hemant A. Patil:
Novel Inter Mixture Weighted GMM Posteriorgram for DNN and GAN-based Voice Conversion. APSIPA 2018: 1776-1781
[c18]
Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of Dynamic Features in INCA and Temporal Context-INCA. INTERSPEECH 2018: 711-715
[c17]
Nirmesh J. Shah, Maulik C. Madhavi, Hemant A. Patil:
Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion. INTERSPEECH 2018: 1968-1972
[c16]
Neil Shah, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of Generative Adversarial Network for Non-Audible Murmur-to-Whisper Speech Conversion. INTERSPEECH 2018: 3157-3161
2017
[c15]
Nirmesh J. Shah, Hemant A. Patil:
On the convergence of INCA algorithm. APSIPA 2017: 559-562
[c14]
Nirmesh J. Shah, Pramod B. Bachhav, Hemant A. Patil:
A novel filtering-based F0 estimation algorithm with an application to voice conversion. APSIPA 2017: 1528-1531
[c13]
Avni Rajpal, Nirmesh J. Shah, Mohammadi Zaki, Hemant A. Patil:
Quality assessment of voice converted speech using articulatory features. ICASSP 2017: 5515-5519
[c12]
Nirmesh J. Shah, Hemant A. Patil:
Novel Amplitude Scaling method for bilinear frequency Warping-based Voice Conversion. ICASSP 2017: 5520-5524
[c11]
Nirmesh J. Shah, Hemant A. Patil:
Analysis of Features and Metrics for Alignment in Text-Dependent Voice Conversion. PReMI 2017: 299-307
2016
[c10]
Sushant V. Rao, Nirmesh J. Shah, Hemant A. Patil:
Novel Pre-processing using Outlier Removal in Voice Conversion. SSW 2016: 134-139
2015
[c9]
Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of multiscale fractal dimension for improvement of frame classification rate. EUSIPCO 2015: 1018-1022
2014
[c8]
Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of multiscale fractal dimension-based phonetic segmentation in speech synthesis for low resource language. IALP 2014: 103-106
[c7]
Nirmesh J. Shah, Mohammadi Zaki, Hemant A. Patil:
Influence of various asymmetrical contextual factors for TTS in a low resource language. IALP 2014: 107-110
[c6]
Nirmesh J. Shah, Bhavik B. Vachhani, Hardik B. Sailor, Hemant A. Patil:
Effectiveness of PLP-based phonetic segmentation for speech synthesis. ICASSP 2014: 270-274
[c5]
Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of fractal dimension for ASR in low resource language. ISCSLP 2014: 464-468
[c4]
Nirmesh J. Shah, Hemant A. Patil, Maulik C. Madhavi, Hardik B. Sailor, Tanvina B. Patel:
Deterministic annealing EM algorithm for developing TTS system in Gujarati. ISCSLP 2014: 526-530
2013
[c3]
Swati Talesara, Hemant A. Patil, Tanvina B. Patel, Hardik B. Sailor, Nirmesh J. Shah:
A Novel Gaussian Filter-Based Automatic Labeling of Speech Data for TTS System in Gujarati Language. IALP 2013: 139-142
[c2]
Hemant A. Patil, Tanvina B. Patel, Nirmesh J. Shah, Hardik B. Sailor, Raghava Krishnan, G. R. Kasthuri, T. Nagarajan, S. Lilly Christina, Naresh Kumar, Veera Raghavendra, S. Prahallad Kishore, S. R. Mahadeva Prasanna, Nagaraj Adiga, Sanasam Ranbir Singh, Anand Konjengbam, Pranaw Kumar, Bira Chandra Singh, S. L. Binil Kumar, T. G. Bhadran, T. Sajini, Arup Saha, Tulika Basu, K. Sreenivasa Rao, N. P. Narendra, Anil Kumar Sao, Rakesh Kumar, Pranhari Talukdar, Purnendu Acharyaa, Somnath Chandra, Swaran Lata, Hema A. Murthy:
A syllable-based framework for unit selection synthesis in 13 Indian languages. O-COCOSDA/CASLRE 2013: 1-8
[c1]
Hemant A. Patil, Tanvina B. Patel, Swati Talesara, Nirmesh J. Shah, Hardik B. Sailor, Bhavik B. Vachhani, Janki Akhani, Bhargav Kanakiya, Yashesh Gaur, Vibha Prajapati:
Algorithms for speech segmentation at syllable-level for text-to-speech synthesis system in Gujarati. O-COCOSDA/CASLRE 2013: 1-7
