Nirmesh J. Shah
Person information
- affiliation: Sony Research India
2020 – today
- 2024
- [c32] Shivam Mhaskar, Nirmesh Shah, Mohammadi Zaki, Ashishkumar P. Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah:
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning. NAACL-HLT (Findings) 2024: 3966-3976
- [i6] Shivam Ratnakant Mhaskar, Nirmesh J. Shah, Mohammadi Zaki, Ashishkumar P. Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah:
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning. CoRR abs/2403.15469 (2024)
- [i5] Ashishkumar P. Gudmalwar, Nirmesh Shah, Sai Akarsh, Pankaj Wasnik, Rajiv Ratn Shah:
VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech. CoRR abs/2406.08076 (2024)
- [i4] Neha Sahipjohn, Ashishkumar P. Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Rajiv Ratn Shah:
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing. CoRR abs/2406.08802 (2024)
- [i3] Ashishkumar Gudmalwar, Ishan D. Biyani, Nirmesh Shah, Pankaj Wasnik, Rajiv Ratn Shah:
EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion. CoRR abs/2412.20359 (2024)
- 2023
- [c31] Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Nonparallel Emotional Voice Conversion for Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing. ICASSP 2023: 1-5
- [i2] Nirmesh Shah, Mayank Kumar Singh, Naoya Takahashi, Naoyuki Onoe:
Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing. CoRR abs/2302.10536 (2023)
- 2022
- [c30] Vishal M. Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki Onoe:
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation. CVPR Workshops 2022: 4651-4660
- [c29] Tarun Sai Bandarupalli, Shakti Rath, Nirmesh Shah, Naoyuki Onoe, Sriram Ganapathy:
Semi-supervised Acoustic and Language Modeling for Hindi ASR. INTERSPEECH 2022: 3528-3532
- [i1] Vishal M. Chudasama, Purbayan Kar, Ashish Gudmalwar, Nirmesh Shah, Pankaj Wasnik, Naoyuki Onoe:
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation. CoRR abs/2206.02187 (2022)
- 2021
- [c28] Nirmesh J. Shah, M. Ali Basha Shaik, P. Periyasamy, Hemant A. Patil, Vikram Vij:
Exploiting Phase-based Features for Whisper vs. Speech Classification. EUSIPCO 2021: 21-25
- 2020
- [c27] Neil Shah, Sreeraj R, Maulik C. Madhavi, Nirmesh J. Shah, Hemant A. Patil:
Query-By-Example Spoken Term Detection Using Generative Adversarial Network. APSIPA 2020: 644-648
- [c26] Mirali Purohit, Maitreya Patel, Harshit Malaviya, Ankur T. Patil, Mihir Parmar, Nirmesh J. Shah, Savan Doshi, Hemant A. Patil:
Intelligibility Improvement of Dysarthric Speech using MMSE DiscoGAN. SPCOM 2020: 1-5
2010 – 2019
- 2019
- [j1] Nirmesh J. Shah, Hemant A. Patil:
A novel approach to remove outliers for parallel voice conversion. Comput. Speech Lang. 58: 127-152 (2019)
- [c25] Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Hemant A. Patil:
Novel Adaptive Generative Adversarial Network for Voice Conversion. APSIPA 2019: 1273-1281
- [c24] Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Maitreya Patel, Hemant A. Patil:
Effectiveness of Cross-Domain Architectures for Whisper-to-Normal Speech Conversion. EUSIPCO 2019: 1-5
- [c23] Nirmesh J. Shah, Hemant A. Patil:
Novel Metric Learning for Non-parallel Voice Conversion. ICASSP 2019: 3722-3726
- [c22] Nirmesh J. Shah, Hemant A. Patil:
Phone Aware Nearest Neighbor Technique Using Spectral Transition Measure for Non-Parallel Voice Conversion. INTERSPEECH 2019: 639-643
- [c21] Nirmesh J. Shah, Hardik B. Sailor, Hemant A. Patil:
Whether to Pretrain DNN or not?: An Empirical Analysis for Voice Conversion. INTERSPEECH 2019: 1586-1590
- [c20] Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh Shah, Hemant A. Patil:
Novel Inception-GAN for Whispered-to-Normal Speech Conversion. SSW 2019: 87-92
- 2018
- [c19] Nirmesh J. Shah, R. Sreeraj, Neil Shah, Hemant A. Patil:
Novel Inter Mixture Weighted GMM Posteriorgram for DNN and GAN-based Voice Conversion. APSIPA 2018: 1776-1781
- [c18] Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of Dynamic Features in INCA and Temporal Context-INCA. INTERSPEECH 2018: 711-715
- [c17] Nirmesh J. Shah, Maulik C. Madhavi, Hemant A. Patil:
Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion. INTERSPEECH 2018: 1968-1972
- [c16] Neil Shah, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of Generative Adversarial Network for Non-Audible Murmur-to-Whisper Speech Conversion. INTERSPEECH 2018: 3157-3161
- 2017
- [c15] Nirmesh J. Shah, Hemant A. Patil:
On the convergence of INCA algorithm. APSIPA 2017: 559-562
- [c14] Nirmesh J. Shah, Pramod B. Bachhav, Hemant A. Patil:
A novel filtering-based F0 estimation algorithm with an application to voice conversion. APSIPA 2017: 1528-1531
- [c13] Avni Rajpal, Nirmesh J. Shah, Mohammadi Zaki, Hemant A. Patil:
Quality assessment of voice converted speech using articulatory features. ICASSP 2017: 5515-5519
- [c12] Nirmesh J. Shah, Hemant A. Patil:
Novel Amplitude Scaling method for bilinear frequency Warping-based Voice Conversion. ICASSP 2017: 5520-5524
- [c11] Nirmesh J. Shah, Hemant A. Patil:
Analysis of Features and Metrics for Alignment in Text-Dependent Voice Conversion. PReMI 2017: 299-307
- 2016
- [c10] Sushant V. Rao, Nirmesh J. Shah, Hemant A. Patil:
Novel Pre-processing using Outlier Removal in Voice Conversion. SSW 2016: 134-139
- 2015
- [c9] Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of multiscale fractal dimension for improvement of frame classification rate. EUSIPCO 2015: 1018-1022
- 2014
- [c8] Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of multiscale fractal dimension-based phonetic segmentation in speech synthesis for low resource language. IALP 2014: 103-106
- [c7] Nirmesh J. Shah, Mohammadi Zaki, Hemant A. Patil:
Influence of various asymmetrical contextual factors for TTS in a low resource language. IALP 2014: 107-110
- [c6] Nirmesh J. Shah, Bhavik B. Vachhani, Hardik B. Sailor, Hemant A. Patil:
Effectiveness of PLP-based phonetic segmentation for speech synthesis. ICASSP 2014: 270-274
- [c5] Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of fractal dimension for ASR in low resource language. ISCSLP 2014: 464-468
- [c4] Nirmesh J. Shah, Hemant A. Patil, Maulik C. Madhavi, Hardik B. Sailor, Tanvina B. Patel:
Deterministic annealing EM algorithm for developing TTS system in Gujarati. ISCSLP 2014: 526-530
- 2013
- [c3] Swati Talesara, Hemant A. Patil, Tanvina B. Patel, Hardik B. Sailor, Nirmesh J. Shah:
A Novel Gaussian Filter-Based Automatic Labeling of Speech Data for TTS System in Gujarati Language. IALP 2013: 139-142
- [c2] Hemant A. Patil, Tanvina B. Patel, Nirmesh J. Shah, Hardik B. Sailor, Raghava Krishnan, G. R. Kasthuri, T. Nagarajan, S. Lilly Christina, Naresh Kumar, Veera Raghavendra, S. Prahallad Kishore, S. R. Mahadeva Prasanna, Nagaraj Adiga, Sanasam Ranbir Singh, Anand Konjengbam, Pranaw Kumar, Bira Chandra Singh, S. L. Binil Kumar, T. G. Bhadran, T. Sajini, Arup Saha, Tulika Basu, K. Sreenivasa Rao, N. P. Narendra, Anil Kumar Sao, Rakesh Kumar, Pranhari Talukdar, Purnendu Acharyaa, Somnath Chandra, Swaran Lata, Hema A. Murthy:
A syllable-based framework for unit selection synthesis in 13 Indian languages. O-COCOSDA/CASLRE 2013: 1-8
- [c1] Hemant A. Patil, Tanvina B. Patel, Swati Talesara, Nirmesh J. Shah, Hardik B. Sailor, Bhavik B. Vachhani, Janki Akhani, Bhargav Kanakiya, Yashesh Gaur, Vibha Prajapati:
Algorithms for speech segmentation at syllable-level for text-to-speech synthesis system in Gujarati. O-COCOSDA/CASLRE 2013: 1-7
last updated on 2025-01-27 21:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license