​
Login / Signup
Nirmesh Shah
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 8
Top Topics
Multi Modal Fusion
Language Model
Endpoint Detection
Text To Speech
Top Venues
CoRR
ICASSP
INTERSPEECH
SSW
</>
Publications
</>
Ashishkumar P. Gudmalwar
,
Nirmesh Shah
,
Sai Akarsh
,
Pankaj Wasnik
,
Rajiv Ratn Shah
VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech.
CoRR
(2024)
Neha Sahipjohn
,
Ashishkumar P. Gudmalwar
,
Nirmesh Shah
,
Pankaj Wasnik
,
Rajiv Ratn Shah
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing.
CoRR
(2024)
Nirmesh Shah
,
Mayank Kumar Singh
,
Naoya Takahashi
,
Naoyuki Onoe
Nonparallel Emotional Voice Conversion for Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing.
ICASSP
(2023)
Nirmesh Shah
,
Mayank Kumar Singh
,
Naoya Takahashi
,
Naoyuki Onoe
Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing.
CoRR
(2023)
Vishal Chudasama
,
Purbayan Kar
,
Ashish Gudmalwar
,
Nirmesh Shah
,
Pankaj Wasnik
,
Naoyuki Onoe
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation.
CoRR
(2022)
Vishal Chudasama
,
Purbayan Kar
,
Ashish Gudmalwar
,
Nirmesh Shah
,
Pankaj Wasnik
,
Naoyuki Onoe
M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation.
CVPR Workshops
(2022)
Tarun Sai Bandarupalli
,
Shakti Rath
,
Nirmesh Shah
,
Naoyuki Onoe
,
Sriram Ganapathy
Semi-supervised Acoustic and Language Modeling for Hindi ASR.
INTERSPEECH
(2022)
Maitreya Patel
,
Mihir Parmar
,
Savan Doshi
,
Nirmesh Shah
,
Hemant A. Patil
Novel Inception-GAN for Whispered-to-Normal Speech Conversion.
SSW
(2019)