Login / Signup
Rohit Paturi
Publication Activity (10 Years)
Years Active: 2015-2024
Publications (10 Years): 12
Top Topics
Conversational Speech
Speaker Diarization
Error Correction
Automatic Speech Recognition
Top Venues
CoRR
INTERSPEECH
EMNLP
ASRU
</>
Publications
</>
Nilaksh Das
,
Saket Dingliwal
,
Srikanth Ronanki
,
Rohit Paturi
,
Zhaocheng Huang
,
Prashant Mathur
,
Jie Yuan
,
Dhanush Bekal
,
Xing Niu
,
Sai Muralidhar Jayanthi
,
Xilai Li
,
Karel Mundnich
,
Monica Sunkara
,
Sundararajan Srinivasan
,
Kyu J. Han
,
Katrin Kirchhoff
SpeechVerse: A Large-scale Generalizable Audio Language Model.
CoRR
(2024)
Rohit Paturi
,
Xiang Li
,
Sundararajan Srinivasan
AG-LSEC: Audio Grounded Lexical Speaker Error Correction.
CoRR
(2024)
Xiang Li
,
Vivek Govindan
,
Rohit Paturi
,
Sundararajan Srinivasan
Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization.
CoRR
(2024)
Juan Pablo Zuluaga-Gomez
,
Zhaocheng Huang
,
Xing Niu
,
Rohit Paturi
,
Sundararajan Srinivasan
,
Prashant Mathur
,
Brian Thompson
,
Marcello Federico
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation.
EMNLP
(2023)
Rohit Paturi
,
Sundararajan Srinivasan
,
Xiang Li
Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction.
CoRR
(2023)
Yogesh Virkar
,
Brian Thompson
,
Rohit Paturi
,
Sundararajan Srinivasan
,
Marcello Federico
Speaker Diarization of Scripted Audiovisual Content.
CoRR
(2023)
Veera Raghavendra Elluru
,
Devang Kulshreshtha
,
Rohit Paturi
,
Sravan Bodapati
,
Srikanth Ronanki
Generalized zero-shot audio-to-intent classification.
CoRR
(2023)
Rohit Paturi
,
Sundararajan Srinivasan
,
Xiang Li
Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction.
INTERSPEECH
(2023)
Veera Raghavendra Elluru
,
Devang Kulshreshtha
,
Rohit Paturi
,
Sravan Bodapati
,
Srikanth Ronanki
Generalized Zero-Shot Audio-to-Intent Classification.
ASRU
(2023)
Juan Zuluaga-Gomez
,
Zhaocheng Huang
,
Xing Niu
,
Rohit Paturi
,
Sundararajan Srinivasan
,
Prashant Mathur
,
Brian Thompson
,
Marcello Federico
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation.
CoRR
(2023)
Rohit Paturi
,
Sundararajan Srinivasan
,
Katrin Kirchhoff
,
Daniel Garcia-Romero
Directed speech separation for automatic speech recognition of long form conversational speech.
INTERSPEECH
(2022)
Rohit Paturi
,
Sundararajan Srinivasan
,
Katrin Kirchhoff
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech.
CoRR
(2021)
Jinxi Guo
,
Rohit Paturi
,
Gary Yeung
,
Steven M. Lulich
,
Harish Arsikere
,
Abeer Alwan
Age-dependent height estimation and speaker normalization for children's speech using the first three subglottal resonances.
INTERSPEECH
(2015)