Varun Nagaraja
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 13
Top Topics
Discriminative Training
Speech Recognition
Causal Theories
Audio Visual
Top Venues
CoRR
ICASSP
Interspeech
Publications
Gaël Le Lan, Varun Nagaraja, Ernie Chang, David Kant, Zhaoheng Ni, Yangyang Shi, Forrest N. Iandola, Vikas Chandra
Stack-and-Delay: A New Codebook Pattern for Music Generation.
ICASSP
(2024)
Ernie Chang, Sidd Srinivasan, Mahi Luthra, Pin-Jie Lin, Varun Nagaraja, Forrest N. Iandola, Zechun Liu, Zhaoheng Ni, Changsheng Zhao, Yangyang Shi, Vikas Chandra
On the Open Prompt Challenge in Conditional Audio Generation.
ICASSP
(2024)
Gaël Le Lan, Bowen Shi, Zhaoheng Ni, Sidd Srinivasan, Anurag Kumar, Brian Ellis, David Kant, Varun Nagaraja, Ernie Chang, Wei-Ning Hsu, Yangyang Shi, Vikas Chandra
High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching.
CoRR
(2024)
Gaël Le Lan, Varun Nagaraja, Ernie Chang, David Kant, Zhaoheng Ni, Yangyang Shi, Forrest N. Iandola, Vikas Chandra
Stack-and-Delay: a new codebook pattern for music generation.
CoRR
(2023)
Yangyang Shi, Gaël Le Lan, Varun Nagaraja, Zhaoheng Ni, Xinhao Mei, Ernie Chang, Forrest N. Iandola, Yang Liu, Vikas Chandra
Enhance audio generation controllability through representation similarity regularization.
CoRR
(2023)
Xinhao Mei, Varun Nagaraja, Gaël Le Lan, Zhaoheng Ni, Ernie Chang, Yangyang Shi, Vikas Chandra
FoleyGen: Visually-Guided Audio Generation.
CoRR
(2023)
Ernie Chang, Sidd Srinivasan, Mahi Luthra, Pin-Jie Lin, Varun Nagaraja, Forrest N. Iandola, Zechun Liu, Zhaoheng Ni, Changsheng Zhao, Yangyang Shi, Vikas Chandra
On The Open Prompt Challenge In Conditional Audio Generation.
CoRR
(2023)
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution.
ICASSP
(2022)
Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra
Collaborative Training of Acoustic Encoders for Speech Recognition.
CoRR
(2021)
Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra
Collaborative Training of Acoustic Encoders for Speech Recognition.
Interspeech
(2021)
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency.
Interspeech
(2021)
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution.
CoRR
(2021)
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency.
CoRR
(2021)