Login / Signup
Karren Yang
Publication Activity (10 Years)
Years Active: 2020-2023
Publications (10 Years): 11
Top Topics
Multimodal Fusion
Speech Enhancement
Camera Pose
Single Channel
Top Venues
CoRR
CVPR
ICASSP
ECCV (37)
</>
Publications
</>
Hsuan Su
,
Ting-Yao Hu
,
Hema Swetha Koppula
,
Raviteja Vemulapalli
,
Jen-Hao Rick Chang
,
Karren Yang
,
Gautam Varma Mantena
,
Oncel Tuzel
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models.
CoRR
(2023)
Byeongjoo Ahn
,
Karren Yang
,
Brian Hamilton
,
Jonathan Sheaffer
,
Anurag Ranjan
,
Miguel Sarabia
,
Oncel Tuzel
,
Jen-Hao Rick Chang
Novel-View Acoustic Synthesis from 3D Reconstructed Rooms.
CoRR
(2023)
Karren Yang
,
Ting-Yao Hu
,
Jen-Hao Rick Chang
,
Hema Swetha Koppula
,
Oncel Tuzel
Text is all You Need: Personalizing ASR Models Using Controllable Speech Synthesis.
ICASSP
(2023)
Karren Yang
,
Ting-Yao Hu
,
Jen-Hao Rick Chang
,
Hema Swetha Koppula
,
Oncel Tuzel
Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis.
CoRR
(2023)
Karren Yang
,
Dejan Markovic
,
Steven Krenn
,
Vasu Agrawal
,
Alexander Richard
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis.
CoRR
(2022)
Karren Yang
,
Dejan Markovic
,
Steven Krenn
,
Vasu Agrawal
,
Alexander Richard
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis.
CVPR
(2022)
Karren Yang
,
Wan-Yi Lin
,
Manash Barman
,
Filipe Condessa
,
J. Zico Kolter
Defending Multimodal Fusion Models against Single-Source Adversaries.
CoRR
(2022)
Karren Yang
,
Michael Firman
,
Eric Brachmann
,
Clément Godard
Camera Pose Estimation and Localization with Active Audio Sensing.
ECCV (37)
(2022)
Karren Yang
,
Wan-Yi Lin
,
Manash Barman
,
Filipe Condessa
,
J. Zico Kolter
Defending Multimodal Fusion Models Against Single-Source Adversaries.
CVPR
(2021)
Karren Yang
,
Bryan Russell
,
Justin Salamon
Telling Left From Right: Learning Spatial Correspondence of Sight and Sound.
CVPR
(2020)
Karren Yang
,
Bryan Russell
,
Justin Salamon
Telling Left from Right: Learning Spatial Correspondence of Sight and Sound.
CoRR
(2020)