Login / Signup
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception.
HyoJung Han
Mohamed Anwar
Juan Pino
Wei-Ning Hsu
Marine Carpuat
Bowen Shi
Changhan Wang
Published in:
CoRR (2024)
Keyphrases
</>
cross lingual
noisy environments
visual speech
reinforcement learning
visual speech recognition
learning algorithm
speech recognition
search engine
active learning
supervised learning
visual information
speech signal
low level
noise reduction
audio visual