Login / Signup
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy.
Wenxuan Wu
Xueyuan Chen
Xixin Wu
Haizhou Li
Helen Meng
Published in:
CoRR (2024)
Keyphrases
</>
pre trained
audio visual
training data
training examples
speech recognition
control signals
small number
data sets
viewpoint
wide range
feature space
visual information
speech signal