Login / Signup

Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy.

Wenxuan WuXueyuan ChenXixin WuHaizhou LiHelen Meng
Published in: CoRR (2024)
Keyphrases
  • pre trained
  • audio visual
  • training data
  • training examples
  • speech recognition
  • control signals
  • small number
  • data sets
  • viewpoint
  • wide range
  • feature space
  • visual information
  • speech signal