Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild.
Ganglai WangPeng ZhangLei XieWei HuangYufei ZhaPublished in: CoRR (2022)
Keyphrases
- audio visual
- person authentication
- audio visual speech recognition
- multi modal
- multimodal fusion
- visual information
- facial features
- visual data
- multi stream
- temporal context
- multimedia
- human faces
- facial expressions
- face images
- data processing
- data sets
- facial images
- biometric systems
- biometric identification
- three dimensional
- computer vision