Audio-Driven Talking Face Video Generation with Dynamic Convolution Kernels.
Zipeng YeMengfei XiaRan YiJuyong ZhangYu-Kun LaiXuwei HuangGuo-Xin ZhangYong-Jin LiuPublished in: CoRR (2022)
Keyphrases
- multimedia
- audio video
- video data
- digital video
- scene change detection
- video content analysis
- real time
- video sequences
- video frames
- multimedia processing
- visual data
- dynamic environments
- video files
- audio stream
- video analysis
- digital audio
- audio signals
- audio features
- computer vision
- audio signal
- video database
- video surveillance
- low level
- soccer video
- face recognition
- audio visual
- story segmentation
- video clips
- multimodal fusion
- video content
- media streams
- long video
- mouth region
- multi modal