Audio-Driven Talking Face Video Generation with Emotion.
Jiadong LiangFeng LuPublished in: VR Workshops (2024)
Keyphrases
- multimodal fusion
- audio video
- multimedia
- facial expressions
- scene change detection
- emotion recognition
- video sequences
- audio visual
- mouth region
- digital video
- multimedia processing
- visual data
- video files
- video content
- video content analysis
- audio files
- multimedia information
- audio stream
- video data
- video streams
- video material
- multimedia data
- digital audio
- audio features
- audio signals
- video clips
- face detection and tracking
- media streams
- signal processing
- story segmentation
- visual speech
- human faces
- video analysis
- soccer video
- audio visual content
- multi modal
- moving objects
- real time
- visual information
- facial images
- video copy detection
- video frames
- face images
- computer vision
- broadcast news
- feature vectors
- video database
- gait recognition
- video indexing and retrieval
- video annotation
- video signals
- multimodal interfaces