Video-to-Audio Generation with Hidden Alignment.
Manjie XuChenxing LiYong RenRilin ChenYu GuWei LiangDong YuPublished in: CoRR (2024)
Keyphrases
- multimedia
- audio video
- scene change detection
- video content analysis
- digital video
- visual data
- multimedia processing
- video data
- video files
- audio files
- video sequences
- multimedia information
- real time
- video analysis
- digital audio
- video database
- video copy detection
- broadcast news
- audio stream
- video clips
- video content
- signal processing
- video indexing
- video annotation
- story segmentation
- multimedia data
- long video
- audio signals
- video streams
- closed captions
- video material
- visual information
- lecture videos
- audio visual content
- soccer video
- video signals
- media streams
- video recordings
- mouth region
- space time
- video retrieval
- video frames
- video indexing and retrieval
- surveillance videos
- content based video retrieval
- image sequences
- high definition
- audio signal
- audio features
- audio visual