Visual-guided scene-aware audio generation method based on hierarchical feature codec and rendering decision.
Ruiqi WangHaonan ChengLong YeQin ZhangPublished in: Displays (2024)
Keyphrases
- generation method
- visual data
- visual information
- d scene
- semantic context
- ambient occlusion
- observed scene
- cross modal
- visual features
- rendered images
- image based rendering
- decision making
- visual scene
- audio visual
- single image
- video sequences
- cepstral features
- image sequences
- scene categorization
- image generation
- low level
- computer graphics
- image features
- spatial relations
- image based modeling
- global illumination
- visual input
- image rendering
- photorealistic rendering
- feature vectors
- motion features
- high quality
- feature set
- three dimensional
- video coding
- multimedia
- moving objects
- photorealistic
- complex scenes
- view dependent
- computer vision
- video data
- depth map
- visual effects
- inter frame
- real world objects
- texture mapping
- scene understanding
- multiple images