Audio-Language Datasets of Scenes and Events: A Survey.
Gijs WijngaardElia FormisanoMichele EspositoMichel DumontierPublished in: CoRR (2024)
Keyphrases
- human language
- multimedia
- audio stream
- long video
- programming language
- event detection
- soccer video
- language learning
- video clips
- interesting events
- natural language
- natural scenes
- video scene
- audio signals
- spatio temporal patterns
- computer vision
- visual information
- database
- temporal data
- audio visual
- signal processing
- data mining
- data sets