Automated Audio Captioning using Audio Event Clues.
Aysegül Özkaya Eren, Mustafa Sert
Published in: CoRR (2022)
Keyphrases
- multimedia
- signal processing
- audio video
- visual data
- soccer video
- image processing
- visual information
- audio visual
- audio recordings
- audio signals
- music scores
- audio stream
- music score
- text graphics
- cross modal
- audio signal
- text to speech
- multimedia information
- emotion recognition
- digital audio
- semi automated
- music genre classification
- event detection
- hidden markov models
- image sequences
- computer vision