Automated Audio Captioning and Language-Based Audio Retrieval.
Clive Gomes, Hyejin Park, Patrick Kollman, Yi Song
Published in: CoRR (2022)
Keyphrases
- multimedia information
- multimedia
- audio visual content
- human language
- cross modal
- digital video
- audio stream
- text to speech
- information retrieval
- audio visual
- semi automated
- lifelog
- visual information
- signal processing
- visual data
- emotion recognition
- content based retrieval
- multimedia data
- multimedia documents
- multimedia information retrieval
- information retrieval systems
- audio signals
- programming language
- image database
- video indexing and retrieval
- audio content
- relevance feedback
- metadata