Investigations in Audio Captioning: Addressing Vocabulary Imbalance and Evaluating Suitability of Language-Centric Performance Metrics.
Sandeep KothintiDimitra EmmanouilidouPublished in: CoRR (2022)
Keyphrases
- human language
- natural language
- programming language
- multimedia
- language processing
- language learning
- machine learning
- audio visual
- signal processing
- evaluation metrics
- similarity metrics
- specification language
- cost sensitive
- audio stream
- language acquisition
- linguistic knowledge
- class distribution
- visual information
- information extraction
- feature space
- learning algorithm