Caption Alignment for Low Resource Audio-Visual Data.
Vighnesh Reddy KondaMayur WarialaniRakesh Prasanth AchariVarad BhatnagarJayaprakash AkulaPreethi JyothiGanesh RamakrishnanGholamreza HaffariPankaj SinghPublished in: INTERSPEECH (2020)
Keyphrases
- visual data
- visual features
- visual information
- audio visual
- contextual information
- multimedia data
- multimodal information
- high dimensional data
- video data
- visual content
- image classification
- image data
- image retrieval
- high dimensional
- video sequences
- image sequences
- low level
- data sets
- image content
- human motion
- video retrieval
- semantic information
- data analysis
- human actions
- pattern recognition
- keywords