YFACC: A Yorùbá Speech-Image Dataset for Cross-Lingual Keyword Localisation Through Visual Grounding.
Kayode OlaleyeDan OneataHerman KamperPublished in: SLT (2022)
Keyphrases
- cross lingual
- image dataset
- multi lingual
- machine translation
- language modeling
- language independent
- image database
- cross lingual information retrieval
- visual features
- image collections
- automatic classification
- image annotation
- keywords
- visual information
- text classification
- low level
- transfer learning
- language model
- image retrieval
- high level
- decision trees
- knowledge discovery
- news articles
- feature extraction