Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment.
Shengqiong WuHao FeiWei JiTat-Seng ChuaPublished in: ACL (1) (2023)
Keyphrases
- cross lingual
- cross modal
- machine translation
- language modeling
- text classification
- image classification
- image content
- image retrieval
- image features
- multi modal
- visual data
- image representation
- image collections
- language model
- document clustering
- low level
- news articles
- transfer learning
- machine learning
- visual similarity
- web images
- machine learning algorithms
- visual words
- query expansion
- active learning
- similarity measure
- information retrieval