BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation.
Liyan KangLuyang HuangNingxin PengPeihao ZhuZewei SunShanbo ChengMingxuan WangDegen HuangJinsong SuPublished in: ACL (Findings) (2023)
Keyphrases
- machine translation
- cross language information retrieval
- natural language processing
- target language
- multimedia
- cross lingual
- video data
- language independent
- machine translation system
- statistical machine translation
- information extraction
- language processing
- language resources
- chinese english
- video content
- word sense disambiguation
- source language
- word level
- finite state transducers
- machine readable dictionaries
- statistical translation models
- parallel corpora
- natural language generation
- natural language
- brazilian portuguese
- mt evaluation
- word alignment
- english chinese
- wordnet
- information retrieval