M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation.
Benjamin HsuXiaoyu LiuHuayang LiYoshinari FujinumaMaria NadejdeXing NiuYair KittenplonRon LitmanRaghavendra Reddy PappagariPublished in: CoRR (2024)
Keyphrases
- multi modal
- benchmark datasets
- lexical cohesion
- machine translation
- document level
- evaluation metrics
- chinese english
- text summarization
- language model
- sentiment classification
- natural language processing
- sentence level
- cross lingual
- pseudo relevance feedback
- query expansion
- word sense disambiguation
- high dimensional
- audio visual
- coreference resolution
- word level
- cross language information retrieval
- image annotation
- information extraction
- language modeling
- semantic similarity
- statistical machine translation
- machine translation system
- knn