Multilingual Open Text Release 1: Public Domain News in 44 Languages.
Chester Palen-MichelJune KimConstantine LignosPublished in: LREC (2022)
Keyphrases
- multi lingual
- language specific
- language independent
- cross lingual
- multilingual documents
- web news
- news articles
- text generation
- comparable corpora
- keywords
- news stories
- financial news
- indian languages
- english text
- text documents
- multilingual information retrieval
- machine translation system
- machine translation
- language resources
- cross media
- n gram
- expressive power
- arabic language
- text summarization
- information access
- language identification
- short texts
- news video
- text retrieval
- text mining
- manually constructed
- cross language
- information retrieval
- online news
- cross lingual information retrieval
- native language
- news sources
- natural language
- news items
- co occurrence
- short text
- statistical machine translation
- social media
- target language
- text collections
- cross language information retrieval