Two Multilingual Corpora Extracted from the Tenders Electronic Daily for Machine Learning and Machine Translation Applications.
Oussama Ahmia, Nicolas Béchet, Pierre-François Marteau
Published in: LREC (2018)
Keyphrases
- machine translation
- chinese english
- natural language processing
- machine learning
- parallel corpus
- cross lingual
- language independent
- cross language information retrieval
- language resources
- comparable corpora
- information extraction
- statistical machine translation
- language specific
- multilingual documents
- cross lingual information retrieval
- parallel corpora
- machine translation system
- natural language
- language processing
- wordnet
- linguistic resources
- cross language
- word alignment
- machine readable dictionaries
- target language
- artificial intelligence
- word sense disambiguation
- question answering
- natural language generation
- lexical knowledge
- query translation
- text classification
- text mining
- knowledge representation
- bilingual dictionaries
- translation model
- named entities
- finite state transducers
- bilingual lexicon