The Scielo Corpus: a Parallel Corpus of Scientific Publications for Biomedicine.
Mariana L. NevesAntonio Jimeno-YepesAurélie NévéolPublished in: LREC (2016)
Keyphrases
- parallel corpus
- scientific publications
- cross lingual
- language independent
- metadata
- cross language information retrieval
- text mining
- machine translation
- query translation
- scientific literature
- machine translation system
- linked open data
- word alignment
- artificial intelligence
- life sciences
- statistical machine translation
- linked data
- document clustering
- target language
- cross language
- latent semantic analysis
- digital libraries
- machine learning
- language modeling
- user queries
- semantic web
- text classification
- language model
- data sources