Tuned and GPU-accelerated parallel data mining from comparable corpora.
Krzysztof Wolk, Krzysztof Marasek
Published in: CoRR (2015)
Keyphrases
- gpu accelerated
- parallel data mining
- comparable corpora
- cross language information retrieval
- parallel corpora
- data mining
- news articles
- language modeling
- bilingual lexicon
- text corpora
- finite element
- machine translation
- real time
- word pairs
- text documents
- language model
- bilingual dictionaries
- information retrieval
- linguistic resources
- query translation
- bi directional
- cross language
- text mining
- labor intensive
- retrieval model
- query expansion