TTC TermSuite - A UIMA Application for Multilingual Terminology Extraction from Comparable Corpora.
Jérôme Rocheteau, Béatrice Daille
Published in: IJCNLP (2011)
Keyphrases
- terminology extraction
- comparable corpora
- cross language information retrieval
- language modeling
- document clustering
- cross language
- news articles
- textual data
- parallel corpora
- machine translation
- query terms
- information retrieval
- text categorization
- wikipedia articles
- word pairs
- knowledge discovery
- digital libraries