Unsupervised Construction of Quasi-comparable Corpora and Probing for Parallel Textual Data.
Krzysztof WolkKrzysztof MarasekPublished in: MISSI (2016)
Keyphrases
- textual data
- terminology extraction
- comparable corpora
- information extraction
- structured data
- text documents
- text mining
- natural language processing
- cross language information retrieval
- textual information
- raw data
- parallel corpora
- unsupervised learning
- text collections
- news articles
- text categorization
- semi supervised
- supervised learning
- formal concept analysis
- data mining
- text corpora
- word pairs
- digital libraries