Creating a multilingual collocations dictionary from large text corpora.
Luka NerimaVioleta SeretanEric WehrliPublished in: EACL (2003)
Keyphrases
- text corpora
- text corpus
- text mining
- text analysis
- topic models
- topic modeling
- computational linguistics
- document collections
- n gram
- word pairs
- text documents
- digital libraries
- text classifiers
- concept hierarchy
- information retrieval
- text collections
- text classification
- knowledge discovery
- artificial intelligence
- databases
- probabilistic topic models
- latent dirichlet allocation
- word sense disambiguation
- classification accuracy
- metadata
- machine learning