Statistical Evaluation of Similarity Measures on Multi-lingual Text Corpora.
Robert Neumann, Rudolf Schmidt
Published in: TSD (1999)
Keyphrases
- text corpora
- multi lingual
- similarity measure
- text mining
- language independent
- information access
- computational linguistics
- information retrieval
- document collections
- topic models
- text analysis
- text documents
- text collections
- cross lingual
- topic modeling
- text classifiers
- concept hierarchy
- feature vectors
- language identification
- text classification
- natural language processing
- knowledge representation
- named entities
- language modeling
- document retrieval
- information retrieval systems
- supervised learning
- information extraction
- digital libraries
- feature extraction
- artificial intelligence