Word unit based multilingual comparative analysis of text corpora.
Géza NémethCsaba ZainkóPublished in: INTERSPEECH (2001)
Keyphrases
- comparative analysis
- text corpora
- text corpus
- word pairs
- text mining
- computational linguistics
- text analysis
- topic models
- document collections
- digital libraries
- text collections
- co occurrence
- topic modeling
- concept hierarchy
- text classifiers
- text classification
- computer science
- text documents
- probabilistic topic models
- cross language information retrieval
- cross lingual
- n gram
- image classification