Document Clustering and Cluster Topic Extraction in Multilingual Corpora.
Joaquim Ferreira da Silva, João Mexia, Carlos Agra Coelho, José Gabriel Pereira Lopes
Published in: ICDM (2001)
Keyphrases
- topic extraction
- document clustering
- document corpus
- clustering algorithm
- document clusters
- parallel corpus
- cluster analysis
- text mining
- document collections
- natural language processing
- text analysis
- k means
- text documents
- clustering method
- digital libraries
- document classification
- topic modeling
- blog entries
- information retrieval
- latent dirichlet allocation
- data mining
- topic models
- search engine