Multilingual document clustering: state of the art (Construction de corpus multilingues : état de l'art) [in French].
Manuela Yapomo
Published in: RÉCITAL (2013)
Keyphrases
- document clustering
- data mining and information retrieval
- cross-lingual
- document corpus
- parallel corpus
- text mining
- document collections
- clustering algorithm
- clustering method
- document representation
- similar documents
- non-negative matrix factorization
- document clusters
- vector space model
- text documents
- tf-idf
- k-means
- cross-language
- cluster analysis
- topic detection
- information retrieval systems
- co-occurrence
- tolerance rough set
- digital libraries