A Latent Semantic Indexing-based approach to multilingual document clustering.
Chih-Ping WeiChristopher C. YangChia-Min LinPublished in: Decis. Support Syst. (2008)
Keyphrases
- document clustering
- latent semantic indexing
- document representation
- vector space model
- latent semantic space
- cross lingual
- text mining
- document collections
- text retrieval
- clustering method
- text documents
- negative matrix factorization
- digital libraries
- clustering algorithm
- singular value decomposition
- information retrieval
- vector space
- k means
- document retrieval
- bag of words
- web documents
- information extraction