Semantic-Based Multilingual Document Clustering via Tensor Modeling.
Salvatore RomeoAndrea TagarelliDino IencoPublished in: EMNLP (2014)
Keyphrases
- document clustering
- cross lingual
- text mining
- topic extraction
- clustering algorithm
- document representation
- document collections
- clustering method
- text documents
- tolerance rough set
- negative matrix factorization
- document clusters
- ant based clustering
- k means
- digital libraries
- cluster analysis
- text categorization
- cross language information retrieval
- real world