SMGKM: An Efficient Incremental Algorithm for Clustering Document Collections.
Adil M. BagirovSattar SeifollahiMassimo PiccardiEhsan Zare BorzeshiBernie KrugerPublished in: CICLing (2) (2018)
Keyphrases
- document collections
- document clustering
- topic detection
- information retrieval systems
- document retrieval
- information retrieval
- test collection
- text retrieval
- clustering algorithm
- document clusters
- k means
- scatter gather
- document representation
- text clustering
- relevant documents
- index terms
- digital libraries
- ad hoc retrieval
- cross language
- clustering method
- text data
- text classification
- data points
- learning process
- databases
- automatic document classification
- text corpora
- xml retrieval
- text collections
- cluster analysis
- text mining
- metadata
- machine learning