Regularized Dual-PPMI Co-clustering for Text Data.
Séverine AffeldtLazhar LabiodMohamed NadifPublished in: SIGIR (2021)
Keyphrases
- text data
- text clustering
- text mining
- text classification
- high dimensional
- structured data
- topic hierarchies
- document collections
- text documents
- high dimensional data
- data sets
- clustering algorithm
- information extraction
- web pages
- multimedia
- clustering method
- digital libraries
- document clustering
- dimensionality reduction
- semi supervised
- text categorization
- expectation maximization
- metadata
- topic models
- natural language processing
- nearest neighbor
- data mining
- real world
- pattern recognition