Text Clustering on Latent Thematic Spaces: Variants, Strengths and Weaknesses.
Xavier Sevillano, Germán Cobo, Francesc Alías, Joan Claudi Socoró
Published in: ICA (2007)
Keyphrases
- strengths and weaknesses
- text clustering
- document clustering
- advantages and disadvantages
- text mining
- clustering algorithm
- text categorization
- background knowledge
- hierarchical clustering
- text data
- user feedback
- k-means
- text documents
- WordNet
- text collections
- latent semantic analysis
- relative strengths and weaknesses
- self organizing maps
- metric learning
- document collections
- document representation
- semantic information
- structured data
- user interaction
- text classification
- information extraction
- high dimensional
- information retrieval
- neural network