Fuzzy named entity-based document clustering.
Tru Hoang Cao, H. T. Do, Dunt T. Hong, Thanh Tho Quan
Published in: FUZZ-IEEE (2008)
Keyphrases
- document clustering
- named entities
- text documents
- text mining
- named entity recognition
- information extraction
- co-occurrence
- question answering
- document representation
- natural language processing
- clustering algorithm
- tf-idf
- text classification
- document clusters
- semantic features
- data mining
- knowledge discovery
- machine learning
- document collections
- topic models
- cluster analysis
- semi-supervised
- news articles
- cross-lingual
- information retrieval
- WordNet
- generative model
- maximum likelihood
- keywords
- automatic summarization
- databases