Named entities as privileged information for hierarchical text clustering.
Roberta Akemi SinoaraCamila Vaccari SundermannRicardo M. MarcaciniMarcos Aurélio DominguesSolange O. RezendePublished in: IDEAS (2014)
Keyphrases
- named entities
- text clustering
- text mining
- text documents
- hierarchical clustering
- named entity recognition
- information extraction
- privileged information
- document clustering
- natural language processing
- text data
- co occurrence
- clustering algorithm
- hierarchical structure
- question answering
- text collections
- unsupervised learning
- document representation
- data mining
- machine learning
- text classification
- text categorization
- information retrieval
- k means
- background knowledge
- data analysis
- semantic relations
- document collections
- wordnet
- topic models
- text summarization
- model selection
- knowledge discovery
- generative model
- similarity measure
- image segmentation
- search engine
- data sets