Prioritized Named Entity Driven LDA for Document Clustering.
Durgesh KumarSanasam Ranbir SinghPublished in: PReMI (2) (2019)
Keyphrases
- document clustering
- named entities
- text mining
- text documents
- latent dirichlet allocation
- topic models
- topic modeling
- named entity recognition
- co occurrence
- information extraction
- natural language processing
- question answering
- text analysis
- tf idf
- dimensionality reduction
- information retrieval
- text classification
- news articles
- face recognition
- clustering method
- data analysis
- generative model
- knowledge discovery
- feature extraction
- principal component analysis
- document collections
- semantic features
- data mining
- unsupervised learning
- databases
- machine learning
- statistical topic models
- model selection
- k means
- pattern recognition
- clustering algorithm
- artificial intelligence