Topic modeling for large-scale text data.
Ximing LiJihong OuyangYou LuPublished in: Frontiers Inf. Technol. Electron. Eng. (2015)
Keyphrases
- text data
- topic modeling
- text mining
- text classification
- text documents
- topic models
- latent dirichlet allocation
- bag of words
- text categorization
- structured data
- information extraction
- knowledge discovery
- labeled data
- keywords
- document clustering
- natural language processing
- information retrieval
- data mining
- high dimensional
- n gram
- data analysis
- feature selection
- machine learning
- latent topics
- real world
- unsupervised learning
- web pages
- data sets
- k nearest neighbor
- named entities
- knowledge representation
- multiscale
- face recognition
- search engine
- probabilistic topic models