Mining coherent topics in documents using word embeddings and large-scale text data.
Liang YaoYin ZhangQinfei ChenHongze QianBaogang WeiZhifeng HuPublished in: Eng. Appl. Artif. Intell. (2017)
Keyphrases
- text data
- text mining
- text documents
- text streams
- topic hierarchies
- concept space
- text classification
- high dimensional data
- topic detection
- latent topics
- document collections
- high dimensional
- text clustering
- topic models
- text analytics
- low dimensional
- n gram
- keywords
- knowledge discovery
- structured data
- document representation
- information extraction
- topic modeling
- dimensionality reduction
- textual data
- latent semantic
- data mining
- latent dirichlet allocation
- natural language processing
- text classifiers
- data analysis
- text corpora
- information retrieval
- vector space
- document clustering
- named entities
- bag of words
- nearest neighbor
- image retrieval
- relational databases
- relevant documents
- real world
- web pages
- feature selection
- query expansion
- artificial intelligence
- information retrieval systems