Topic discovery in massive text corpora based on Min-Hashing.
Gibran Fuentes Pineda, Iván V. Meza-Ruíz
Published in: Expert Syst. Appl. (2019)
Keyphrases
- topic discovery
- text corpora
- text analysis
- topic models
- latent Dirichlet allocation
- text mining
- topic modeling
- text documents
- text classification
- information extraction
- probabilistic model
- latent variables
- data analysis
- data structure
- co-occurrence
- text collections
- query processing
- text classifiers
- artificial intelligence
- information retrieval
- natural language processing
- databases
- document collections
- prior knowledge
- computational linguistics