Topic Discovery in Massive Text Corpora Based on Min-Hashing.
Gibran Fuentes PinedaIván Vladimir Meza RuízPublished in: CoRR (2018)
Keyphrases
- text corpora
- topic discovery
- text analysis
- topic models
- text mining
- latent dirichlet allocation
- topic modeling
- text classification
- text documents
- data analysis
- data structure
- information extraction
- natural language processing
- text classifiers
- text collections
- latent variables
- computational linguistics
- probabilistic model
- information retrieval
- document collections
- language model
- image segmentation