SeNMFk-SPLIT: large corpora topic modeling by semantic non-negative matrix factorization with automatic model selection.
Maksim Ekin Eren, Nick Solovyev, Manish Bhattarai, Kim Ø. Rasmussen, Charles Nicholas, Boian S. Alexandrov
Published in: DocEng (2022)
Keyphrases
- non-negative matrix factorization
- topic modeling
- automatic model selection
- topic models
- matrix factorization
- probabilistic latent semantic analysis
- factor analysis
- collaborative filtering
- document clustering
- latent dirichlet allocation
- natural language processing
- principal component analysis
- text mining
- model selection
- natural language
- sparse representation
- mixture model
- spectral clustering
- text documents
- information extraction
- information retrieval
- prior knowledge
- machine learning
- generative model