Familia: An Open-Source Toolkit for Industrial Topic Modeling.
Di JiangZeyu ChenRongzhong LianSiqi BaoChen LiPublished in: CoRR (2017)
Keyphrases
- topic modeling
- open source
- topic models
- latent dirichlet allocation
- text classification
- text mining
- topic extraction
- modeling framework
- collaborative filtering
- scientific articles
- case study
- latent topics
- neural network
- pattern recognition
- information retrieval
- text corpora
- document collections
- latent variables
- artificial intelligence
- real world
- databases