Building topic models in a federated digital library through selective document exclusion.
Miles EfronPeter OrganisciakKatrina FenlonPublished in: ASIST (2011)
Keyphrases
- topic models
- digital libraries
- latent topics
- text documents
- topic discovery
- latent dirichlet allocation
- topic modeling
- document collections
- statistical topic models
- text mining
- latent variables
- generative model
- multimedia
- lda model
- text corpora
- relevance model
- probabilistic model
- latent topic models
- gibbs sampling
- co occurrence
- information retrieval
- language modeling framework
- document retrieval
- document clustering
- document classification
- news articles
- probabilistic topic models
- generative process
- hierarchical bayesian model
- web documents
- latent semantic analysis
- keywords
- tf idf