Using LDA to detect semantically incoherent documents.
Hemant MisraOlivier CappéFrançois YvonPublished in: CoNLL (2008)
Keyphrases
- topic modeling
- latent dirichlet allocation
- latent dirichlet
- latent semantic analysis
- topic discovery
- linear discriminant analysis
- latent topics
- topic models
- semantic information
- document retrieval
- lda model
- web documents
- document collections
- text documents
- discriminant analysis
- detection method
- probabilistic topic models
- information retrieval systems
- face recognition
- generative model
- relevant documents
- electronic documents
- semantic content
- keywords
- document classification
- information retrieval
- document analysis
- co occurrence
- detection algorithm
- xml documents
- ranked list
- document representation
- natural language
- document clustering
- feature extraction
- vector space
- machine learning
- em algorithm
- vector space model
- dimension reduction
- metadata