Automatic Extraction of Document Topics.
Luís F. S. TeixeiraGabriel Pereira LopesRita A. RibeiroPublished in: DoCEIS (2011)
Keyphrases
- automatic extraction
- html documents
- information retrieval
- text documents
- keywords
- latent topics
- topic discovery
- relation extraction
- document set
- document images
- information retrieval systems
- document clustering
- text collections
- topic detection
- latent dirichlet allocation
- document collections
- document classification
- web documents
- retrieval systems
- document corpus
- structured documents
- topic models
- wrapper generation
- document clusters
- term extraction
- text categorization
- keyphrases
- topic hierarchy
- natural language text
- relevant documents
- wikipedia pages
- technical papers
- news articles
- statistical topic models
- related documents
- blog posts
- question answering
- biomedical literature
- document retrieval
- key concepts
- document representation