Exploring topics in the field of data science by analyzing Wikipedia documents: A preliminary result.
Yanyan Wang, Soohyung Joo, Kun Lu
Published in: ASIST (2014)
Keyphrases
- Wikipedia pages
- document collections
- Wikipedia articles
- information retrieval
- data science
- keywords
- text documents
- probabilistic topic models
- scientific literature
- topic modeling
- document set
- big data
- document representation
- latent Dirichlet allocation
- database
- information theoretic
- relevant documents
- WordNet
- information retrieval systems
- topic detection
- document corpus
- natural language text
- text classification
- relevance assessments
- metadata
- search engine
- databases