Moving beyond word lists: towards abstractive topic labels for human-like topics of scientific documents.
Domenic Rosati
Published in: CoRR (2022)
Keyphrases
- scientific documents
- related documents
- latent topics
- statistical topic models
- topic models
- latent dirichlet allocation
- related topics
- topic drift
- topic modeling
- keywords
- information retrieval
- information retrieval systems
- semantic similarity
- news articles
- document collections
- pdf documents
- document set
- document clustering
- co-occurrence
- relevant documents
- semantically related
- digital libraries
- text documents
- search engine
- n-gram
- probabilistic model
- website
- image classification
- text corpora
- text mining
- document representation
- scientific literature
- user queries
- writing style
- language model
- databases