Text Mining in the SOMLib Digital Library System: The Representation of Topics and Genres.
Andreas RauberDieter MerklPublished in: Appl. Intell. (2003)
Keyphrases
- text mining
- topic models
- latent dirichlet allocation
- information retrieval
- text documents
- topic modeling
- information extraction
- digital libraries
- biomedical literature
- natural language processing
- knowledge discovery
- information retrieval systems
- text corpora
- image representation
- web mining
- text data
- data mining
- word counts
- feature representation
- text classification
- knn
- active learning
- multiscale