Investigation of Latent Semantic Analysis for Clustering of Czech News Articles.
Michal RottPetr CervaPublished in: DEXA Workshops (2014)
Keyphrases
- news articles
- latent semantic analysis
- document clustering
- text documents
- co occurrence
- tf idf
- text mining
- clustering algorithm
- k means
- clustering method
- information retrieval
- singular value decomposition
- topic modeling
- text classification
- latent dirichlet allocation
- text categorization
- document collections
- latent semantic indexing
- document representation
- visual words
- text summarization
- vector space model
- information extraction
- keywords
- machine learning
- wordnet