Golchin: A Distributed News Classifier for Persian News Archive using Enhanced Topic-based Vector Space Model.
Sayed Nasir KhalifehsoltaniAli VahdaniReza MoallemiPublished in: PDPTA (2009)
Keyphrases
- vector space model
- topic detection and tracking
- news articles
- news stories
- topic detection
- information retrieval
- document clustering
- feature selection
- document representation
- learning algorithm
- vector space
- text classification
- language model
- feature space
- training data
- web documents
- semantic information
- retrieval model
- semantic similarity
- keywords
- document collections
- bag of words
- low level
- multimedia
- index terms
- metadata
- knowledge base