Wikipedia-based hybrid document representation for textual news classification.
Marcos Mouriño-GarcíaRoberto Pérez-RodríguezLuis E. Anido-RifónManuel Vilares FerroPublished in: Soft Comput. (2018)
Keyphrases
- document representation
- document categorization
- bag of words
- document collections
- document clustering
- web documents
- topic detection and tracking
- keywords
- image classification
- vector space model
- machine learning
- feature extraction
- language model
- text classification
- text documents
- data fusion
- feature vectors
- vector space
- semantic relations
- natural language processing
- information extraction
- training data
- multimedia
- supervised learning
- feature space
- news articles
- high level
- computer vision