Effect of Document Representation on the Performance of Medical Document Classification.
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Bell
Published in: DMIN (2006)
Keyphrases
- document classification
- document representation
- text documents
- web documents
- document categorization
- text classification
- bag of words
- text mining
- text categorization
- document clustering
- text data
- document collections
- keywords
- vector space model
- information extraction
- topic models
- vector space
- data fusion
- classification algorithm
- web pages
- data sets
- semantic information
- natural language processing
- data analysis
- information retrieval
- topic modeling
- language model
- databases
- clustering algorithm
- semantic relations
- similarity measure
- training data
- n-gram
- association rules
- image classification
- supervised learning