A Classifier to Evaluate Language Specificity of Medical Documents.
Trudi Miller, Gondy Leroy, Samir Chatterjee, Jie Fan, Brian Thoms
Published in: HICSS (2007)
Keyphrases
- text classifiers
- multilingual documents
- training documents
- document collections
- patient records
- document classification
- web documents
- information retrieval systems
- training data
- information retrieval
- programming language
- decision trees
- language learning
- text documents
- classification method
- logical structure
- classification algorithm
- svm classifier
- natural language
- training set
- support vector machine
- xml documents
- medical records
- medical diagnosis
- document clustering
- parallel corpus
- relevant documents
- indian languages
- classify documents
- text classification
- feature selection
- keywords
- support vector
- feature space
- metadata
- extensible markup language
- medical data
- xml data
- document retrieval
- data model
- retrieval systems
- co-occurrence
- feature set
- text categorization