Selecting Text Features for Gene Name Classification: from Documents to Terms.
Goran NenadicSimon B. RiceIrena SpasicSophia AnaniadouBenjamin J. StapleyPublished in: BioNLP@ACL (2003)
Keyphrases
- textual features
- classification accuracy
- automatic categorization
- feature vectors
- feature set
- feature extraction
- classification method
- feature space
- information retrieval
- linguistic information
- text documents
- semantic information
- feature values
- text collections
- automatic text classification
- class labels
- text classification
- document categorization
- document classification
- text categorization
- text retrieval
- semantically related
- free text
- digital documents
- document collections
- web documents
- related documents
- bag of words
- information retrieval systems
- query terms
- document set
- text classifiers
- text clustering
- document content
- concept space
- machine learning
- multiword
- document analysis
- image classification
- document representation
- high dimensionality
- svm classifier
- textual content
- text lines
- text representation
- text data
- xml documents
- stop words
- writer identification
- handwritten text
- document corpus
- keywords