Prioritizing Literature Search Results Using a Training Set of Classified Documents.
Sérgio MatosJosé Luís OliveiraPublished in: PACBB (2011)
Keyphrases
- training set
- training data
- test set
- document collections
- active learning
- classification accuracy
- information retrieval
- relevant documents
- xml documents
- information retrieval systems
- document classification
- free text
- scientific literature
- web documents
- document clustering
- document retrieval
- text documents
- cross validation
- legal documents
- database
- training examples
- keywords
- svm classifier
- training samples
- metadata
- document content
- data sets
- nearest neighbor
- vector space
- information extraction
- feature space
- active appearance models
- structured documents
- biomedical literature