Combining NLP and probabilistic categorisation for document and term selection for Swiss-Prot medical annotation.
Pavel B. Dobrokhotov, Cyril Goutte, Anne-Lise Veuthey, Éric Gaussier
Published in: ISMB (Supplement of Bioinformatics) (2003)
Keyphrases
- term selection
- text categorization
- ad hoc retrieval
- query expansion
- relevant documents
- natural language processing
- information retrieval
- pseudo relevance feedback
- expansion terms
- document frequency
- probabilistic model
- information extraction
- relevance feedback
- generative model
- metadata
- document collections
- text classification
- natural language
- question answering
- document retrieval
- retrieval systems
- web documents
- image retrieval
- text mining
- document representation
- user queries
- language model
- retrieved documents
- relevance model
- active learning
- selection mechanism
- search engine
- feature selection
- IR models
- document clustering
- text documents
- test collection
- data fusion