Estimating term domain relevance through term frequency, disjoint corpora frequency - tf-dcf.
Lucelene LopesPaulo FernandesRenata VieiraPublished in: Knowl. Based Syst. (2016)
Keyphrases
- document frequency
- term frequency
- inverse document frequency
- tf idf
- term weighting
- text categorization
- retrieved documents
- term weights
- word frequency
- feature selection
- retrieval model
- text classification
- information gain
- okapi bm
- information retrieval
- average precision
- bag of words
- document representation
- query terms
- n gram
- text documents
- test collection
- relevant documents
- document retrieval
- retrieval effectiveness
- vector space model
- probabilistic retrieval model
- co occurrence
- language modeling
- retrieval systems
- information retrieval systems
- pairwise
- pseudo relevance feedback
- query expansion
- natural language processing
- machine learning
- data mining