Revisiting Bag of Words Document Representations for Efficient Ranking with Transformers.
David RauMostafa DehghaniJaap KampsPublished in: ACM Trans. Inf. Syst. (2024)
Keyphrases
- bag of words
- document representation
- image classification
- action recognition
- text classification
- image representation
- text documents
- language model
- n gram
- text representation
- web documents
- document collections
- anchor text
- data fusion
- vector space model
- web search
- document clustering
- semantic information
- ranking algorithm
- knowledge discovery
- feature extraction
- vector space
- feature selection
- search engine