SAUCE: Truncated Sparse Document Signature Bit-Vectors for Fast Web-Scale Corpus Expansion.
Muntasir WahedDaniel GruhlAlfredo AlbaAnna Lisa GentilePetar RistoskiChad DeLucaSteve WelchIsmini LourentzouPublished in: CoRR (2021)
Keyphrases
- web scale
- bit vectors
- million images
- bit vector
- image search
- semi structured
- information retrieval
- web documents
- information retrieval systems
- document collections
- retrieval systems
- web images
- database
- high dimensional
- keywords
- multi modal
- semantic information
- text classification
- information extraction
- image data
- gray code
- data mining