Login / Signup
DocSplit: Simple Contrastive Pretraining for Large Document Embeddings.
Yujie Wang
Mike Izbicki
Published in:
EMNLP (Findings) (2023)
Keyphrases
</>
information retrieval
distance measure
real time
information retrieval systems
document retrieval
learning algorithm
metadata
website
dimensionality reduction
document collections
document images
document classification
document content