MTEB: Massive Text Embedding Benchmark.
Niklas MuennighoffNouamane TaziLoïc MagneNils ReimersPublished in: CoRR (2022)
Keyphrases
- data analysis
- free text
- database
- information retrieval
- natural language generation
- text information
- document analysis
- text analysis
- text collections
- text documents
- automatically extracted
- nonlinear dimensionality reduction
- watermarking algorithm
- textual data
- key concepts
- text data
- text retrieval
- semantic information
- structured data
- web documents
- co occurrence
- real world
- data sets