Columnar Formats for Schemaless LSM-based Document Stores.
Wail Y. AlkowaileetMichael J. CareyPublished in: Proc. VLDB Endow. (2022)
Keyphrases
- pdf documents
- xml format
- electronic documents
- web documents
- information retrieval
- document images
- document clustering
- document collections
- document retrieval
- retrieval systems
- structured documents
- document classification
- text documents
- keywords
- metadata
- neural network
- text categorization
- information retrieval systems
- machine learning
- digital documents
- scientific documents
- real time