Constant-delay enumeration algorithms for document spanners over nested documents.
Martin Muñoz, Cristian Riveros
Published in: CoRR (2020)
Keyphrases
- document collections
- information retrieval
- web documents
- document clustering
- document classification
- keywords
- information retrieval systems
- text documents
- document representation
- learning algorithm
- retrieval systems
- document analysis
- document type
- document content
- document processing
- vector space model
- text classifiers
- structured documents
- graph theory
- information extraction