Constant-Delay Enumeration for Nondeterministic Document Spanners.
Antoine Amarilli, Pierre Bourhis, Stefan Mengel, Matthias Niewerth
Published in: CoRR (2018)
Keyphrases
- information retrieval
- information retrieval systems
- document collections
- relevant documents
- structured documents
- web documents
- document clustering
- document retrieval
- document images
- document analysis
- document structure
- document representation
- document classification
- retrieval systems
- search space
- digital libraries
- electronic documents
- database
- data sets
- document processing
- document content
- initial state
- finite state
- text documents
- learning algorithm
- machine learning