Constant-Delay Enumeration for Nondeterministic Document Spanners.
Antoine Amarilli, Pierre Bourhis, Stefan Mengel, Matthias Niewerth
Published in: SIGMOD Rec. (2020)
Keyphrases
- information retrieval systems
- retrieval systems
- document images
- structured documents
- search space
- web documents
- finite state
- information retrieval
- document clustering
- document retrieval
- text documents
- document collections
- machine learning
- semantic information
- relevant documents
- text mining
- dynamic programming
- digital libraries
- tf-idf
- document representation
- learning algorithm
- multimedia documents
- document analysis
- textual content
- digital documents