Document Spanners: A Formal Approach to Information Extraction.
Ronald FaginBenny KimelfeldFrederick ReissStijn VansummerenPublished in: J. ACM (2015)
Keyphrases
- information extraction
- web documents
- text documents
- information retrieval
- unstructured documents
- text mining
- natural language processing
- machine learning
- document classification
- document images
- document processing
- question answering
- document clustering
- precision and recall
- cross document
- information retrieval systems
- text summarization
- named entities
- free text
- natural language
- document representation
- ontology based information extraction
- semantic information
- semi structured
- web mining
- user queries
- keywords
- relational learning
- natural language text
- document collections
- text classification
- co occurrence
- formal model
- named entity recognition
- relation extraction
- formal methods
- probabilistic model