A Span Extraction Approach for Information Extraction on Visually-Rich Documents.
Tuan-Anh D. NguyenHieu M. VuNguyen Hong SonMinh-Tien NguyenPublished in: ICDAR Workshops (2) (2021)
Keyphrases
- information extraction
- free text
- text documents
- web documents
- information retrieval
- unstructured documents
- text analysis
- natural language text
- natural language processing
- text mining
- textual data
- document collections
- unstructured text
- extraction rules
- named entity recognition
- precision and recall
- structured data
- web information extraction
- natural language
- named entities
- document classification
- relational learning
- relation extraction
- information retrieval systems
- machine learning
- document clustering
- information extraction systems
- text summarization
- xml documents
- relevant documents
- semi structured
- web mining
- metadata
- question answering
- high level
- machine translation
- text processing
- document set
- data extraction
- document structure
- web data
- retrieval systems
- textual information
- knowledge discovery
- document analysis
- news articles