Metadata Extraction for Low-Quality Semi-structured Spreadsheets.
Arwa Awad, Rania Elgohary, Ibrahim F. Moawad, Mohamed Roushdy
Published in: AICV (2020)
Keyphrases
- low quality
- semi structured
- metadata extraction
- unstructured text
- structured data
- structured information
- digital libraries
- metadata
- high quality
- information integration
- information extraction
- data model
- data extraction
- semi structured data
- web documents
- unstructured data
- metadata management
- web data
- text mining
- genre classification
- structured knowledge
- content features
- databases
- multimedia
- web sources
- database systems
- xml databases
- linked data
- active learning
- web pages
- data sets