Partially Materializable Delta Trees for Efficient Data Wrangling of Semi-Structured Contents.
Nico Schäfer, Sebastian Michel
Published in: EDBT (2020)
Keyphrases
- semi structured
- data sets
- database
- data extraction
- data sources
- semistructured data
- information extraction
- raw data
- artificial intelligence
- decision trees
- web documents
- data collection
- hierarchical data
- data collections
- data model
- machine learning
- data mining techniques
- text mining
- free text
- xml databases
- knowledge discovery
- semi structured data
- data analysis
- html pages
- structured knowledge