The Case For Alternative Web Archival Formats To Expedite The Data-To-Insight Cycle.
Xinyue WangZhiwu XiePublished in: CoRR (2020)
Keyphrases
- data sets
- data analysis
- database
- data structure
- data processing
- increasing rapidly
- data objects
- raw data
- essential information
- data extraction
- data points
- synthetic data
- statistical analysis
- data collection
- high quality
- web sources
- original data
- data mining
- prior knowledge
- website
- semantic web
- data mining techniques
- end users
- xml documents
- data quality
- web data
- web resources
- association rules
- metadata
- huge data
- training data