A cost-based storage format selector for materialized results in big data frameworks.
Rana Faisal MunirAlberto AbellóOscar RomeroMaik ThieleWolfgang LehnerPublished in: Distributed Parallel Databases (2020)
Keyphrases
- big data
- cloud computing
- big data analytics
- data management
- data analysis
- social media
- data intensive
- data processing
- high volume
- data warehousing
- business intelligence
- data warehouse
- storage cost
- unstructured data
- massive data
- query plan
- predictive modeling
- materialized views
- vast amounts of data
- knowledge discovery
- massive datasets
- health informatics
- data science
- data cube
- database
- semi structured
- open source
- case study
- metadata