Materialization and Reuse Optimizations for Production Data Science Pipelines.
Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Zoi Kaoudi, Tilmann Rabl, Volker Markl
Published in: SIGMOD Conference (2022)
Keyphrases
- data science
- big data
- statistical learning
- data warehousing
- data analysis
- cloud computing
- machine learning
- data management
- social media
- distributed databases
- software reuse
- data warehouse
- data processing
- business intelligence
- production planning
- production system
- materialized views
- knowledge discovery
- information theory
- learning objects
- object oriented
- similarity measure