Capturing and querying fine-grained provenance of preprocessing pipelines in data science.
Adriane ChapmanPaolo MissierGiulia SimonelliRiccardo TorlonePublished in: Proc. VLDB Endow. (2020)
Keyphrases
- fine grained
- data science
- big data
- preprocessing
- coarse grained
- machine learning
- statistical learning
- data provenance
- data analysis
- social media
- cloud computing
- databases
- access control
- data processing
- data management
- query language
- tightly coupled
- feature extraction
- business intelligence
- knowledge discovery
- provenance information
- data mining
- information processing
- object oriented
- social networks