On Efficiently Processing Workflow Provenance Queries in Spark.
Rajmohan C, Pranay Lohia, Himanshu Gupta, Siddhartha Brahma, Mauricio A. Hernández, Sameep Mehta
Published in: ICDCS (2019)
Keyphrases
- provenance information
- efficient processing
- scientific workflows
- processing queries
- query processing
- query evaluation
- metadata
- range queries
- real time
- database
- query language
- pre computed
- data processing
- join queries
- workflow systems
- efficient execution
- retrieval systems
- database queries
- data flow
- complex queries
- query logs
- response time
- aggregation queries
- query formulation
- query patterns
- user queries
- fine grained
- data sources
- web services
- databases