Provenance for MapReduce-based data-intensive workflows.
Daniel Crawl, Jianwu Wang, Ilkay Altintas
Published in: WORKS@SC (2011)
Keyphrases
- data intensive
- scientific workflows
- web services
- data management
- workflow systems
- data processing
- scientific data
- big data
- provenance information
- geographically distributed
- data access
- earth science
- globally distributed
- business processes
- workflow management
- service oriented
- metadata
- semantic annotation
- grid computing
- business process
- workflow management systems
- databases
- distributed environment
- computing environments
- data warehouse
- multi agent
- database systems
- data grid
- database