Scalable Provenance Storage and Querying Using Pig Latin for Big Data Workflows.
Fahima Amin BhuyanShiyong LuDong RuanJia ZhangPublished in: SCC (2017)
Keyphrases
- big data
- provenance information
- data intensive
- scientific workflows
- data processing
- data intensive computing
- databases
- data management
- workflow systems
- big data analytics
- cloud computing
- metadata
- social media
- database systems
- storage and retrieval
- commodity hardware
- scientific data
- unstructured data
- vast amounts of data
- data analysis
- high volume
- data science
- knowledge discovery
- data grid
- massive data
- query language
- workflow management systems
- file system
- business intelligence
- web services
- machine learning
- database
- query processing
- information technology
- data warehousing
- massive datasets
- business processes
- business analytics
- decision support system
- data driven decision making