SIM-PIPE DryRunner: An approach for testing container-based big data pipelines and generating simulation data.
Aleena ThomasNikolay NikolovAntoine PultierDumitru RomanBrian ElvesæterAhmet SoyluPublished in: COMPSAC (2022)
Keyphrases
- big data
- simulation data
- numerical simulations
- data management
- cloud computing
- data analysis
- data intensive
- data processing
- high volume
- unstructured data
- data sets
- social media
- knowledge discovery
- big data analytics
- vast amounts of data
- business intelligence
- massive data
- data warehousing
- data science
- health informatics
- database systems
- huge data
- gene expression data
- data analytics
- predictive modeling
- data warehouse
- case based reasoning
- query processing
- metadata