Assessing the Dependability of Apache Spark System: Streaming Analytics on Large-Scale Ocean Data.
Janak DahalElias IoupShaikh ArifuzzamanMahdi AbdelguerfiPublished in: DependSys (2019)
Keyphrases
- data sets
- data analysis
- database
- data sources
- original data
- data processing
- data collection
- data structure
- experimental data
- prior knowledge
- high quality
- training data
- probability distribution
- raw data
- spatial data
- sensor data
- complex systems
- high dimensional data
- end users
- image data
- knowledge discovery
- data points
- social media
- xml documents
- data streams
- real world
- real time