Apache Spark: a unified engine for big data processing.
Matei ZahariaReynold S. XinPatrick WendellTathagata DasMichael ArmbrustAnkur DaveXiangrui MengJosh RosenShivaram VenkataramanMichael J. FranklinAli GhodsiJoseph GonzalezScott ShenkerIon StoicaPublished in: Commun. ACM (2016)
Keyphrases
- big data
- high volume
- data processing
- data intensive computing
- cloud computing
- data management
- data analysis
- social media
- open source
- data visualization
- data intensive
- unstructured data
- information processing
- data science
- big data analytics
- real time
- vast amounts of data
- massive data
- data warehousing
- business intelligence
- knowledge discovery
- open source software
- predictive modeling
- huge data
- information extraction
- query processing
- decision making