A Survey on Spark Ecosystem: Big Data Processing Infrastructure, Machine Learning, and Applications (Extended abstract).
Shanjiang Tang, Bingsheng He, Ce Yu, Yusen Li, Kun Li
Published in: ICDE (2023)
Keyphrases
- extended abstract
- big data
- machine learning
- data processing
- high volume
- data science
- data analysis
- data intensive computing
- cloud computing
- data management
- data intensive
- knowledge discovery
- social media
- information processing
- big data analytics
- predictive modeling
- real time
- natural language processing
- massive data
- vast amounts of data
- business intelligence
- information extraction
- social computing
- data mining
- statistical and machine learning
- commodity hardware
- massive datasets
- e-learning
- real world