A Survey on Spark Ecosystem: Big Data Processing Infrastructure, Machine Learning, and Applications.
Shanjiang Tang, Bingsheng He, Ce Yu, Yusen Li, Kun Li
Published in: IEEE Trans. Knowl. Data Eng. (2022)
Keyphrases
- big data
- machine learning
- high volume
- data processing
- data science
- data analysis
- data intensive computing
- data management
- knowledge discovery
- data intensive
- big data analytics
- unstructured data
- cloud computing
- social media
- vast amounts of data
- commodity hardware
- information processing
- business intelligence
- massive data
- predictive modeling
- real time
- information extraction
- artificial intelligence
- data warehousing
- data sets
- statistical learning
- structured data
- text mining
- natural language processing
- information technology
- databases
- database
- data driven decision making