Performance Comparison of Apache Spark and Hadoop for Machine Learning based iterative GBTR on HIGGS and Covid-19 Datasets.
Piyush SewalHari SinghPublished in: Scalable Comput. Pract. Exp. (2024)
Keyphrases
- machine learning
- open source
- map reduce
- pattern recognition
- information extraction
- benchmark datasets
- machine learning algorithms
- decision trees
- open source software
- cloud computing
- text mining
- learning tasks
- machine learning methods
- statistical analysis
- data management
- source code
- web server
- active learning
- feature selection
- data mining
- data sets
- iterative methods
- knowledge acquisition
- computational intelligence
- natural language processing
- computer science
- neural network