Random forest implementation and optimization for Big Data analytics on LexisNexis's high performance computing cluster platform.
Victor M. HerreraTaghi M. KhoshgoftaarFlavio VillanustreBorko FurhtPublished in: J. Big Data (2019)
Keyphrases
- random forest
- high performance computing
- scientific computing
- decision trees
- big data analytics
- computational science
- computing systems
- parallel computing
- feature set
- massively parallel
- multi label
- fault tolerance
- ensemble methods
- grid computing
- database
- computing environments
- software engineering
- energy efficiency
- computing resources
- big data
- training set
- feature extraction
- information systems