On the use of MapReduce for imbalanced big data using Random Forest.
Sara del RíoVictoria LópezJosé Manuel BenítezFrancisco HerreraPublished in: Inf. Sci. (2014)
Keyphrases
- random forest
- big data
- cloud computing
- learning from imbalanced data
- imbalanced data
- data analytics
- data intensive computing
- random forests
- decision trees
- data management
- feature set
- ensemble methods
- big data analytics
- imbalanced datasets
- unstructured data
- multi label
- business intelligence
- data processing
- data analysis
- ensemble learning
- parallel processing
- base classifiers
- decision making
- databases
- ensemble classifier
- knowledge discovery
- knowledge management
- data integration