Scalable Feature Subset Selection for Big Data using Parallel Hybrid Evolutionary Algorithm based Wrapper in Apache Spark.
Yelleti Vivek, Vadlamani Ravi, Pisipati Radhakrishna
Published in: CoRR (2021)
Keyphrases
- big data
- feature subset selection
- feature selection
- map reduce
- cloud computing
- commodity hardware
- hybrid evolutionary algorithm
- evolutionary algorithm
- data intensive computing
- multi objective optimization
- data management
- open source
- data analysis
- parallel processing
- data processing
- big data analytics
- social media
- knowledge discovery
- feature subset
- mutual information
- multi objective
- parallel computing
- parallel computation
- shared memory
- business intelligence
- selection algorithm
- support vector
- data warehousing
- information processing
- social networks
- neural network
- metadata
- object oriented
- knn
- support vector machine
- database
- data warehouse
- feature set