Subspace selection in high-dimensional big data using genetic algorithm in Apache Spark.
Fatemeh Cheraghchi, Arash Iranzad, Bijan Raahemi
Published in: ICC (2017)
Keyphrases
- feature space
- big data
- high dimensional
- genetic algorithm
- open source
- low dimensional
- dimensionality reduction
- cloud computing
- high volume
- data analysis
- unstructured data
- data visualization
- data management
- data intensive
- open source software
- social media
- big data analytics
- business intelligence
- massive data
- data processing
- vast amounts of data
- map reduce
- data science
- knowledge discovery
- high dimensional data
- data warehousing
- databases
- open source projects
- relational databases
- case study
- information retrieval
- real world
- data sets
- data intensive computing