A novel approach for big data classification based on hybrid parallel dimensionality reduction using spark cluster.
Ahmed Hussein AliMahmood Zaki AbdullahPublished in: Comput. Sci. (2019)
Keyphrases
- big data
- dimensionality reduction
- feature extraction
- cloud computing
- pattern recognition
- high dimensionality
- data analysis
- data management
- machine learning
- business intelligence
- feature space
- big data analytics
- high dimensional data
- predictive modeling
- knowledge discovery
- clustering algorithm
- unstructured data
- feature selection
- principal component analysis
- low dimensional
- data points
- high dimensional
- information technology
- parallel implementation
- data science