Distributed ReliefF-based feature selection in Spark.
Raul-Jose Palma-MendozaDaniel RodríguezLuis de MarcosPublished in: Knowl. Inf. Syst. (2018)
Keyphrases
- feature selection
- information gain
- feature subset
- cooperative
- text categorization
- multi agent
- support vector
- distributed systems
- machine learning
- distributed environment
- mutual information
- peer to peer
- model selection
- lightweight
- communication overhead
- distributed data
- feature selection algorithms
- communication cost
- real time
- unsupervised feature selection
- fault tolerant
- microarray data
- unsupervised learning
- feature set
- multi class