Distributed data augmented support vector machine on Spark.
Tu Dinh NguyenVu NguyenTrung LeDinh Q. PhungPublished in: ICPR (2016)
Keyphrases
- distributed data
- support vector machine
- data sharing
- distributed data mining
- data distribution
- file system
- databases
- integrating heterogeneous
- support vector
- svm classifier
- data mining algorithms
- decision boundary
- training set
- communication cost
- training data
- distributed data sources
- machine learning
- feature vectors
- feature space
- feature selection
- rare events
- knn
- object oriented
- decision making