Understanding and optimizing the performance of distributed machine learning applications on Apache Spark.
Celestine Dünner, Thomas P. Parnell, Kubilay Atasu, Manolis Sifalakis, Haralampos Pozidis
Published in: IEEE BigData (2017)
Keyphrases
- machine learning
- distributed systems
- open source
- cooperative
- data mining
- learning systems
- multi agent
- machine learning methods
- deeper understanding
- distributed data
- pattern recognition
- web server
- artificial intelligence
- learning algorithm
- neural network
- explanation based learning
- machine learning algorithms
- knowledge acquisition
- text classification
- map reduce
- peer to peer
- information extraction
- active learning
- computational intelligence
- text mining
- natural language processing
- support vector machine
- statistical methods
- machine learning approaches
- data analysis
- decision trees
- feature selection
- computer vision