High-Performance Distributed Machine Learning using Apache SPARK.
Celestine DünnerThomas P. ParnellKubilay AtasuManolis SifalakisHaralampos PozidisPublished in: CoRR (2016)
Keyphrases
- machine learning
- open source
- distributed systems
- data intensive
- decision trees
- open source software
- distributed data
- lightweight
- information extraction
- peer to peer
- text mining
- learning problems
- natural language processing
- supervised learning
- map reduce
- learning algorithm
- semi supervised learning
- learning systems
- mobile agents
- text classification
- fault tolerant
- distributed environment
- distributed computing
- machine learning methods
- artificial intelligence
- data mining
- pattern recognition
- reinforcement learning
- database systems
- website
- computer vision