Scalable Data Analytics Using R: Single Machines to Hadoop Spark Clusters.
John-Mark AgostaDebraj GuhaThakurtaRobert HortonMario InchiosaSrini KumarMengyue ZhaoPublished in: KDD (2016)
Keyphrases
- commodity hardware
- data analytics
- map reduce
- cloud computing
- parallel processing
- parallel computing
- big data
- open source
- data analysis
- keyword search
- clustering algorithm
- shared memory
- internet search
- data mining techniques
- traditional chinese medicine
- data mining
- social media
- database
- community detection
- real world