Parallel Training GBRT Based on KMeans Histogram Approximation for Big Data.
Rong Gu, Lei Jin, Yongwei Wu, Jingying Qu, Tao Wang, Xiaojun Wang, Chunfeng Yuan, Yihua Huang
Published in: ICA3PP (2) (2015)
Keyphrases
- big data
- cloud computing
- data analysis
- data intensive
- data management
- social media
- k means
- high volume
- data processing
- unstructured data
- big data analytics
- vast amounts of data
- massive data
- business intelligence
- supervised learning
- parallel processing
- data science
- commodity hardware
- data warehousing
- training set
- real world
- knowledge discovery
- shared memory
- massive datasets
- data intensive computing
- social computing
- knowledge management
- management system
- social networks
- machine learning