A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques.
Xuetong LiYuan GaoHong ChangDanyang HuangYingying MaRui PanHaobo QiFeifei WangShuyuan WuKe XuJing ZhouXuening ZhuYingqiu ZhuHansheng WangPublished in: CoRR (2024)
Keyphrases
- statistical methods
- distributed computing
- massive data
- statistical analysis
- distributed environment
- data mining applications
- grid computing
- fault tolerance
- cloud computing
- distributed systems
- mobile agents
- big data
- machine learning
- peer to peer
- data mining techniques
- machine learning methods
- virtual machine
- distributed computing systems
- statistical models
- statistical approaches
- distributed computing environment
- mobile communications
- data sets
- data mining methods
- maximum likelihood
- data management
- social media
- data mining
- databases