Network Load Balancing with Parallel Flowlets for AI Training Clusters.
Peirui CaoWenxue ChengShizhen ZhaoYongqiang XiongPublished in: NAIC (2024)
Keyphrases
- load balancing
- load balance
- dynamic load balancing
- peer to peer
- resource utilization
- load balancing strategy
- parallel query processing
- parallel database systems
- distributed systems
- peer to peer systems
- artificial intelligence
- low overhead
- fault tolerance
- load balancing strategies
- data skew
- round robin
- proxy servers
- grid computing
- fault tolerant
- mobile agents
- web caching
- load distribution
- training set
- skewed data
- inter processor communication
- network traffic
- multiprocessor systems
- data grids
- data partitioning
- computing resources
- data points
- web services