SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation.
Yifan Xiong, Yuting Jiang, Ziyue Yang, Lei Qu, Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, Joe Chau, Peng Cheng, Yongqiang Xiong, Lidong Zhou
Published in: USENIX ATC (2024)
Keyphrases
- artificial intelligence
- cloud computing
- intelligent systems
- computing platform
- machine learning
- knowledge representation
- case based reasoning
- cloud services
- AI systems
- data sets
- knowledge based systems
- cost effective
- AI methods
- cost effectiveness
- computing infrastructure
- reliability analysis
- highly reliable
- knowledge representation and reasoning
- information exchange
- web services