Login / Signup

MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems.

Samuel HsiaAlicia GoldenBilge AcunNewsha ArdalaniZachary DeVitoGu-Yeon WeiDavid BrooksCarole-Jean Wu
Published in: ISCA (2024)
Keyphrases
  • distributed systems
  • machine learning
  • artificial intelligence
  • fault tolerant
  • message passing
  • data mining
  • case study
  • distributed database systems
  • distributed computing
  • real time systems
  • geographically distributed