Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates.
Insu Jang
Zhenning Yang
Zhen Zhang
Xin Jin
Mosharaf Chowdhury
Published in:
SOSP (2023)
Keyphrases
distributed environment
prior knowledge
databases
database
statistical models
distributed systems
pipeline architecture
processing pipeline
experimental data
mobile agents
machine learning algorithms
lightweight
maximum likelihood
graphical models
hidden Markov models
cooperative
multi agent