Login / Signup
To FP8 and Back Again: Quantifying the Effects of Reducing Precision on LLM Training Stability.
Joonhyung Lee
Jeongin Bae
Byeongwook Kim
Se Jung Kwon
Dongsoo Lee
Published in:
CoRR (2024)
Keyphrases
</>
high precision
precision and recall
numerical stability
training algorithm
training set
small number
supervised learning
training samples
training process
neural network
learning algorithm
data sets
data mining
training phase
feedforward neural networks
stability analysis
high recall
computer based instruction