Login / Signup
8-bit Transformer Inference and Fine-tuning for Edge Accelerators.
Jeffrey Yu
Kartik Prabhu
Yonatan Urman
Robert M. Radway
Eric Han
Priyanka Raina
Published in:
ASPLOS (3) (2024)
Keyphrases
</>
fine tuning
fine tune
fine tuned
viable alternative
fuzzy logic
edge information
bayesian networks
fault diagnosis
probabilistic inference
inference process
weighted graph
bayesian inference
belief networks
edge detection
decision making
real time
power system
single chip
artificial intelligence