Login / Signup
Weight subcloning: direct initialization of transformers using larger pretrained ones.
Mohammad Samragh
Mehrdad Farajtabar
Sachin Mehta
Raviteja Vemulapalli
Fartash Faghri
Devang Naik
Oncel Tuzel
Mohammad Rastegari
Published in:
CoRR (2023)
Keyphrases
</>
real time
decision making
website
k means
database
real world
computer vision
multimedia
objective function
weighting scheme
small size
initial conditions
weight function