Login / Signup

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters.

Xinyu ZhouBoris KnyazevAlexia Jolicoeur-MartineauJie Fu
Published in: CoRR (2024)
Keyphrases