• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networks.

Andrei TomutSaeed S. JahromiSukhbinder SinghFaysal IshtiaqCésar MuñozPrabdeep Singh BajajAli ElboradyGianni Del BimboMehrazin AlizadehDavid MonteroPablo Martin-RamiroMuhammad IbrahimOussama Tahiri-AlaouiJohn MalcolmSamuel MugelRoman Orus
Published in: CoRR (2024)
Keyphrases