Extreme compression of sentence-transformer ranker models: faster inference, longer battery life, and less storage on edge devices.
Amit ChaulwarLukas MalikMaciej KrajewskiFelix ReichelLeif-Nissen LundbækMichael HuthBartlomiej MatejczykPublished in: CoRR (2022)