Divergent Token Metrics: Measuring degradation to prune away LLM components - and optimize quantization.
Björn Deiseroth, Max Meuer, Nikolas Gritsch, Constantin Eichenberg, Patrick Schramowski, Matthias Aßenmacher, Kristian Kersting
Published in: NAACL-HLT (2024)