Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph.
Roman VashurinEkaterina FadeevaArtem VazhentsevAkim TsvigunDaniil VasilevRui XingAbdelrahman Boda SadallahLyudmila RvanovaSergey PetrakovAlexander PanchenkoTimothy BaldwinPreslav NakovMaxim PanovArtem ShelmanovPublished in: CoRR (2024)