Login / Signup
Poor-Supervised Evaluation for SuperLLM via Mutual Consistency.
Peiwen Yuan
Shaoxiong Feng
Yiwei Li
Xinglin Wang
Boyuan Pan
Heda Wang
Yao Hu
Kan Li
Published in:
ACL (Findings) (2024)
Keyphrases
</>
web services
semi supervised
evaluation metrics
comparative evaluation
data mining
decision making
image sequences
high dimensional
gold standard
evaluation methods
evaluation process
consistency checking