Login / Signup
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations.
Peiyi Wang
Lei Li
Zhihong Shao
Runxin Xu
Damai Dai
Yifei Li
Deli Chen
Yu Wu
Zhifang Sui
Published in:
ACL (1) (2024)
Keyphrases
</>
human experts
human interaction
data sets
neural network
metadata
digital libraries
semantic annotation
human subjects