C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations.
Peiyi Wang
Lei Li
Zhihong Shao
R. X. Xu
Damai Dai
Yifei Li
Deli Chen
Y. Wu
Zhifang Sui
Published in:
CoRR (2023)
Keyphrases
</>
neural network
metadata
real time
semantic annotation
human behavior
human interaction
data sets
machine learning
artificial intelligence
knowledge base
computational model
human experts
human robot interaction
mathematical problem solving