Login / Signup
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning.
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
Published in:
ICLR (2023)
Keyphrases
</>
semi structured
policy gradient
learning process
information extraction
web documents
model free reinforcement learning
active learning
supervised learning
text mining
database
neural network
structured data
dynamical systems
learning tasks
actor critic