Login / Signup
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning.
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
Published in:
CoRR (2022)
Keyphrases
</>
semi structured
learning process
structured data
policy gradient
data model
reinforcement learning
learning tasks
supervised learning
actor critic
learning problems
domain independent
web documents
data sets
information extraction
learning algorithm
information retrieval
databases