Login / Signup
Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding.
Ha-Thanh Nguyen
Ken Satoh
Published in:
CoRR (2024)
Keyphrases
</>
balancing exploration and exploitation
decision trees
data sets
logic programs
deductive databases
cost function
learning to rank
deeper understanding