Login / Signup

Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding.

Ha-Thanh NguyenKen Satoh
Published in: CoRR (2024)
Keyphrases
  • balancing exploration and exploitation
  • decision trees
  • data sets
  • logic programs
  • deductive databases
  • cost function
  • learning to rank
  • deeper understanding