LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback.

Published in: ACL (1) (2024)

Keyphrases