LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback.

Published in: CoRR (2024)

Keyphrases