Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents.
Renxi WangHaonan LiXudong HanYixuan ZhangTimothy BaldwinPublished in: CoRR (2024)
Keyphrases
- negative examples
- language model
- positive examples
- fine tuning
- concept learning
- learning algorithm
- probabilistic model
- language modeling
- learning tasks
- retrieval model
- positive and negative
- information retrieval
- n gram
- reinforcement learning
- training data
- learning problems
- prediction accuracy
- information extraction
- pairwise
- language models for information retrieval