Learning impartial policies for sequential counterfactual explanations using Deep Reinforcement Learning.
Emmanouil Panagiotou, Eirini Ntoutsi
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- learning algorithm
- learning process
- supervised learning
- active learning
- action selection
- learning tasks
- policy gradient methods
- autonomous learning
- learning systems
- unsupervised learning
- online learning
- learning environment
- optimal policy
- transfer learning
- knowledge acquisition
- model free
- learning agent
- reinforcement learning methods
- Markov decision problems
- state space
- evolutionary learning