Login / Signup

Inverse Reinforcement Learning with Agents' Biased Exploration Based on Sub-Optimal Sequential Action Data.

Fumito UwanoSatoshi HasegawaKeiki Takadama
Published in: J. Adv. Comput. Intell. Intell. Informatics (2024)
Keyphrases
  • multi agent
  • learning algorithm
  • prior knowledge
  • action selection
  • data mining
  • multi agent systems
  • markov chain