Sign in

A Contextual Bandit Approach for Learning to Plan in Environments with Probabilistic Goal Configurations.

Sohan RudraSaksham GoelAnirban SantaraClaudio GentileLaurent PerronFei XiaVikas SindhwaniCarolina ParadaGaurav Aggarwal
Published in: CoRR (2022)
Keyphrases
  • learning algorithm
  • learning process
  • reinforcement learning
  • bayesian networks
  • video sequences
  • data analysis
  • semi supervised
  • supervised learning
  • autonomous robots
  • plan library