Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning.
Junseok ParkYoonsung KimHee bin YooMin Whoo LeeKibeom KimWon-Seok ChoiMinsu LeeByoung-Tak ZhangPublished in: CoRR (2024)
Keyphrases
- goal oriented
- reinforcement learning
- transition model
- function approximation
- state space
- requirements analysis
- machine learning
- reinforcement learning algorithms
- markov decision processes
- eligibility traces
- learning algorithm
- model free
- reward function
- temporal difference
- requirements engineering
- process oriented
- action selection
- multi agent
- optimal policy
- control policy
- average reward
- data marts
- partially observable environments
- state transition
- fine grained
- markov chain
- markov decision process
- learning agent
- knowledge management
- dynamic programming
- learning process