Step climbing method for crawler type rescue robot using reinforcement learning with Proximal Policy Optimization.
Mifu TotaniNoritaka SatoYoshifumi MoritaPublished in: RoMoCo (2019)
Keyphrases
- reinforcement learning
- significant improvement
- objective function
- policy search
- optimization algorithm
- dynamic programming
- mobile robot
- cost function
- detection method
- optimal policy
- preprocessing
- state space
- support vector machine
- dynamic environments
- optimization method
- combinatorial optimization
- convergence rate
- optimization process