Model-based optimal action selection for Dyna-Q reverberation suppression cognitive sonar.
Yubin Fu, Xiaochuan Ma, Chao Feng, Xingxuan Pei, Pengzhuo Li
Published in: EURASIP J. Adv. Signal Process. (2023)
Keyphrases
- action selection
- temporal difference
- basal ganglia
- decision making
- robot soccer
- reinforcement learning
- worst case
- action selection mechanism
- optimal solution
- mobile robot
- total reward
- information processing
- computational models
- dynamic programming
- computer science
- temporal difference learning
- continuous state and action spaces
- computer vision