Gatekeeper: A deep reinforcement learning-cum-heuristic based algorithm for scheduling and routing trains in complex environments.
Deepak MohapatraAnkush OjhaHarshad KhadilkarSupratim GhoshPublished in: IJCNN (2022)
Keyphrases
- reinforcement learning
- complex environments
- learning algorithm
- segmentation algorithm
- convergence rate
- monte carlo
- cost function
- optimization algorithm
- expectation maximization
- objective function
- detection algorithm
- dynamic programming
- similarity measure
- routing problem
- state space
- probabilistic model
- k means
- preprocessing
- computational cost
- np hard
- optimal policy
- search space
- recognition algorithm
- cooperative
- optimal solution
- model free
- stochastic approximation