Keyphrases
- reinforcement learning
- machine learning
- model-free
- Markov decision processes
- qualitative and quantitative
- multi-agent
- electronic commerce
- policy search
- temporal difference
- function approximation
- multi-agent reinforcement learning
- learning agent
- robotic control
- stock trading
- quantitative measures
- real-time
- temporal difference learning
- stock exchange
- stock price
- optimal control
- optimal policy
- decision making
- learning algorithm