Learning Competitive Equilibria in Exchange Economies with Bandit Feedback.
Wenshuo Guo, Kirthevasan Kandasamy, Joseph Gonzalez, Michael I. Jordan, Ion Stoica
Published in: AISTATS (2022)
Keyphrases
- reinforcement learning
- learning algorithm
- learning process
- learning scheme
- knowledge acquisition
- learning systems
- supervised learning
- learning tasks
- online learning
- erroneous examples
- neural network
- assessment tool
- motor skills
- information exchange
- incremental learning
- mobile learning
- background knowledge
- learning activities
- semi-supervised
- prior knowledge
- lower bound
- cooperative