Login / Signup
Linguistic communication as (inverse) reward design.
Theodore R. Sumers
Robert D. Hawkins
Mark K. Ho
Thomas L. Griffiths
Dylan Hadfield-Menell
Published in:
CoRR (2022)
Keyphrases
</>
reinforcement learning
embedded systems
natural language processing
design process
optimal design
database
neural network
machine learning
case study
multi agent systems
information sharing
data acquisition
design decisions
multi party