Publication: Hardware as Policy: Mechanical and Computational Co-Optimization using Deep Reinforcement Learning.