Login / Signup

A policy optimization algorithm based on sample adaptive reuse and dual-clipping for robotic action control.

Li-yang ZhaoTian-qing ChangJie ZhangLei ZhangKai-xuan ChuLi-bin GuoDepeng Kong
Published in: Appl. Soft Comput. (2023)
Keyphrases