​
Login / Signup
Tao Liu
ORCID
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 13
Top Topics
Multiple Scales
Actor Critic
Function Approximators
Policy Gradient
Top Venues
CoRR
NeurIPS
IEEE Control. Syst. Lett.
IEICE Trans. Inf. Syst.
</>
Publications
</>
Youbang Sun
,
Tao Liu
,
P. R. Kumar
,
Shahin Shahrampour
Linear Convergence of Independent Natural Policy Gradient in Games With Entropy Regularization.
IEEE Control. Syst. Lett.
8 (2024)
Youbang Sun
,
Tao Liu
,
Ruida Zhou
,
P. R. Kumar
,
Shahin Shahrampour
Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games.
CoRR
(2023)
Ruida Zhou
,
Tao Liu
,
Min Cheng
,
Dileep Kalathil
,
P. R. Kumar
,
Chao Tian
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation.
NeurIPS
(2023)
Ruida Zhou
,
Tao Liu
,
Min Cheng
,
Dileep Kalathil
,
P. R. Kumar
,
Chao Tian
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation.
CoRR
(2023)
Youbang Sun
,
Tao Liu
,
Ruida Zhou
,
P. R. Kumar
,
Shahin Shahrampour
Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games.
NeurIPS
(2023)
Ruida Zhou
,
Tao Liu
,
Dileep M. Kalathil
,
P. R. Kumar
,
Chao Tian
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning.
CoRR
(2022)
Tao Liu
,
P. R. Kumar
,
Ruida Zhou
,
Xi Liu
Learning from Few Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales.
NeurIPS
(2022)
Ruida Zhou
,
Tao Liu
,
Dileep Kalathil
,
P. R. Kumar
,
Chao Tian
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning.
NeurIPS
(2022)
Tao Liu
,
Ruida Zhou
,
Dileep Kalathil
,
Panganamala R. Kumar
,
Chao Tian
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs.
NeurIPS
(2021)
Tao Liu
,
P. R. Kumar
,
Xi Liu
Learning from Small Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales.
CoRR
(2021)
Tao Liu
,
Ruida Zhou
,
Dileep Kalathil
,
P. R. Kumar
,
Chao Tian
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs.
CoRR
(2021)
Tao Liu
,
Ruida Zhou
,
Dileep Kalathil
,
P. R. Kumar
,
Chao Tian
Fast Global Convergence of Policy Optimization for Constrained MDPs.
CoRR
(2021)
Tao Liu
,
Huaxi Gu
,
Yue Wang
,
Wei Zou
An Optimized Low-Power Optical Memory Access Network for Kilocore Systems.
IEICE Trans. Inf. Syst.
(5) (2019)