Wenjia Meng
Publication Activity (10 Years)
Years Active: 2017-2024
Publications (10 Years): 8
Top Topics
Deep Learning
Function Approximators
Policy Gradient
Reinforcement Learning
Top Venues
CoRR
IEEE Trans. Neural Networks Learn. Syst.
AAAI
IJCAI
Publications
Wenjia Meng, Qian Zheng, Long Yang, Yilong Yin, Gang Pan
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline.
CoRR
(2024)
Wenjia Meng, Qian Zheng, Gang Pan, Yilong Yin
Off-Policy Proximal Policy Optimization.
AAAI
(2023)
Wenjia Meng, Qian Zheng, Yue Shi, Gang Pan
An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst.
33 (5) (2022)
Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network.
IEEE Trans. Neural Networks Learn. Syst.
31 (10) (2020)
Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan
Qualitative Measurements of Policy Discrepancy for Return-based Deep Q-Network.
CoRR
(2018)
Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning.
IJCAI
(2018)
Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning.
CoRR
(2018)
Wenjia Meng, Zonghua Gu, Ming Zhang, Zhaohui Wu
Two-Bit Networks for Deep Learning on Resource-Constrained Embedded Devices.
CoRR
(2017)