Wenjia Meng
Publication Activity (10 Years)
Years Active: 2017-2024
Publications (10 Years): 8
Top Topics
Deep Learning
Function Approximators
Policy Gradient
Reinforcement Learning
Top Venues
CoRR
IEEE Trans. Neural Networks Learn. Syst.
AAAI
IJCAI
Publications
Wenjia Meng, Qian Zheng, Long Yang, Yilong Yin, Gang Pan
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline.
CoRR
(2024)
Wenjia Meng, Qian Zheng, Gang Pan, Yilong Yin
Off-Policy Proximal Policy Optimization.
AAAI
(2023)
Wenjia Meng, Qian Zheng, Yue Shi, Gang Pan
An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst.
33 (5) (2022)
Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network.
IEEE Trans. Neural Networks Learn. Syst.
31 (10) (2020)
Wenjia Meng, Qian Zheng, Long Yang, Pengfei Li, Gang Pan
Qualitative Measurements of Policy Discrepancy for Return-based Deep Q-Network.
CoRR
(2018)
Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning.
IJCAI
(2018)
Long Yang, Minhao Shi, Qian Zheng, Wenjia Meng, Gang Pan
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning.
CoRR
(2018)
Wenjia Meng, Zonghua Gu, Ming Zhang, Zhaohui Wu
Two-Bit Networks for Deep Learning on Resource-Constrained Embedded Devices.
CoRR
(2017)