C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Dhawal Gupta
ORCID
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 17
Top Topics
Dialogue Management
Credit Assignment
Reinforcement Learning
Reward Function
Top Venues
CoRR
NeurIPS
ACM Trans. Asian Low Resour. Lang. Inf. Process.
Multim. Tools Appl.
</>
Publications
</>
Mehwash Weqar
,
Shabana Mehfuz
,
Dhawal Gupta
,
Shabana Urooj
Adaptive Switching Based Data-Communication Model for Internet of Healthcare Things Networks.
IEEE Access
12 (2024)
Simeng Sun
,
Dhawal Gupta
,
Mohit Iyyer
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF.
CoRR
(2023)
James E. Kostas
,
Scott M. Jordan
,
Yash Chandak
,
Georgios Theocharous
,
Dhawal Gupta
,
Martha White
,
Bruno Castro da Silva
,
Philip S. Thomas
Coagent Networks: Generalized and Scaled.
CoRR
(2023)
Yinlam Chow
,
Aza Tulepbergenov
,
Ofir Nachum
,
Dhawal Gupta
,
Moonkyung Ryu
,
Mohammad Ghavamzadeh
,
Craig Boutilier
A Mixture-of-Expert Approach to RL-based Dialogue Management.
ICLR
(2023)
Dhawal Gupta
,
Yinlam Chow
,
Mohammad Ghavamzadeh
,
Craig Boutilier
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management.
CoRR
(2023)
Dhawal Gupta
,
Yash Chandak
,
Scott M. Jordan
,
Philip S. Thomas
,
Bruno C. da Silva
Behavior Alignment via Reward Function Optimization.
NeurIPS
(2023)
Dhawal Gupta
,
Yinlam Chow
,
Azamat Tulepbergenov
,
Mohammad Ghavamzadeh
,
Craig Boutilier
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management.
NeurIPS
(2023)
Dhawal Gupta
,
Scott M. Jordan
,
Shreyas Chaudhari
,
Bo Liu
,
Philip S. Thomas
,
Bruno Castro da Silva
From Past to Future: Rethinking Eligibility Traces.
CoRR
(2023)
Dhawal Gupta
,
Yash Chandak
,
Scott M. Jordan
,
Philip S. Thomas
,
Bruno Castro da Silva
Behavior Alignment via Reward Function Optimization.
CoRR
(2023)
Tulika Saha
,
Dhawal Gupta
,
Sriparna Saha
,
Pushpak Bhattacharyya
A Unified Dialogue Management Strategy for Multi-intent Dialogue Conversations in Multiple Languages.
ACM Trans. Asian Low Resour. Lang. Inf. Process.
20 (6) (2021)
Tulika Saha
,
Dhawal Gupta
,
Sriparna Saha
,
Pushpak Bhattacharyya
A hierarchical approach for efficient multi-intent dialogue policy learning.
Multim. Tools Appl.
80 (28-29) (2021)
Dhawal Gupta
,
Gabor Mihucz
,
Matthew Schlegel
,
James E. Kostas
,
Philip S. Thomas
,
Martha White
Structural Credit Assignment in Neural Networks using Reinforcement Learning.
NeurIPS
(2021)
Tulika Saha
,
Dhawal Gupta
,
Sriparna Saha
,
Pushpak Bhattacharyya
Emotion Aided Dialogue Act Classification for Task-Independent Conversations in a Multi-modal Framework.
Cogn. Comput.
13 (2) (2021)
Sina Ghiassian
,
Andrew Patterson
,
Shivam Garg
,
Dhawal Gupta
,
Adam White
,
Martha White
Gradient Temporal-Difference Learning with Regularized Corrections.
ICML
(2020)
Sina Ghiassian
,
Andrew Patterson
,
Shivam Garg
,
Dhawal Gupta
,
Adam White
,
Martha White
Gradient Temporal-Difference Learning with Regularized Corrections.
CoRR
(2020)
Tulika Saha
,
Dhawal Gupta
,
Sriparna Saha
,
Pushpak Bhattacharyya
Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning.
Expert Syst. Appl.
162 (2020)
Tulika Saha
,
Dhawal Gupta
,
Sriparna Saha
,
Pushpak Bhattacharyya
Reinforcement Learning Based Dialogue Management Strategy.
ICONIP (3)
(2018)