Login / Signup
Rohan Garg
ORCID
Publication Activity (10 Years)
Years Active: 2012-2024
Publications (10 Years): 16
Top Topics
Distributed Databases
Parallel Algorithm
Success Probability
Fault Tolerance
Top Venues
CoRR
CLUSTER
ICPADS
ASPLOS (1)
</>
Publications
</>
Rohan Basu Roy
,
Tirthak Patel
,
Rohan Garg
,
Devesh Tiwari
CodeCrunch: Improving Serverless Performance via Function Compression and Cost-Aware Warmup Location Optimization.
ASPLOS (1)
(2024)
Yao Xu
,
Zhengji Zhao
,
Rohan Garg
,
Harsh Khetawat
,
Rebecca Hartman-Baker
,
Gene Cooperman
MANA-2.0: A Future-Proof Design for Transparent Checkpointing of MPI at Scale.
SC (Workshops)
(2021)
Prashant Singh Chouhan
,
Harsh Khetawat
,
Neil Resnik
,
Twinkle Jain
,
Rohan Garg
,
Gene Cooperman
,
Rebecca Hartman-Baker
,
Zhengji Zhao
Improving scalability and reliability of MPI-agnostic transparent checkpointing for production workloads at NERSC.
CoRR
(2021)
Yao Xu
,
Zhengji Zhao
,
Rohan Garg
,
Harsh Khetawat
,
Rebecca Hartman-Baker
,
Gene Cooperman
MANA-2.0: A Future-Proof Design for Transparent Checkpointing of MPI at Scale.
CoRR
(2021)
Tirthak Patel
,
Rohan Garg
,
Devesh Tiwari
GIFT: A Coupon Based Throttle-and-Reward Mechanism for Fair and Efficient I/O Bandwidth Management on Parallel Storage Systems.
FAST
(2020)
Rohan Garg
,
Gregory Price
,
Gene Cooperman
MANA for MPI: MPI-Agnostic Network-Agnostic Transparent Checkpointing.
HPDC
(2019)
Rohan Garg
,
Gregory Price
,
Gene Cooperman
MANA for MPI: MPI-Agnostic Network-Agnostic Transparent Checkpointing.
CoRR
(2019)
Rohan Garg
,
Apoorve Mohan
,
Michael B. Sullivan
,
Gene Cooperman
CRUM: Checkpoint-Restart Support for CUDA's Unified Memory.
CLUSTER
(2018)
Rohan Garg
,
Apoorve Mohan
,
Michael B. Sullivan
,
Gene Cooperman
CRUM: Checkpoint-Restart Support for CUDA's Unified Memory.
CoRR
(2018)
Rohan Garg
,
Tirthak Patel
,
Gene Cooperman
,
Devesh Tiwari
Shiraz: Exploiting System Reliability and Application Resilience Characteristics to Improve Large Scale System Throughput.
DSN
(2018)
Rohan Garg
,
Kapil Arya
,
Jiajun Cao
,
Gene Cooperman
,
Jeff Evans
,
Ankit Garg
,
Neil A. Rosenberg
,
K. Suresh
Adapting the DMTCP Plugin Model for Checkpointing of Hardware Emulation.
CoRR
(2017)
Kapil Arya
,
Rohan Garg
,
Artem Y. Polyakov
,
Gene Cooperman
Design and Implementation for Checkpointing of Distributed Resources Using Process-Level Virtualization.
CLUSTER
(2016)
Jiajun Cao
,
Kapil Arya
,
Rohan Garg
,
L. Shawn Matott
,
Dhabaleswar K. Panda
,
Hari Subramoni
,
Jérôme Vienne
,
Gene Cooperman
System-level Scalable Checkpoint-Restart for Petascale Computing.
CoRR
(2016)
Jiajun Cao
,
Kapil Arya
,
Rohan Garg
,
L. Shawn Matott
,
Dhabaleswar K. Panda
,
Hari Subramoni
,
Jérôme Vienne
,
Gene Cooperman
System-Level Scalable Checkpoint-Restart for Petascale Computing.
ICPADS
(2016)
Rohan Garg
,
Jérôme Vienne
,
Gene Cooperman
System-Level Transparent Checkpointing for OpenSHMEM.
OpenSHMEM
(2016)
Rohan Garg
,
Jiajun Cao
,
Kapil Arya
,
Gene Cooperman
,
Jérôme Vienne
Extended Batch Sessions and Three-Phase Debugging: Using DMTCP to Enhance the Batch Environment.
XSEDE
(2016)
Rohan Garg
,
Komal Sodha
,
Zhengping Jin
,
Gene Cooperman
Checkpoint-restart for a network of virtual machines.
CLUSTER
(2013)
Samaneh Kazemi
,
Rohan Garg
,
Gene Cooperman
Transparent Checkpoint-Restart for Hardware-Accelerated 3D Graphics.
CoRR
(2013)
Kurt L. Keville
,
Rohan Garg
,
David J. Yates
,
Kapil Arya
,
Gene Cooperman
Towards Fault-Tolerant Energy-Efficient High Performance Computing in the Cloud.
CLUSTER
(2012)
Rohan Garg
,
Komal Sodha
,
Gene Cooperman
A Generic Checkpoint-Restart Mechanism for Virtual Machines
CoRR
(2012)