Nawras Alnaasan
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 12
Top Topics
Distributed Data
Autoregressive Model
Artificial Vision
Quantization Error
Top Venues
CoRR
HOTI
BigData
IEEE Micro
Publications
Bharath Ramesh, Nick Contini, Nawras Alnaasan, Kaushik Kandadi Suresh, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
HINT: Designing Cache-Efficient MPI_Alltoall using Hybrid Memory Copy Ordering and Non-Temporal Instructions.
IPDPS
(2024)
Bharath Ramesh, Goutham Kalikrishna Reddy Kuncham, Kaushik Kandadi Suresh, Rahul Vaidya, Nawras Alnaasan, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
Designing In-network Computing Aware Reduction Collectives in MPI.
HOTI
(2023)
Hyunho Ahn, Tian Chen, Nawras Alnaasan, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda
Performance Characterization of Using Quantization for DNN Inference on Edge Devices.
ICFEC
(2023)
Jinghan Yao, Nawras Alnaasan, Tian Chen, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference.
CoRR
(2023)
Jinghan Yao, Nawras Alnaasan, Tian Chen, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference.
HiPC
(2023)
Hyunho Ahn, Tian Chen, Nawras Alnaasan, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda
Performance Characterization of using Quantization for DNN Inference on Edge Devices: Extended Version.
CoRR
(2023)
Nawras Alnaasan, Matthew Lieber, Aamir Shafi, Hari Subramoni, Scott Shearer, Dhabaleswar K. Panda
HARVEST: High-Performance Artificial Vision Framework for Expert Labeling using Semi-Supervised Training.
BigData
(2023)
Nawras Alnaasan, Arpan Jain, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
OMB-Py: Python Micro-Benchmarks for Evaluating Performance of MPI Libraries on HPC Systems.
IPDPS Workshops
(2022)
Nawras Alnaasan, Arpan Jain, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
AccDP: Accelerated Data-Parallel Distributed DNN Training for Modern GPU-Based HPC Clusters.
HiPC
(2022)
Arpan Jain, Nawras Alnaasan, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
Optimizing Distributed DNN Training Using CPUs and BlueField-2 DPUs.
IEEE Micro
42 (2) (2022)
Arpan Jain, Nawras Alnaasan, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
Accelerating CPU-based Distributed DNN Training on Modern HPC Clusters using BlueField-2 DPUs.
HOTI
(2021)
Nawras Alnaasan, Arpan Jain, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda
OMB-Py: Python Micro-Benchmarks for Evaluating Performance of MPI Libraries on HPC Systems.
CoRR
(2021)