Login / Signup
Yehia Arafa
ORCID
Publication Activity (10 Years)
Years Active: 2018-2023
Publications (10 Years): 25
Top Topics
Low Overhead
Parallel Implementation
Shared Memory
Discrete Event
Top Venues
CoRR
MEMSYS
HPEC
SIGSIM-PADS
</>
Publications
</>
George Michelogiannakis
,
Yehia Arafa
,
Brandon Cook
,
Liang Yuan Dai
,
Abdel-Hameed A. Badawy
,
Madeleine Glick
,
Yuyang Wang
,
Keren Bergman
,
John Shalf
Efficient Intra-Rack Resource Disaggregation for HPC Using Co-Packaged DWDM Photonics.
CLUSTER
(2023)
Hamdy Abdelkhalik
,
Yehia Arafa
,
Nandakishore Santhi
,
Nirmal Prajapati
,
Abdel-Hameed A. Badawy
Modeling and Characterizing Shared and Local Memories of the Ampere GPUs.
MEMSYS
(2023)
George Michelogiannakis
,
Yehia Arafa
,
Brandon Cook
,
Liang Yuan Dai
,
Abdel-Hameed A. Badawy
,
Madeleine Glick
,
Yuyang Wang
,
Keren Bergman
,
John Shalf
Efficient Intra-Rack Resource Disaggregation for HPC Using Co-Packaged DWDM Photonics.
CoRR
(2023)
Hamdy Abdelkhalik
,
Shamminuj Aktar
,
Yehia Arafa
,
Atanu Barai
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Nishant Panda
,
Nirmal Prajapati
,
Nazmul Haque Turja
,
Stephan J. Eidenbenz
,
Abdel-Hameed A. Badawy
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques.
ICPADS
(2023)
Hamdy Abdelkhalik
,
Yehia Arafa
,
Nandakishore Santhi
,
Abdel-Hameed A. Badawy
Demystifying the Nvidia Ampere Architecture through Microbenchmarking and Instruction-level Analysis.
CoRR
(2022)
Shamminuj Aktar
,
Hamdy Abdelkhalik
,
Nazmul Haque Turja
,
Yehia Arafa
,
Atanu Barai
,
Nishant Panda
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
,
Abdel-Hameed A. Badawy
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques.
CoRR
(2022)
Atanu Barai
,
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
PPT-Multicore: performance prediction of OpenMP applications using reuse profiles and analytical modeling.
J. Supercomput.
78 (2) (2022)
Hamdy Abdelkhalik
,
Yehia Arafa
,
Nandakishore Santhi
,
Abdel-Hameed A. Badawy
Demystifying the Nvidia Ampere Architecture through Microbenchmarking and Instruction-level Analysis.
HPEC
(2022)
Ali Eker
,
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
,
Dmitry V. Ponomarev
Load-Aware Dynamic Time Synchronization in Parallel Discrete Event Simulation.
SIGSIM-PADS
(2021)
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Ammar ElWazir
,
Atanu Barai
,
Ali Eker
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
Hybrid, scalable, trace-driven performance modeling of GPGPUs.
SC
(2021)
Atanu Barai
,
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
PPT-Multicore: Performance Prediction of OpenMP applications using Reuse Profiles and Analytical Modeling.
CoRR
(2021)
Atanu Barai
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Abdel-Hameed A. Badawy
,
Yehia Arafa
,
Stephan J. Eidenbenz
PPT-SASMM: Scalable Analytical Shared Memory Model: Predicting the Performance of Multicore Caches from a Single-Threaded Execution Trace.
CoRR
(2021)
Yehia Arafa
,
Ammar ElWazir
,
Abdelrahman Elkanishy
,
Youssef Aly
,
Ayatelrahman Elsayed
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Stephan J. Eidenbenz
,
Nandakishore Santhi
Verified instruction-level energy consumption measurement for NVIDIA GPUs.
CF
(2020)
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Atanu Barai
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
Fast, accurate, and scalable memory modeling of GPGPUs using reuse profiles.
ICS
(2020)
Yehia Arafa
,
Ammar ElWazir
,
Abdelrahman Elkanishy
,
Youssef Aly
,
Ayatelrahman Elsayed
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Stephan J. Eidenbenz
,
Nandakishore Santhi
Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs.
CoRR
(2020)
Atanu Barai
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Abdel-Hameed A. Badawy
,
Yehia Arafa
,
Stephan J. Eidenbenz
PPT-SASMM: Scalable Analytical Shared Memory Model: Predicting the Performance of Multicore Caches from a Single-Threaded Execution Trace.
MEMSYS
(2020)
Yehia Arafa
,
Ammar ElWazir
,
Abdelrahman Elkanishy
,
Youssef Aly
,
Ayatelrahman Elsayed
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Stephan J. Eidenbenz
,
Nandakishore Santhi
NVIDIA GPGPUs Instructions Energy Consumption.
ISPASS
(2020)
Yehia Arafa
,
Gopinath Chennupati
,
Atanu Barai
,
Abdel-Hameed A. Badawy
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
GPUs Cache Performance Estimation using Reuse Distance Analysis.
IPCCC
(2019)
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
POSTER: GPUs Pipeline Latency Analysis.
ASAP
(2019)
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
Instructions' Latencies Characterization for NVIDIA GPGPUs.
CoRR
(2019)
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
Low Overhead Instruction Latency Characterization for NVIDIA GPGPUs.
HPEC
(2019)
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
PPT-GPU: Scalable GPU Performance Modeling.
IEEE Comput. Archit. Lett.
18 (2019)
Yehia Arafa
,
Abdel-Hameed A. Badawy
,
Gopinath Chennupati
,
Nandakishore Santhi
,
Stephan J. Eidenbenz
PPT-GPU: performance prediction toolkit for GPUs identifying the impact of caches: extended abstract.
MEMSYS
(2018)
Yehia Arafa
,
Atanu Barai
,
Mai Zheng
,
Abdel-Hameed A. Badawy
Evaluating the Fault Tolerance Performance of HDFS and Ceph.
PEARC
(2018)
Yehia Arafa
,
Atanu Barai
,
Mai Zheng
,
Abdel-Hameed A. Badawy
Fault Tolerance Performance Evaluation of Large-Scale Distributed Storage Systems HDFS and Ceph Case Study.
HPEC
(2018)