SafeAI@AAAI

Publications

2023

Peter Barnett, Rachel Freedman, Justin Svegliato, Stuart Russell
Active Reward Learning from Multiple Teachers. SafeAI@AAAI (2023)
Valency Oscar Colaco, Simin Nadjm-Tehrani
Formal Verification of Tree Ensembles against Real-World Composite Geometric Perturbations. SafeAI@AAAI (2023)
Maryam Bagheri, Josephine Lamp, Xugui Zhou, Lu Feng, Homa Alemzadeh
Towards Developing Safety Assurance Cases for Learning-Enabled Medical Cyber-Physical Systems. SafeAI@AAAI (2023)
Alberto Huertas Celdrán, Jan Kreischer, Melike Demirci, Joel Leupp, Pedro Miguel Sánchez Sánchez, Muriel Figueredo Franco, Gérôme Bovet, Gregorio Martínez Pérez, Burkhard Stiller
A Framework Quantifying Trustworthiness of Supervised Machine and Deep Learning Models. SafeAI@AAAI (2023)
Stephen Casper, Dylan Hadfield-Menell, Gabriel Kreiman
White-Box Adversarial Policies in Deep Reinforcement Learning. SafeAI@AAAI (2023)
Axel Brando, Isabel Serra, Enrico Mezzetti, Francisco J. Cazorla, Jaume Abella
Standardizing the Probabilistic Sources of Uncertainty for the sake of Safety Deep Learning. SafeAI@AAAI (2023)
Teddy Ferdinan, Jan Kocon
Personalized Models Resistant to Malicious Attacks for Human-centered Trusted AI. SafeAI@AAAI (2023)
Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon
On Evaluating Adversarial Robustness of Chest X-ray Classification. SafeAI@AAAI (2023)
Saaduddin Mahmud, Sandhya Saisubramanian, Shlomo Zilberstein
REVEALE: Reward Verification and Learning Using Explanations. SafeAI@AAAI (2023)
Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Vineet Gundecha, Sahand Ghorbanpour, Alexander Shmakov, Ricardo Luna Gutierrez, Antonio Guillen, Avisek Naug
Robustness with Black-Box Adversarial Attack using Reinforcement Learning. SafeAI@AAAI (2023)
Fateh Kaakai, Paul-Marie Raffi
Towards Multi-timescale Online Monitoring of AI Models. SafeAI@AAAI (2023)
Chiara Picardi, Richard Hawkins, Colin Paterson, Ibrahim Habli
Transfer Assurance for Machine Learning in Autonomous Systems. SafeAI@AAAI (2023)
Dirk Eilers, Simon Burton, Felippe Schmoeller da Roza, Karsten Roscher
Safety Assurance with Ensemble-based Uncertainty Estimation and overlapping alternative Predictions in Reinforcement Learning. SafeAI@AAAI (2023)
Chenyang Yang, Rachel A. Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu
Capabilities for Better ML Engineering. SafeAI@AAAI (2023)
Soumyadeep Pal, Ren Wang, Yuguang Yao, Sijia Liu
Towards Understanding How Self-training Tolerates Data Backdoor Poisoning. SafeAI@AAAI (2023)
Sumanta Dey, Pallab Dasgupta, Soumyajit Dey
Safe Reinforcement Learning through Phasic Safety-Oriented Policy Optimization. SafeAI@AAAI (2023)
Weimin Zhao, Sanaa A. Alwidian, Qusay H. Mahmoud
Evaluation of GAN Architectures for Adversarial Robustness of Convolution Classifier. SafeAI@AAAI (2023)
Yize Li, Pu Zhao, Xue Lin, Bhavya Kailkhura, Ryan A. Goldhahn
Less is More: Data Pruning for Faster Adversarial Training. SafeAI@AAAI (2023)
Khondoker Murad Hossain, Tim Oates
Backdoor Attack Detection in Computer Vision by Applying Matrix Factorization on the Weights of Deep Networks. SafeAI@AAAI (2023)
Maxime Fuccellaro, Laurent Simon, Akka Zemmari
A Robust Drift Detection Algorithm with High Accuracy and Low False Positives Rate. SafeAI@AAAI (2023)
Nikiforos Pittaras, Sean McGregor
A Taxonomic System for Failure Cause Analysis of Open Source AI Incidents. SafeAI@AAAI (2023)
Juliette Mattioli, Henri Sohier, Agnès Delaborde, Gabriel Pedroza, Kahina Amokrane-Ferka, Afef Awadid, Zakaria Chihani, Souhaiel Khalfaoui
Towards a holistic approach for AI trustworthiness assessment based upon aids for multi-criteria aggregation. SafeAI@AAAI (2023)
Fabio Arnez, Ansgar Radermacher, François Terrier
Out-of-Distribution Detection Using Deep Neural Network Latent Space Uncertainty. SafeAI@AAAI (2023)
Václav Divis, Tobias Schuster, Marek Hrúz
Domain-centric ADAS Datasets. SafeAI@AAAI (2023)
Felippe Schmoeller Roza, Simon Hadwiger, Ingo Thon, Karsten Roscher
Towards Safety Assurance of Uncertainty-Aware Reinforcement Learning Agents. SafeAI@AAAI (2023)
Chen Chen, Haibo Hong, Mande Xie, Jun Shao, Tao Xiang
Bab: A novel algorithm for training clean model based on poisoned data. SafeAI@AAAI (2023)
Tian Tan, Carlos Huertas, Qi Zhao
Efficient and Effective Uncertainty Quantification in Gradient Boosting via Cyclical Gradient MCMC. SafeAI@AAAI (2023)
Matthias König, Annelot Bosman, Holger H. Hoos, Jan N. van Rijn
Critically Assessing the State of the Art in CPU-based Local Robustness Verification. SafeAI@AAAI (2023)

volume 3381, 2023

Proceedings of the Workshop on Artificial Intelligence Safety 2023 (SafeAI 2023) co-located with the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023), Washington DC, USA, February 13-14, 2023. SafeAI@AAAI 3381 (2023)

2022

volume 3087, 2022

Proceedings of the Workshop on Artificial Intelligence Safety 2022 (SafeAI 2022) co-located with the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI2022), Virtual, February, 2022. SafeAI@AAAI 3087 (2022)