SafeAI@AAAI
Publications
2023
Peter Barnett, Rachel Freedman, Justin Svegliato, Stuart Russell
Active Reward Learning from Multiple Teachers.
SafeAI@AAAI
(2023)
Valency Oscar Colaco, Simin Nadjm-Tehrani
Formal Verification of Tree Ensembles against Real-World Composite Geometric Perturbations.
SafeAI@AAAI
(2023)
Maryam Bagheri, Josephine Lamp, Xugui Zhou, Lu Feng, Homa Alemzadeh
Towards Developing Safety Assurance Cases for Learning-Enabled Medical Cyber-Physical Systems.
SafeAI@AAAI
(2023)
Alberto Huertas Celdrán, Jan Kreischer, Melike Demirci, Joel Leupp, Pedro Miguel Sánchez Sánchez, Muriel Figueredo Franco, Gérôme Bovet, Gregorio Martínez Pérez, Burkhard Stiller
A Framework Quantifying Trustworthiness of Supervised Machine and Deep Learning Models.
SafeAI@AAAI
(2023)
Stephen Casper, Dylan Hadfield-Menell, Gabriel Kreiman
White-Box Adversarial Policies in Deep Reinforcement Learning.
SafeAI@AAAI
(2023)
Axel Brando, Isabel Serra, Enrico Mezzetti, Francisco J. Cazorla, Jaume Abella
Standardizing the Probabilistic Sources of Uncertainty for the sake of Safety Deep Learning.
SafeAI@AAAI
(2023)
Teddy Ferdinan, Jan Kocon
Personalized Models Resistant to Malicious Attacks for Human-centered Trusted AI.
SafeAI@AAAI
(2023)
Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon
On Evaluating Adversarial Robustness of Chest X-ray Classification.
SafeAI@AAAI
(2023)
Saaduddin Mahmud, Sandhya Saisubramanian, Shlomo Zilberstein
REVEALE: Reward Verification and Learning Using Explanations.
SafeAI@AAAI
(2023)
Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Vineet Gundecha, Sahand Ghorbanpour, Alexander Shmakov, Ricardo Luna Gutierrez, Antonio Guillen, Avisek Naug
Robustness with Black-Box Adversarial Attack using Reinforcement Learning.
SafeAI@AAAI
(2023)
Fateh Kaakai, Paul-Marie Raffi
Towards Multi-timescale Online Monitoring of AI Models.
SafeAI@AAAI
(2023)
Chiara Picardi, Richard Hawkins, Colin Paterson, Ibrahim Habli
Transfer Assurance for Machine Learning in Autonomous Systems.
SafeAI@AAAI
(2023)
Dirk Eilers, Simon Burton, Felippe Schmoeller da Roza, Karsten Roscher
Safety Assurance with Ensemble-based Uncertainty Estimation and overlapping alternative Predictions in Reinforcement Learning.
SafeAI@AAAI
(2023)
Chenyang Yang, Rachel A. Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu
Capabilities for Better ML Engineering.
SafeAI@AAAI
(2023)
Soumyadeep Pal, Ren Wang, Yuguang Yao, Sijia Liu
Towards Understanding How Self-training Tolerates Data Backdoor Poisoning.
SafeAI@AAAI
(2023)
Sumanta Dey, Pallab Dasgupta, Soumyajit Dey
Safe Reinforcement Learning through Phasic Safety-Oriented Policy Optimization.
SafeAI@AAAI
(2023)
Weimin Zhao, Sanaa A. Alwidian, Qusay H. Mahmoud
Evaluation of GAN Architectures for Adversarial Robustness of Convolution Classifier.
SafeAI@AAAI
(2023)
Yize Li, Pu Zhao, Xue Lin, Bhavya Kailkhura, Ryan A. Goldhahn
Less is More: Data Pruning for Faster Adversarial Training.
SafeAI@AAAI
(2023)
Khondoker Murad Hossain, Tim Oates
Backdoor Attack Detection in Computer Vision by Applying Matrix Factorization on the Weights of Deep Networks.
SafeAI@AAAI
(2023)
Maxime Fuccellaro, Laurent Simon, Akka Zemmari
A Robust Drift Detection Algorithm with High Accuracy and Low False Positives Rate.
SafeAI@AAAI
(2023)
Nikiforos Pittaras, Sean McGregor
A Taxonomic System for Failure Cause Analysis of Open Source AI Incidents.
SafeAI@AAAI
(2023)
Juliette Mattioli, Henri Sohier, Agnès Delaborde, Gabriel Pedroza, Kahina Amokrane-Ferka, Afef Awadid, Zakaria Chihani, Souhaiel Khalfaoui
Towards a holistic approach for AI trustworthiness assessment based upon aids for multi-criteria aggregation.
SafeAI@AAAI
(2023)
Fabio Arnez, Ansgar Radermacher, François Terrier
Out-of-Distribution Detection Using Deep Neural Network Latent Space Uncertainty.
SafeAI@AAAI
(2023)
Václav Divis, Tobias Schuster, Marek Hrúz
Domain-centric ADAS Datasets.
SafeAI@AAAI
(2023)
Felippe Schmoeller Roza, Simon Hadwiger, Ingo Thon, Karsten Roscher
Towards Safety Assurance of Uncertainty-Aware Reinforcement Learning Agents.
SafeAI@AAAI
(2023)
Chen Chen, Haibo Hong, Mande Xie, Jun Shao, Tao Xiang
Bab: A novel algorithm for training clean model based on poisoned data.
SafeAI@AAAI
(2023)
Tian Tan, Carlos Huertas, Qi Zhao
Efficient and Effective Uncertainty Quantification in Gradient Boosting via Cyclical Gradient MCMC.
SafeAI@AAAI
(2023)
Matthias König, Annelot Bosman, Holger H. Hoos, Jan N. van Rijn
Critically Assessing the State of the Art in CPU-based Local Robustness Verification.
SafeAI@AAAI
(2023)
volume 3381, 2023
Proceedings of the Workshop on Artificial Intelligence Safety 2023 (SafeAI 2023) co-located with the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023), Washington DC, USA, February 13-14, 2023.
SafeAI@AAAI
3381 (2023)
2022
Pascal Gerber, Lisa Jöckel, Michael Kläs
A Study on Mitigating Hard Boundaries of Decision-Tree-based Uncertainty Estimates for AI Models.
SafeAI@AAAI
(2022)
Ann-Katrin Reuel, Mark Koren, Anthony Corso, Mykel J. Kochenderfer
Using Adaptive Stress Testing to Identify Paths to Ethical Dilemmas in Autonomous Systems.
SafeAI@AAAI
(2022)
Juliette Mattioli, Gabriel Pedroza, Souhaiel Khalfaoui, Bertrand Leroy
Combining Data-Driven and Knowledge-Based AI Paradigms for Engineering AI-Based Safety-Critical Systems.
SafeAI@AAAI
(2022)
Saeed Bakhshi Germi, Esa Rahtu
A Practical Overview of Safety Concerns and Mitigation Methods for Visual Deep Learning Algorithms.
SafeAI@AAAI
(2022)
Bertrand Braunschweig, Rodolphe Gelin, François Terrier
The wall of safety for AI: approaches in the Confiance.ai program.
SafeAI@AAAI
(2022)
Hal Ashton
Defining and Identifying the Legal Culpability of Side Effects using Causal Graphs.
SafeAI@AAAI
(2022)
Ignacio Serna, Daniel DeAlcala, Aythami Morales Moreno, Julian Fiérrez, Javier Ortega-Garcia
IFBiD: Inference-Free Bias Detection.
SafeAI@AAAI
(2022)
Edoardo Manino, Danilo Carvalho, Yi Dong, Julia Rozanova, Xidan Song, Mustafa A. Mustafa, André Freitas, Gavin Brown, Mikel Lujan, Xiaowei Huang, Lucas C. Cordeiro
EnnCore: End-to-End Conceptual Guarding of Neural Architectures.
SafeAI@AAAI
(2022)
Mathieu Godbout, Maxime Heuillet, Sharath Chandra Raparthy, Rupali Bhati, Audrey Durand
A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning.
SafeAI@AAAI
(2022)
Jin Woo Ro, Gerald Lüttgen, Diedrich Wolter
Reinforcement Learning With Imperfect Safety Constraints.
SafeAI@AAAI
(2022)
Peter Barnett, John Burden
Oases of Cooperation: An Empirical Evaluation of Reinforcement Learning in the Iterated Prisoner's Dilemma.
SafeAI@AAAI
(2022)
Hal Ashton, Matija Franklin
The Problem of Behaviour and Preference Manipulation in AI Systems.
SafeAI@AAAI
(2022)
Amany Alshareef, Nicolas Berthier, Sven Schewe, Xiaowei Huang
Quantifying the Importance of Latent Features in Neural Networks.
SafeAI@AAAI
(2022)
Preston Putzel, Scott Lee
Blackbox Post-Processing for Multiclass Fairness.
SafeAI@AAAI
(2022)
Leopold Müller, Lars Böcking, Michael Färber
Safety Aware Reinforcement Learning by Identifying Comprehensible Constraints in Expert Demonstrations.
SafeAI@AAAI
(2022)
Michal Filipiuk, Vasu Singh
Comparing Vision Transformers and Convolutional Nets for Safety Critical Systems.
SafeAI@AAAI
(2022)
Adrian Schwaiger, Kristian Schwienbacher, Karsten Roscher
Beyond Test Accuracy: The Effects of Model Compression on CNNs.
SafeAI@AAAI
(2022)
Poulami Sinhamahapatra, Rajat Koner, Karsten Roscher, Stephan Günnemann
Is it all a cluster game? - Exploring Out-of-Distribution Detection based on Clustering in the Embedding Space.
SafeAI@AAAI
(2022)
Sheila Alemany, Niki Pissinou
The Dilemma Between Data Transformations and Adversarial Robustness for Time Series Application Systems.
SafeAI@AAAI
(2022)
Prajit T. Rajendran, Huáscar Espinoza, Agnès Delaborde, Chokri Mraidha
Human-in-the-loop Learning for Safe Exploration through Anomaly Prediction and Intervention.
SafeAI@AAAI
(2022)
volume 3087, 2022
Proceedings of the Workshop on Artificial Intelligence Safety 2022 (SafeAI 2022) co-located with the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI 2022), Virtual, February 2022.
SafeAI@AAAI
3087 (2022)