Exploring the Robustness of Model-Graded Evaluations and Automated Interpretability.

Simon Lermen Ondrej Kvapil

Published in: CoRR (2023)

Keyphrases

computational model
theoretical analysis
neural network
hybrid model
formal model
theoretical framework
statistical model
probability distribution
cost function
simulation model
prior knowledge
management system
data sets
mathematical model
sensitivity analysis
theoretical foundation
parameter values
decision trees
semi automated