Compact Proofs of Model Performance via Mechanistic Interpretability.
Jason GrossRajashree AgrawalThomas KwaEuan OngChun Hei YipAlex GibsonSoufiane NoubirLawrence ChanPublished in: CoRR (2024)
Keyphrases
- formal model
- theoretical analysis
- probabilistic model
- mathematical model
- statistical model
- conceptual model
- data sets
- artificial intelligence
- bayesian networks
- closed form
- machine learning
- artificial neural networks
- cost function
- management system
- database
- computational model
- theoretical framework
- sensitivity analysis