Benchmark Early and Red Team Often: A Framework for Assessing and Managing Dual-Use Hazards of AI Foundation Models.
Anthony M. BarrettKrystal JacksonEvan R. MurphyNada MadkourJessica NewmanPublished in: CoRR (2024)
Keyphrases
- probabilistic model
- modeling framework
- artificial intelligence
- statistical models
- prior knowledge
- main contribution
- machine learning
- experimental data
- complex systems
- intelligent systems
- graphical models
- bayesian networks
- case study
- theoretical foundation
- autoregressive
- neural network
- mathematical framework
- emergency management
- john mccarthy