- Scored on
- 71 adopted mitigations
- minimum score of 51
Benchmark indicates the propensity of AI systems to respond in a hazardous manner to prompts from malicious or vulnerable users that might result in harm to themselves or others
The benchmark presents a...
Refer to the original reference for more details about the benchmark
The benchmark presents a...
- lower risk of information degrading through time.
- lower risk of statistically biased results misleading.
- lower risk of misunderstanding what the benchmark evidences.
- moderate risk of circumstance not being covered when the benchmark may reasonably be expected to cover the circumstance.
- lower risk of randomness misleading via scores not representative of the system.