DeepMind

Risk assessment 34%

60%
Evals: domains, quality, elicitation
1%
Evals: accountability
0%
Adversarial evaluation for alignment
10%
Model organisms

Evals: domains, quality, elicitation

60%
Click to show details/rubric

Evals: accountability

1%
Click to show details/rubric

Adversarial evaluation for alignment

0%
Click to show details/rubric

Model organisms

10%
Click to show details/rubric