DeepMind

Risk assessment 29%

Evals: domains, quality, elicitation 50%
Evals: accountability 1%
Adversarial evaluation for alignment 0%
Model organisms 10%

Evals: domains, quality, elicitation

50%

Evals: accountability

1%

Adversarial evaluation for alignment

0%

Model organisms

10%