AI Lab Watch

			*
Weighted score	28%	20%	18%	5%	4%	3%	1%
Risk assessment	44%	29%	34%	6%	6%	1%	0%	27% weight
Scheming risk prevention	4%	8%	3%	2%	2%	2%	2%	21% weight
Boosting safety research	70%	52%	35%	25%	0%	13%	8%	14% weight
Misuse prevention	14%	4%	9%	0%	1%	0%	0%	12% weight
Prep for extreme security	3%	5%	0%	0%	0%	0%	0%	12% weight
Risk info sharing	35%	13%	32%	0%	28%	0%	0%	8% weight
Planning	14%	26%	0%	0%	0%	1%	0%	6% weight

Up to date as of September 15

Overall score

Anthropic

DeepMind

OpenAI

DeepMind

Risk assessment 29%

50%

Evals: domains, quality, elicitation

Evals: accountability

Adversarial evaluation for alignment

10%

Model organisms

AI companies should do model evals and uplift experiments to determine whether models have dangerous capabilities or how close they are. They should also prepare to check whether models will act well in high-stakes situations.

Compare all companies on risk assessment

Evals: domains, quality, elicitation

50%

Click to show details/rubric

Evals: accountability

Click to show details/rubric

Adversarial evaluation for alignment

Click to show details/rubric

Model organisms

10%

Click to show details/rubric

AI Lab Watch

Categories

Companies

Resources

Blog

About

DeepMind

Risk assessment 29%

Evals: domains, quality, elicitation

Evals: accountability

Adversarial evaluation for alignment

Model organisms