AI Lab Watch

Other safety-related scorecards for AI companies or models:

FLI scored companies on safety: AI Safety Index (2025)
SaferAI scored companies' risk management practices: Are AI Companies Preparing for a Safe Future? (Papadatos et al. 2025)
The Midas Project scored companies' "risk evaluation policies": Seoul Commitment Tracker (2025)
CFI scored companies on the UK safety recommendations: Do companies' AI Safety Policies meet government best practice? (Ó hÉigeartaigh et al. 2023)
CLTC applied their risk-management guidance to four language models: "Appendix 4: Retrospective test use of Profile draft guidance" in "AI Risk-Management Standards Profile for General-Purpose AI Systems (GPAIS) and Foundation Models, Version 1.0" (Barrett et al. 2023)
Scale has an adversarial robustness leaderboard (2025); previously they had a different adversarial robustness leaderboard (2024)
PRISM has an adversarial robustness leaderboard
Scale has an honesty leaderboard
DarkBench (Kran et al. 2025) evaluates "dark design patterns"
TrustLLM (Huang et al. 2024) evaluates language models' trustworthiness
DecodingTrust (Wang et al. 2023) evaluates language models' trustworthiness
HydroX has an adversarial robustness leaderboard
CRFM has made (non-existential) "safety" model scorecards: HELM, especially HELM Safety (blog post) and AIR-Bench; draft EU AI Act compliance (Bommasani et al. 2023); and transparency (Bommasani et al. 2023)
How are AI companies doing with their voluntary commitments on vulnerability reporting? (Jones 2024)
A Safe Harbor for AI Evaluation and Red Teaming (Longpre et al. 2024) evaluated how companies enable/stifle evaluation and red-teaming of their models by the AI community

Other comparative evaluation/analysis on AI companies' safety:

Lab Statements on AI Governance (GovAI: Wei et al. 2023) evaluated three companies' statements on AI policy
Thoughts on the AI Safety Summit company policy requests and responses (MIRI: Soares 2023)
Is OpenAI's Preparedness Framework better than its competitors' "Responsible Scaling Policies"? A Comparative Analysis (SaferAI 2024)

The UK Department for Science, Innovation and Technology published best practices for AI safety in October 2023; some companies responded by describing some of their practices.

Evaluation/analysis of a particular AI company (not comparative):

Responsible Scaling: Comparing Government Guidance and Company Policy (IAPS: Anderson-Samways et al. 2024)

Scorecards for government policy proposals:

FLI, Nov 2023: AI Governance Scorecard and Safety Standards Policy
AIPI, Sep 2023: AI Policies Under the Microscope

Scorecards for governments:

Stephen Casper et al., Feb 2025: Pitfalls of Evidence-Based AI Policy
