Blog

Some rough ideas

Some reactions

Testing an LM system for dangerous capabilities is crucial for assessing its risks

And new-ish page: Policy advocacy

New details on the Long-Term Benefit Trust, but most questions remain

Anthropic should share the details

16 companies commit to make RSPs

But they are doing model evals for dangerous capabilities

