Testing an LM system for dangerous capabilities is crucial for assessing its risks
And new-ish page: Policy advocacy
New details on the Long-Term Benefit Trust, but most questions remain
Anthropic should share the details
But they should
16 companies commit to make RSPs
But they are doing model evals for dangerous capabilities
Subscribe on Substack.