Responsible Scaling Policy v3.0 — Full Text Published
Anthropic has published the full text of Responsible Scaling Policy (RSP) v3.0, following the policymaker pre-briefings conducted earlier this week. The document is the most substantial revision of the RSP since the AI Safety Level (ASL) framework was introduced in 2023, and it makes three significant structural changes to how Anthropic governs model deployment.
The three structural changes in RSP v3.0
- Deployment Review Board (DRB): a new internal governance body with authority to delay any model release pending additional safety evaluation. The DRB comprises representatives from research, policy, and legal, along with external safety advisors, and must reach consensus before any ASL-3-or-above candidate proceeds to deployment review.
- Mandatory third-party audit for ASL-3 candidates: models that trigger ASL-3 screening criteria must be evaluated by at least two independent external auditors before the DRB review. Previously, third-party audit was voluntary and aspirational.
- ARARA evaluation as a hard gate: the new autonomous replication and resource acquisition (ARARA) evaluation cluster introduced in the February Risk Report is now a mandatory hard gate at every model release, not an advisory check.
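The three changes above can be read as a set of release gates. The following is a minimal illustrative sketch of that reading, not Anthropic's implementation: the `ReleaseCandidate` type, its field names, and the `blocking_reasons` function are all hypothetical, and the sketch assumes for simplicity that a single flag covers both "triggers ASL-3 screening criteria" and "ASL-3 or above".

```python
from dataclasses import dataclass
from typing import List

# Hypothetical constant: the text requires "at least two" independent auditors.
MIN_INDEPENDENT_AUDITORS = 2


@dataclass
class ReleaseCandidate:
    """Hypothetical record of a model release candidate's review status."""
    name: str
    asl3_or_above: bool       # assumption: one flag for ASL-3 screening and above
    arara_passed: bool        # result of the ARARA evaluation cluster
    independent_audits: int   # completed audits by distinct external auditors
    drb_consensus: bool       # whether the DRB has reached consensus


def blocking_reasons(candidate: ReleaseCandidate) -> List[str]:
    """Return the gates that still block release; an empty list means none do."""
    reasons = []

    # ARARA is described as a hard gate at every model release.
    if not candidate.arara_passed:
        reasons.append("ARARA evaluation not passed")

    # The audit and DRB consensus requirements apply to ASL-3 candidates.
    if candidate.asl3_or_above:
        if candidate.independent_audits < MIN_INDEPENDENT_AUDITORS:
            reasons.append("fewer than two independent external audits")
        if not candidate.drb_consensus:
            reasons.append("DRB consensus not reached")

    return reasons


if __name__ == "__main__":
    candidate = ReleaseCandidate(
        name="example-model",
        asl3_or_above=True,
        arara_passed=True,
        independent_audits=1,
        drb_consensus=False,
    )
    for reason in blocking_reasons(candidate):
        print(reason)
```

Returning a list of reasons rather than a single boolean keeps each gate independent, which matches the "hard gate" framing: no gate can be offset by passing another.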
Anthropic has also updated the language around ASL-3 trigger criteria to adopt the revised uplift thresholds from the February Risk Report. The full document is available at anthropic.com/research/rsp-v3.