🧭 Claude Sonnet 4.5 — Targeted Instruction-Following Improvement Update
Anthropic has released an update to Claude Sonnet 4.5 targeting instruction-following consistency, particularly in complex system prompts and multi-turn conversations with multiple simultaneous constraints. The update does not change the model version identifier (claude-sonnet-4-5): it refines the same model and requires no code changes from developers. Because Anthropic releases targeted behavioural improvements within a stable model alias, existing integrations benefit automatically.
Improvements in this update
- Constraint persistence across turns — constraints specified in the system prompt (such as output format, response language, or persona restrictions) are now maintained more reliably across long conversations where earlier instructions might previously have been partially overridden by later user messages
- Nested instruction resolution — when the system prompt and a user message contain related but not identical instructions, Claude is now better at identifying the correct interpretation rather than arbitrarily favouring one source
- Operator format compliance — JSON-only output mode and other strict operator-specified formats show improved compliance, rising from 94% to 98% on the internal instruction-following evaluation suite
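For operators who want to measure JSON-only compliance on their own traffic, a minimal sketch follows. It is not Anthropic's evaluation suite: the function names `is_strict_json` and `compliance_rate`, the sample responses, and the definition of compliance used here (the entire response parses as a JSON object or array) are all assumptions made for illustration.

```python
import json


def is_strict_json(text: str) -> bool:
    """Return True if the whole response parses as a JSON object or array.

    Assumed definition of "JSON-only": any prose before or after the
    payload, or a truncated payload, counts as a failure.
    """
    try:
        parsed = json.loads(text)
    except json.JSONDecodeError:
        return False
    return isinstance(parsed, (dict, list))


def compliance_rate(responses: list[str]) -> float:
    """Fraction of responses that satisfy is_strict_json."""
    if not responses:
        raise ValueError("need at least one response")
    return sum(is_strict_json(r) for r in responses) / len(responses)


# Hypothetical sample of model outputs collected by an operator.
sample = [
    '{"status": "ok"}',                          # compliant object
    '[1, 2, 3]',                                 # compliant array
    'Sure! Here is the JSON: {"status": "ok"}',  # prose wrapper, fails
    '{"status": "ok"',                           # truncated, fails
]

rate = compliance_rate(sample)
print(f"JSON-only compliance: {rate:.0%}")  # 2 of 4 responses pass: 50%
```

Running the same check on a fixed prompt set before and after an update gives a rough local counterpart to the reported figures. Teams with stricter requirements would likely also validate each payload against a schema.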
Sonnet 4.5
instruction following
model update
operators
retrospective
🧭 Refreshed Sonnet 4.5 Model Card — Updated Evaluations and Failure Mode Notes
Alongside the instruction-following update, Anthropic has published a refreshed model card for Claude Sonnet 4.5 that reflects the updated model's current evaluation results and adds new entries to the Known Failure Modes section introduced in the January model card refresh. The model card is updated to version 1.2 and includes benchmark results run against the updated checkpoint.
Notable model card updates
- Updated benchmark table — MMLU, HumanEval, and instruction-following benchmark numbers now reflect the updated January checkpoint, with delta annotations showing changes from the initial November release
- New failure mode entries — two new failure modes are documented: a tendency to over-literalise format instructions when no formatting is necessary, and occasional inconsistency when asked to maintain a numbered list structure across more than 15 turns
- Extended thinking interaction notes — a new section describes how the instruction-following improvements interact with extended thinking mode, noting that thinking-token usage tends to improve constraint adherence in complex multi-part instructions
The updated model card is available at anthropic.com/research/model-card-sonnet-4-5 and reflects Anthropic's commitment to maintaining living documentation that tracks model behaviour across updates.
model cards
Sonnet 4.5
documentation
transparency
retrospective