← Back to all entries
2026-03-25 🧭 Daily News

Auto Mode, 81K Voices & Changelog Highlights

Auto Mode, 81K Voices & Changelog Highlights — visual for 2026-03-25

🧭 Claude Code Auto Mode — The Middle Ground Between Speed and Control

Anthropic shipped a research preview of Auto Mode for Claude Code on March 24 — a new operating level designed to sit squarely between two extremes that developers have long had to choose between: constant approval prompts (safe but slow) and --dangerously-skip-permissions (fast but scary). Auto Mode threads that needle by adding an AI classifier layer that reviews each tool call before execution, approving the safe ones silently and blocking — or escalating — the risky ones.

How the classifier works

Availability and caveats

Best practice for adopting Auto Mode: Start with low-stakes repositories and observe which actions Claude takes silently vs. which it escalates. After a few runs you'll develop intuition for what the classifier approves, and can calibrate your task scoping accordingly. Pair with Claude Code's --verbose flag during the learning phase to see the full tool-call log.

Claude Code auto mode safety permissions developer tools

🧭 What 81,000 People Want From AI — Anthropic's Landmark User Study

Anthropic published what it describes as the largest and most multilingual qualitative study ever conducted — 80,508 conversations with Claude users across 159 countries in 70 languages, conducted in December 2025 using Anthropic Interviewer, an AI-powered conversational research tool. The March 18 publication crystallises a picture of how real people think about AI: not as a chatbot novelty, but as a lever on the most important dimensions of their lives — work, money, time, and meaning.

What people want (top visions)

Where AI has actually delivered

Top concerns

The geographic split: Sub-Saharan Africa, South Asia and Latin America are significantly more optimistic than North America and Western Europe. Lower/middle-income users frame AI as a "ladder up" — bypass capital barriers, build businesses, access expertise. Higher-income regions focus on governance gaps and surveillance concerns. 67% of all respondents expressed net positive sentiment; 81% said AI was already making progress toward their vision.

The study also identified five "light and shade" tensions where the same users simultaneously experience benefit and worry — most sharply for emotional support (16% benefit, 12% fear dependency) and time-saving (50% gain time, 18% feel a "productivity treadmill" instead). These are not separate camps of optimists and pessimists: they are the same people, holding both things at once.

research user study global Anthropic AI impact

🧭 Claude Code v2.1.83 — New Hooks, Drop-In Config & Transcript Search

The latest Claude Code release (v2.1.83) ships four quality-of-life improvements that, taken together, make the tool significantly more scriptable, observable, and comfortable to use in larger, longer sessions. None of these are headline features, but they're the kind of sharp edges that become obvious once you're running Claude Code as a daily driver.

What's new

Hook recipe — auto-lint on FileChanged:

{
  "hooks": {
    "FileChanged": [
      {
        "matcher": "**/*.py",
        "hooks": [{ "type": "command", "command": "ruff check --fix $CLAUDE_FILE_PATH" }]
      }
    ]
  }
}

Drop this in your managed-settings.d/ directory and every Python file Claude edits gets auto-linted before you review the diff.

Claude Code changelog hooks configuration developer tools

🧭 Vercept Closes, OSWorld Hits 72.5% — Claude's Computer Use Comes of Age

March 25 marked a quiet but significant milestone: Anthropic wound down Vercept's external product as the February acquisition formally completed. Vercept — founded by Kiana Ehsani, Luca Weihs, and Ross Girshick — built Vy, a cloud-hosted AI agent that operated a remote Apple MacBook to complete complex multi-step tasks. Their speciality was high-precision visual perception within live software interfaces: the exact capability Anthropic needed to make Mac Computer Use genuinely reliable. The team is now fully inside Anthropic working on what the company describes as "some of the hardest problems" in agentic AI.

The benchmark that puts it in context

Coinciding with the integration, Anthropic published an updated OSWorld score for Claude Sonnet 4.6: 72.5%. OSWorld is the standard evaluation for AI computer-use ability — tasks like filling web forms across multiple browser tabs, navigating complex spreadsheets, and managing files across applications. The trajectory tells the story:

That is roughly a 5× improvement in 15 months. For comparison, human performance on OSWorld sits at around 72–74%, meaning Claude is now within the noise of an average human on this measure — and considerably faster.

What this means for Claude Code users: the Auto Mode and Dispatch workflows covered earlier this week are only as useful as the underlying computer-use accuracy. A 72.5% OSWorld score means roughly 3 in 4 GUI interactions succeed on the first attempt — practical for supervised use, but still worth giving Claude clear stopping conditions and running in a sandboxed environment for anything sensitive.

computer use Vercept OSWorld benchmarks Anthropic acquisitions