GeistHaus
log in · sign up

Adaptive thinking

platform.claude.com

Let Claude dynamically determine when and how much to use extended thinking with adaptive thinking mode.

8 pages link to this URL
Is Opus 4.7 a Downgrade?

Opus 4.7 is not generally a worse model than Opus 4.6, but there is a real downgrade: with Opus 4.7, the control over the thinking budget is now fully owned by Anthropic. This change matters in a way…

0 inbound links article en
Quo Vadis, Agentic Engineering?

The post highlights constraints, mechanisms, and factors influencing Agentic Engineering, emphasizing the types of bottlenecks we’re hitting and how GPU shortages are driving product changes.

0 inbound links article en posts llmengineering
Opus 4.7 Low Vs Medium Vs High Vs Xhigh Vs Max: the Reasoning Curve on 29 Real Tasks from an Open Source Repo

Claude Opus 4.7 reasoning-effort curve on 29 matched GraphQL-go-tools tasks: low, medium, high, xhigh, and max. Medium wins the behavioral metrics; more reasoning does not reliably buy better patches.

0 inbound links article en Opus 4.7 reasoning effortClaude Opus 4.7 benchmarkClaude Code reasoning effortGraphQL-go-tools benchmarkAI coding agent evaluationStet reasoning curveadaptive thinking
Introducing Sonnet 4.6

Claude Sonnet 4.6 is a full upgrade of the model’s skills across coding, computer use, long-reasoning, agent planning, knowledge work, and design.

26 inbound links website en
Claude Opus 4.6

We’re upgrading our smartest model. Across agentic coding, computer use, tool use, search, and finance, Opus 4.6 is an industry-leading model, often by wide margin.

58 inbound links website en