Adaptive thinking — GeistHaus

Vincent Schmalbach; Vincent Schmalbach ChatGPT May 9, 2026

Opus 4.7 is not generally a worse model than Opus 4.6, but there is a real downgrade: with Opus 4.7, the control over the thinking budget is now fully owned by Anthropic. This change matters in a way…

0 inbound links article en

Improving Deep Agents with Harness Engineering | vtrivedy

blog.langchain.com Vivek Trivedy Feb 17, 2026

Our coding agent went from Top 30 to Top 5 on Terminal Bench 2.0. We only changed the harness. Here's our approach to harness engineering.

0 inbound links BlogPosting en

Quo Vadis, Agentic Engineering?

Bartosz's blog Apr 22, 2026

The post highlights constraints, mechanisms, and factors influencing Agentic Engineering, emphasizing the types of bottlenecks we’re hitting and how GPU shortages are driving product changes.

0 inbound links article en posts llmengineering

Hacker News

Claude Opus 4.7 Apr 16, 2026

2 inbound links en

Best practices for computer and browser use with Claude | Claude

Claude May 13, 2026

Practical guidance for developers building computer and browser use integrations with the Claude model family.

3 inbound links website en

Opus 4.7 Low Vs Medium Vs High Vs Xhigh Vs Max: the Reasoning Curve on 29 Real Tasks from an Open Source Repo

Stet Ben Redmond May 12, 2026

Claude Opus 4.7 reasoning-effort curve on 29 matched GraphQL-go-tools tasks: low, medium, high, xhigh, and max. Medium wins the behavioral metrics; more reasoning does not reliably buy better patches.

0 inbound links article en Opus 4.7 reasoning effortClaude Opus 4.7 benchmarkClaude Code reasoning effortGraphQL-go-tools benchmarkAI coding agent evaluationStet reasoning curveadaptive thinking

Introducing Sonnet 4.6

anthropic.com Feb 16, 2026

Claude Sonnet 4.6 is a full upgrade of the model’s skills across coding, computer use, long-reasoning, agent planning, knowledge work, and design.

26 inbound links website en

Claude Opus 4.6

anthropic.com Feb 5, 2026

We’re upgrading our smartest model. Across agentic coding, computer use, tool use, search, and finance, Opus 4.6 is an industry-leading model, often by wide margin.

58 inbound links website en