GeistHaus
log in · sign up

Introducing Sonnet 4.6

anthropic.com

Claude Sonnet 4.6 is a full upgrade of the model’s skills across coding, computer use, long-reasoning, agent planning, knowledge work, and design.

26 pages link to this URL
Simon Willison on llm-pricing

73 posts tagged ‘llm-pricing’. Posts about the pricing of various LLMs. See also my pricing calculator.

0 inbound links website en generative-ai 1793llms 1759ai 2026llm-release 200llm 600gemini 186pelican-riding-a-bicycle 115openai 419anthropic 282claude 275
daveverse

Opus 4.6 is much smarter than the other one. It feels like I’m working with someone from Bronx Science. I had been using Sonnet 4.6, which I switched to after reading somewhere that it costs …

0 inbound links article en
Simon Willison on pelican-riding-a-bicycle

114 posts tagged ‘pelican-riding-a-bicycle’. My benchmark for LLMs: "Generate an SVG of a pelican riding a bicycle". Here's my answer to what happens if AI labs train for pelicans riding bicycles?. "User …

0 inbound links website en ai 2025generative-ai 1792llms 1758llm-release 199llm 600llm-reasoning 98ai-in-china 95llm-pricing 72openai 419google 407
Introducing Claude Sonnet 4.6

Sonnet 4.6 is out today, and Anthropic claim it offers similar performance to November's Opus 4.5 while maintaining the Sonnet pricing of $3/million input and $15/million output tokens (the Opus …

2 inbound links article en ai 2024generative-ai 1791llms 1757llm 600anthropic 282claude 275llm-pricing 72pelican-riding-a-bicycle 113llm-release 199claude-code 112
Simon Willison on pelican-riding-a-bicycle

113 posts tagged ‘pelican-riding-a-bicycle’. My benchmark for LLMs: "Generate an SVG of a pelican riding a bicycle". Here's my answer to what happens if AI labs train for pelicans riding bicycles?. "User …

0 inbound links website en generative-ai 1791ai 2024llms 1757llm-release 199llm 600llm-reasoning 98ai-in-china 95llm-pricing 72openai 419google 407
Simon Willison on pelican-riding-a-bicycle

113 posts tagged ‘pelican-riding-a-bicycle’. My benchmark for LLMs: "Generate an SVG of a pelican riding a bicycle". Here's my answer to what happens if AI labs train for pelicans riding bicycles?. "User …

0 inbound links website en generative-ai 1791ai 2024llms 1757llm-release 199llm 600llm-reasoning 98ai-in-china 95llm-pricing 72openai 419google 407
Home - Hacker Bits

Our Magazine HACKER BITS is the monthly magazine that gives you the hottest technology stories straight from Hacker News. We select from the top voted stories for you and email them to you in an easy-to-read email magazine format. hacker ... Read More

0 inbound links website en
Simon Willison on pelican-riding-a-bicycle

113 posts tagged ‘pelican-riding-a-bicycle’. My benchmark for LLMs: "Generate an SVG of a pelican riding a bicycle". Here's my answer to what happens if AI labs train for pelicans riding bicycles?. "User …

0 inbound links website en generative-ai 1790ai 2023llms 1756llm-release 199llm 600llm-reasoning 98ai-in-china 95llm-pricing 72openai 419google 407
Beyond the Vibes: A Rigorous Guide to AI Coding Assistants and Agents - tedious ramblings

AI Coding Assistants and Agents are changing the way software is created, but developers have been expected to learn how to use them without any guidance. As managers push developers into adopting these tools how are they expected to avoid the dangers and pitfalls of vibe coding? In this post I take a high level look at the tools and practices developers can use to move quickly with AI without wanting to set their code bases on fire.

0 inbound links article en Guides coding assistantspec driven developmentcoding agentopenspecvibe coding
Miss One Weekend, Fall Behind One Month

I went on a weekend trip. I came back to a new Claude, a new GPT and an existential crisis. The pace of AI is no longer monthly. It's weekly.

0 inbound links article en CC BY-SA 4.0
Lockd & Loaded (February 20 2026)

Claude Sonnet 4.6 Released - I am all-in on the Claude train at the moment. Sonnet 4.6 has raised my expectation for other “default” frontier models. Anthropic seems to want to eliminate low-skill office work with these models. Useful? Yes. Potentially dangerous to society? Absolutely. It’s Not Just You - the iOS keyboard is broken - As an Apple user, I’ve been annoyed for a decade at how bad the keyboard on iOS is. This video is proof I’m not the only one having issues with it. FOSDEM 2025 - The Road to Mainstream Matrix - I flirted with the Matrix protocol in 2019 as I was searching for a persistent alternative to IRC that wasn’t Discord. I want Matrix to succeed, but I’ve also been curious why it hasn’t seemed to take off. This video explains the trouble that project is facing. That’s all for this week! God bless.

0 inbound links article en post
Claude Sonnet 4.6: Agentic Power vs. Safety

Anthropic’s latest model brings Opus-level reasoning to a mid-tier price point. We analyze the technical leaps, the lingering security flaws, and the community’

0 inbound links article en StoriesTech AnthropicClaude
Hacker Bits, Issue 123 - Hacker Bits

Welcome to issue 123 of Hacker Bits! We have a fantastic lineup this month spanning AI-assisted development and its limits, the human side of technical leadership, time handling in JavaScript, a rogue AI security incident and some honest reflection on ... Read More

0 inbound links article en
Simon Willison on llm-release

199 posts tagged ‘llm-release’. New releases of various LLMs.

0 inbound links website en LLMsllms 1751generative-ai 1785ai 2016pelican-riding-a-bicycle 113llm 598local-llms 156llm-reasoning 98ai-in-china 95llm-pricing 72gemini 185
Simon Willison on claude-code

112 posts tagged ‘claude-code’. Claude Code is Anthropic's terminal coding agent, enabling Claude to run tools in a loop on your own machine.

0 inbound links website en llms 1751ai 2016generative-ai 1785coding-agents 201ai-assisted-programming 381anthropic 282claude 275ai-agents 110projects 526vibe-coding 90
Agentic Engineering: Building Without Writing — Bill de hÓra

tars is a personal AI assistant with CLI, Web UI, Email, and Telegram channels, persistent memory, hybrid search, integration with tools I used all the time. About 35 features, 14kloc of python and 600 tests all told. I didn't write any of it. The experience was different enough from traditional de

2 inbound links article en