GeistHaus
log in · sign up

Introducing Claude Opus 4.5

anthropic.com

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

62 pages link to this URL
Simon Willison on llm-pricing

73 posts tagged ‘llm-pricing’. Posts about the pricing of various LLMs. See also my pricing calculator.

0 inbound links website en generative-ai 1793llms 1759ai 2026llm-release 200llm 600gemini 186pelican-riding-a-bicycle 115openai 419anthropic 282claude 275
Simon Willison on november-2025-inflection

14 posts tagged ‘november-2025-inflection’. November 2025 felt like an inflection point for coding agents, with Opus 4.5 and GPT 5.1 (and GPT-5.1 Codex) increasing the utility of those agents in a very n…

0 inbound links website en llms 1758generative-ai 1792ai 2025ai-assisted-programming 383coding-agents 203agentic-engineering 51gpt-5 30gpt 124anthropic 282llm-release 199
The AI App Experience Matters More Than Benchmarks Now

I was catching up on different articles after the release of Claude Opus 4.5 earlier this week, and this part from Simon Willison’s blog post about it stood out to me: I’m not saying the new model isn’t an improvement on Sonnet 4.5—but I can’t say with confidence that the challenges I posed it were

1 inbound link article en notes AIartificial intelligencefeaturedLLMs
The AI App Experience Matters More Than Benchmarks Now

I was catching up on different articles after the release of Claude Opus 4.5 earlier this week, and this part from Simon Willison’s blog post about it stood out to me: I’m not saying the new model isn’t an improvement on Sonnet 4.5—but I can’t say with confidence that the challenges I posed it were

1 inbound link article en notes AIartificial intelligencefeaturedLLMs
GitHub - danielmiessler/Fabric: Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere. - dan...

1 inbound link object en repository:738733003
305 - Subagents explained: What they are, when (not) to spawn them - Fragmented | AI Developer Podcast

Download directly Subagents are becoming a core primitive for serious AI-assisted development. In this episode, Kaushik and Iury disambiguate “agent” terminology, unpack plan mode vs subagents, and explain how parallel, scoped workers improve research quality without polluting the main thread. Full shownotes at fragmentedpodcast.com.

1 inbound link article en
GitHub - danielmiessler/Fabric: Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere. - dan...

3 inbound links object en repository:738733003
My take on AI is changing

Less than half a year ago I was not only skeptical towards AI in my field, software development, but also not very impressed by the results it produced at the time. Now, however, I rarely write code myself anymore, and I'm not really observing the problems I envisioned would arise by the increased use. But. Some concerns I had, I still have.

0 inbound links en Fredrik August Madsen-Malmohomepagesoftwaredevelopmentprogrammingk3sgolangrusttypescript
A Model Explosion: 4 in 1 Week! - Does it matter in AI?

5 models in 5 weeks. Then Anthropic released a new model, Claude 4.5 Opus, and I thought my post would be about it. But a rush of new models in the past few days can’t be ignored. What does it mean? Let’s dive in. Model #6: Anthropic On November 24, Anthropic announced Claude 4.5 Opus, to complete their

0 inbound links article en Uncategorized
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult

Anthropic released Claude Opus 4.5 this morning, which they call “best model in the world for coding, agents, and computer use”. This is their attempt to retake the crown for …

10 inbound links article en prompt-injection 147generative-ai 1790llms 1756anthropic 282claude 275evals 42llm-pricing 72pelican-riding-a-bicycle 113llm-release 199november-2025-inflection 14
GitHub - danielmiessler/Fabric: Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere. - dan...

15 inbound links object en repository:738733003
Into the Singularity

The singularity started somewhere around Thanksgiving 2025. Maybe it was Opus 4.5, maybe it was just everything in the air, but we hit the point where the future is genuinely different from the past. There’s a kind of imagination block where we don’t even know how to think about what comes next. There will be a transition to something else – it’s actually quite different to have intelligence everywhere in the world.

1 inbound link article en fragments AiSingularity
What to know about Claude Opus 4.5 - TechTalks

Anthropic responds to OpenAI and Google with Claude Opus 4.5, a model that prioritizes coding dominance, cost-efficiency, and user-controlled reasoning.

0 inbound links article en What is...AnthropicArtificial intelligence (AI)Claude LLMClaude Opus 4.5large language modelslarge reasoning modelsbendee983
Introducing SwiftPorts

When I was building SwiftBash I made surprisingly quick headway on the basic CLI utilities — jq, awk, sed, grep. Each one is a small, well-scoped language, and once you sit down with the spec it re…

0 inbound links article en CC BY-NC-ND 3.0
Stop Prompting, Start Briefing

For the past year I have been teaching technical teams a five-step system for getting better results from AI. I call it the Prompt Cycle. Teams that adopt it...

0 inbound links website en tw-jekyll
A Return to Code

You’re listening to the Naval Podcast. This is Nivi, his regular co-host. Today we’re going to talk about vibe coding. Let me tee up the conversation with a tweet from Naval from March 23rd: “AI coding agents can now deliver one-shot custom apps straight to your phone. More

0 inbound links article en Technology AIEntrepreneurshipTechnology
Coding Agent VMs on NixOS with microvm.nix

I have come to appreciate coding agents to be valuable tools for working with computer program code in any capacity, such as learning about any program’s architecture, diagnosing bugs or developing proofs of concept. Depending on the use-case, reviewing each command the agent wants to run can get tedious and time-consuming very quickly. To safely run a coding agent without review, I wanted a Virtual Machine (VM) solution where the agent has no access to my personal files and where it’s no big deal if the agent gets compromised by malware: I can just throw away the VM and start over.

5 inbound links en CC BY 4.0
Holiday update

Inbetween eggnog and christmas carols, I figured I’d bring a short update about things that have been on my mind that I wish to share before the new year. LLMs Gemini 3 Pro Preview was released, but I prefer Claude Opus 4.5 in Claude Code over Gemini for anything coding related, most of the time. Claude is more consistent in my opinion. Recently, with Opus, I do hit the rate limits much faster with my $20/month Claude Pro plan and sometimes fall back to Gemini 3. Apart from MCPs, I’ve also started to rely more and more on skills, which I heard OpenAI is adopting too. But for everything else, it’s Gemini 3 all the way. I’m even using it to plan an upcoming Japan trip. I wonder if Gemini 3 Flash will beat Claude in coding tasks (it kind of sounds like some people are very impressed)?

0 inbound links article en blog
The hard part was never building - David Mytton

The ability to complete no longer hinges solely on how many features you can build. The engineering gap between a two-person startup and a hundred-person org has collapsed.

0 inbound links article en Product EngineeringProductivityStartups
OpenClaw Showed Me What the Future of Personal AI Assistants Looks Like

Update, February 6: I've published an in-depth guide with advanced tips for secure credentials, memory management, automations, and proactive work with OpenClaw for our Club members here. For the past week or so, I’ve been working with a digital assistant that knows my name, my preferences for my morning routine, how I like to use

14 inbound links article en stories AIai experimentsartificial intelligenceclawdfeaturedLLMs
Simon Willison on prompt-injection

147 posts tagged ‘prompt-injection’. Prompt Injection is a security attack against applications built on top of Large Language Models, introduced here and further described in this series of posts.

0 inbound links website en llms 1751ai 2016generative-ai 1785security 602exfiltration-attacks 43openai 418prompt-engineering 190ai-agents 110lethal-trifecta 25anthropic 282
Agentic Engineering: Building Without Writing — Bill de hÓra

tars is a personal AI assistant with CLI, Web UI, Email, and Telegram channels, persistent memory, hybrid search, integration with tools I used all the time. About 35 features, 14kloc of python and 600 tests all told. I didn't write any of it. The experience was different enough from traditional de

2 inbound links article en
AI and software security: the slop is now signal

No matter how you feel about AI, it’s changing the world of software. The “T” in ChatGPT was invented to improve language translation, and large language models (LLMs) are very good at this. Interestingly, translating between French and Japanese is effectively the same as translating between English and Python for these systems. As LLMs improve, we’re also finding that there’s little difference between “help me fix mistakes in this document”, and “find the flaws in this codebase”. LLMs are now great at both tasks, but the latter has much larger implications.

0 inbound links article en