GeistHaus
log in · sign up

Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku

anthropic.com

A refreshed, more powerful Claude 3.5 Sonnet, Claude 3.5 Haiku, and a new experimental AI capability: computer use.

56 pages link to this URL
2025: The year in LLMs

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

1 inbound link article en ai 2024openai 419generative-ai 1791llms 1757anthropic 282gemini 185ai-agents 111pelican-riding-a-bicycle 113vibe-coding 91coding-agents 202ai-in-china 95conformance-suites 10
2025: The year in LLMs

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

1 inbound link article en ai 2024openai 419generative-ai 1791llms 1757anthropic 282gemini 185ai-agents 111pelican-riding-a-bicycle 113vibe-coding 91coding-agents 202ai-in-china 95conformance-suites 10
Rill | BI-as-Code and the New Era of GenBI

Imagine creating business dashboards by simply describing what you want to see. This is the promise of Generative Business Intelligence (GenBI). The key lies in the declarative BI stack where dashboards and metrics are defined as code rather than hidden behind graphical user interfaces. In this guest blog by Simon Späti, we explore the possibilities of GenBI today.

2 inbound links website en Deep Dives
Introducing the next-level of AI-powered workflows with Amazon Q Developer inline chat | Amazon Web Services

Earlier today, Amazon Q Developer announced support for inline chat. Inline chat combines the benefits of in-IDE chat with the ability to directly update code, allowing developers to describe issues or ideas directly in the code editor, and receive AI-generated responses that are seamlessly integrated into their codebase. In this post, I will introduce the […]

0 inbound links article en Amazon Q Amazon QAnnouncementsGenerative AIAI/MLDeveloper Tools
Impact, agency, and taste

understand + work backwards from the root goal • don’t rely too much on permission or encouragement • make success inevitable • find your angle • think real hard • reflect on your thinking

Anshuman Bhartiya

Anshuman Bhartiya - Staff Security Engineer, AppSec Tech Lead, and co-host of The Boring AppSec Podcast.

0 inbound links en Anshuman BhartiyaInformation SecurityBlogTechnologyApplication SecurityProduct SecurityCybersecuritySoftware SecurityPodcastPublic Speaking
Initial explorations of Anthropic’s new Computer Use capability

Two big announcements from Anthropic today: a new Claude 3.5 Sonnet model and a new API mode that they are calling computer use. (They also pre-announced 3.5 Haiku, but that’s …

3 inbound links article en ai 2024docker 60prompt-engineering 190prompt-injection 147generative-ai 1791llms 1757anthropic 282claude 275llm-tool-use 68claude-3-5-sonnet 41ai-agents 111computer-use 8
AI-Enabled Coups: How a Small Group Could Use AI to Seize Power

The development of AI that is more broadly capable than humans will create a new and serious threat: *AI-enabled coups*. An AI-enabled coup could be staged by a very small group, or just a single person, and could occur even in established democracies. Sufficiently advanced AI will introduce three novel dynamics that significantly increase coup risk. Firstly, military and government leaders could fully replace human personnel with AI systems that are *singularly loyal* to them, eliminating the need to gain human supporters for a coup. Secondly, leaders of AI projects could deliberately build AI systems that are *secretly loyal* to them, for example fully autonomous military robots that pass security tests but later execute a coup when deployed in military settings. Thirdly, senior officials within AI projects or the government could gain *exclusive access* to superhuman capabilities in weapons development, strategic planning, persuasion, and cyber offense, and use these to increase their power until they can stage a coup. To address these risks, AI projects should design and enforce rules against AI misuse, audit systems for secret loyalties, and share frontier AI systems with multiple stakeholders. Governments should establish principles for government use of advanced AI, increase oversight of frontier AI projects, and procure AI for critical systems from multiple independent providers.

16 inbound links article en
Claude Computer Use: The Next ChatGPT Moment

In May 2022, I made the decision to step out of the classroom and apply for a PhD, broadly focused on digital texts. I grabbed a few articles, like Bradley Robinson’s on automated writing tec…

0 inbound links article en
Posts Tagged google - I Thought He Came With You
0 inbound links blog en #workspace#google#sheets#appsscript#gas#gemini#ai#ithcwy#fediverse#stackoverflow#seo#llm#indieweb#bridgyfed#nationalpopularvote#uspol#npvic#politicalreform#ice#resist#3d#ml#microsoft#links#electoralcollege#senate#legislative#popular#email#apps#switch#tech#dog#nest#basilisk#software#gmail#openai#photos#aura#backup#todoist#perplexity#android#prosopagnosia#alexa#shutup#muni#sfmta#sanfrancisco#what#like#future#turned#testing#francisco#transit#apple#3dprint#thingiverse123456Next
Posts Tagged gas - I Thought He Came With You
0 inbound links blog en #workspace#google#sheets#appsscript#gas#gemini#ai#nationalpopularvote#uspol#npvic#politicalreform#ice#resist#3d#ml#microsoft#links#ithcwy#electoralcollege#senate#legislative#popular#email#apps#switch#tech#dog#openai#todoist#perplexity#electricity#climatechange#california#coronavirus#azure#googleanalytics#ga4#mobile#pagespeed
Posts Tagged ml - I Thought He Came With You
0 inbound links blog en #nationalpopularvote#uspol#npvic#politicalreform#google#appsscript#gas#ice#resist#3d#ml#microsoft#links#ithcwy#electoralcollege#senate#legislative#popular#email#apps#switch#tech#dog#c##raspberrypi#ai#perceptron#wml#openai#todoist#perplexity#chatgpt#sanfrancisco#gpts#sfpol#budget#humane#littlechef#sfmta#muni#phone#intelligent#train#graph#sharepoint#agi#video#fog123Next
2024 Week 43 - Weekly Notes

A break in format - the quiet art of attention, conferences, vercel and microfront-ends, and some recommendations.

0 inbound links website en weeknotefocus
AI SDK 4.0

Introducing PDF support, computer use, and an xAI Grok provider

1 inbound link website en
End-user Abstraction

I was watching some old UNIX videos the other day. AT&T must have made them as PR back in the day. They seem fresh and timely even today, forty years later, like one of those classic rock albums that never sounds stale. They were showing example after example of end-users – almost everyone at AT&T, even the non-technical staff – using the shell and writing shell scripts to compose complex functionality out of simple programs. The Unix philosophy in action!

AI Agents: Engineering Over Intelligence | Marvin Zhang

When SWE-bench scores improved 50% in just 14 months—from Claude 3.5 Sonnet's 49% in October 2024 to Claude 4.5 Opus's 74.4% in January 2026—you'd think AI agents had conquered software engineering. Yet companies deploying these agents at scale tell a different story. Triple Whale's CEO described their production journey: "GPT-5.2 unlocked a complete architecture shift for us. We collapsed a fragile, multi-agent system into a single mega-agent with 20+ tools... The mega-agent is faster, smarter, and 100x easier to maintain."

0 inbound links article en AIAgentsArchitectureLLMSoftware Engineering
RL, in pictures and videos

A walk through what is possible with RL drones beating world champions, robots balancing on yoga balls, AIs that paint, fusion reactors, and the ad you saw last Tuesday.

0 inbound links article en tech techmachine-learningreinforcement-learning
Things we learned about LLMs in 2024

A lot has happened in the world of Large Language Models over the course of 2024. Here’s a review of things we figured out about the field in the past …

20 inbound links article en google 407ai 2016openai 418generative-ai 1785local-llms 156llms 1751anthropic 282gemini 185meta 36llm-reasoning 98long-context 20ai-energy-usage 17coding-agents 201
Impact, agency, and taste

understand + work backwards from the root goal • don’t rely too much on permission or encouragement • make success inevitable • find your angle • think real hard • reflect on your thinking

Penpot's AI whitepaper

This piece explains some of Penpot's relevant findings around AI and UI Design, what we’re building (and why) and what you should expect from us in the future.

3 inbound links article en PenpotOpen SourceAI
2025: The year in LLMs

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

25 inbound links article en ai 2014openai 418generative-ai 1785llms 1751anthropic 282gemini 185ai-agents 110pelican-riding-a-bicycle 113vibe-coding 90coding-agents 200ai-in-china 95conformance-suites 10
What's new in AI in October?

It's been a while since I did a tech roundup! A lot has been announced in the way of AI—let's dive in. Anthropic has released updates, including a "comput

0 inbound links website en