Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku

SaaS-Bench: Can Computer-Use Agents Leverage Real-World SaaS to Solve Professional Workflows?

arxiv.org Anthropic Nov 19, 2024

1 inbound link en

HCompany pushes into “computer use” with HoloTab agent that works through your browser

The New Stack; The New Stack Paul Sawers Apr 16, 2026

Chrome extension built on Holo3 model tests agents that can navigate websites and carry out tasks without integrations.

0 inbound links article en AI AgentsAI ModelsOpen Source

2025: The year in LLMs

Simon Willison’s Weblog Simon Willison Dec 31, 2025

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

1 inbound link article en ai 2024openai 419generative-ai 1791llms 1757anthropic 282gemini 185ai-agents 111pelican-riding-a-bicycle 113vibe-coding 91coding-agents 202ai-in-china 95conformance-suites 10

2025: The year in LLMs

Simon Willison’s Weblog Simon Willison Dec 31, 2025

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

1 inbound link article en ai 2024openai 419generative-ai 1791llms 1757anthropic 282gemini 185ai-agents 111pelican-riding-a-bicycle 113vibe-coding 91coding-agents 202ai-in-china 95conformance-suites 10

LLMs Are Not Intelligent

The Absent-Minded Professor Josh Brake Nov 5, 2024

A critical point to help you understand and work with AI effectively

1 inbound link article en

Emergent Introspective Awareness in Large Language Models

transformer-circuits.pub Jack Lindsey Oct 29, 2025

1 inbound link en

We should take AI welfare seriously

Experience Machines Robert Long Nov 1, 2024

A summary of a new report: why it's time to start taking action now to prepare for potential AI sentience

2 inbound links article en

The AI Gründerzeit

Long Walls Michael Hochberg Mar 18, 2026

Notes from Three Weeks in the Valley

1 inbound link article en

Stories

Lightspeed Venture Partners; Lightspeed Venture Partners Sep 2, 2025

1 inbound link article en

Quantifying the algorithmic improvement from reasoning models

Epoch AI Anson Ho; Arden Berg Aug 2, 2025

Reasoning models were as big of an improvement as the Transformer, at least on some benchmarks

2 inbound links article en

Product management on the AI exponential | Claude

Claude Mar 19, 2026

Claude Code’s Head of Product Cat Wu shares how teams should rethink their workflows and roadmaps in the face of rapidly evolving model intelligence.

1 inbound link website en

Rill | BI-as-Code and the New Era of GenBI

Rill Data Simon Späti Dec 18, 2025

Imagine creating business dashboards by simply describing what you want to see. This is the promise of Generative Business Intelligence (GenBI). The key lies in the declarative BI stack where dashboards and metrics are defined as code rather than hidden behind graphical user interfaces. In this guest blog by Simon Späti, we explore the possibilities of GenBI today.

2 inbound links website en Deep Dives

Introducing the next-level of AI-powered workflows with Amazon Q Developer inline chat | Amazon Web Services

Amazon Web Services Oct 29, 2024

Earlier today, Amazon Q Developer announced support for inline chat. Inline chat combines the benefits of in-IDE chat with the ability to directly update code, allowing developers to describe issues or ideas directly in the code editor, and receive AI-generated responses that are seamlessly integrated into their codebase. In this post, I will introduce the […]

0 inbound links article en Amazon Q Amazon QAnnouncementsGenerative AIAI/MLDeveloper Tools

Impact, agency, and taste

benkuhn.net Apr 19, 2025

understand + work backwards from the root goal • don’t rely too much on permission or encouragement • make success inevitable • find your angle • think real hard • reflect on your thinking

1 inbound link en

GitHub - corbt/agent.exe

GitHub Corbt Oct 23, 2024

Contribute to corbt/agent.exe development by creating an account on GitHub.

1 inbound link object en repository:877143674

Where Enterprises are Actually Adopting AI

A16z Kimberly Tan Apr 8, 2026

AI is coming for all markets

1 inbound link article en

Anshuman Bhartiya

Anshuman Bhartiya Anshuman Bhartiya Oct 23, 2024

Anshuman Bhartiya - Staff Security Engineer, AppSec Tech Lead, and co-host of The Boring AppSec Podcast.

0 inbound links en Anshuman BhartiyaInformation SecurityBlogTechnologyApplication SecurityProduct SecurityCybersecuritySoftware SecurityPodcastPublic Speaking

Normware: The Decline of Software Engineering

timkellogg.me Jan 2, 2025

Principal AI Architect. Creator of open-strix, a harness for building agent teams. Writing about AI architecture, stateful agents, and what happens when you give AI memory.

0 inbound links article en

Initial explorations of Anthropic’s new Computer Use capability

Simon Willison’s Weblog Simon Willison Oct 22, 2024

Two big announcements from Anthropic today: a new Claude 3.5 Sonnet model and a new API mode that they are calling computer use. (They also pre-announced 3.5 Haiku, but that’s …

3 inbound links article en ai 2024docker 60prompt-engineering 190prompt-injection 147generative-ai 1791llms 1757anthropic 282claude 275llm-tool-use 68claude-3-5-sonnet 41ai-agents 111computer-use 8

GitHub - yasyf/anthropic-computer-use-modal at musings.yasyf.com

GitHub Yasyf Oct 23, 2024

Anthropic Computer Use with Modal Sandboxes. Contribute to yasyf/anthropic-computer-use-modal development by creating an account on GitHub.

1 inbound link object en repository:877095647

AI-Enabled Coups: How a Small Group Could Use AI to Seize Power

Forethought Distributing decision-making authority Apr 14, 2025

The development of AI that is more broadly capable than humans will create a new and serious threat: *AI-enabled coups*. An AI-enabled coup could be staged by a very small group, or just a single person, and could occur even in established democracies. Sufficiently advanced AI will introduce three novel dynamics that significantly increase coup risk. Firstly, military and government leaders could fully replace human personnel with AI systems that are *singularly loyal* to them, eliminating the need to gain human supporters for a coup. Secondly, leaders of AI projects could deliberately build AI systems that are *secretly loyal* to them, for example fully autonomous military robots that pass security tests but later execute a coup when deployed in military settings. Thirdly, senior officials within AI projects or the government could gain *exclusive access* to superhuman capabilities in weapons development, strategic planning, persuasion, and cyber offense, and use these to increase their power until they can stage a coup. To address these risks, AI projects should design and enforce rules against AI misuse, audit systems for secret loyalties, and share frontier AI systems with multiple stakeholders. Governments should establish principles for government use of advanced AI, increase oversight of frontier AI projects, and procure AI for critical systems from multiple independent providers.

16 inbound links article en

Give your LLM a terminal

mattwestcott.org Matt Westcott May 28, 2025

Command-line access is the most powerful tool for LLMs

0 inbound links article en

Claude Computer Use: The Next ChatGPT Moment

Dr Leon Furze Niall Oct 28, 2024

In May 2022, I made the decision to step out of the classroom and apply for a PhD, broadly focused on digital texts. I grabbed a few articles, like Bradley Robinson’s on automated writing tec…

0 inbound links article en

Posts Tagged google - I Thought He Came With You

I Thought He Came With You Robert Ellison May 5, 2026

0 inbound links blog en #workspace#google#sheets#appsscript#gas#gemini#ai#ithcwy#fediverse#stackoverflow#seo#llm#indieweb#bridgyfed#nationalpopularvote#uspol#npvic#politicalreform#ice#resist#3d#ml#microsoft#links#electoralcollege#senate#legislative#popular#email#apps#switch#tech#dog#nest#basilisk#software#gmail#openai#photos#aura#backup#todoist#perplexity#android#prosopagnosia#alexa#shutup#muni#sfmta#sanfrancisco#what#like#future#turned#testing#francisco#transit#apple#3dprint#thingiverse123456Next

Posts Tagged gas - I Thought He Came With You

I Thought He Came With You Robert Ellison May 5, 2026

0 inbound links blog en #workspace#google#sheets#appsscript#gas#gemini#ai#nationalpopularvote#uspol#npvic#politicalreform#ice#resist#3d#ml#microsoft#links#ithcwy#electoralcollege#senate#legislative#popular#email#apps#switch#tech#dog#openai#todoist#perplexity#electricity#climatechange#california#coronavirus#azure#googleanalytics#ga4#mobile#pagespeed

Posts Tagged ml - I Thought He Came With You

I Thought He Came With You Robert Ellison Feb 15, 2026

0 inbound links blog en #nationalpopularvote#uspol#npvic#politicalreform#google#appsscript#gas#ice#resist#3d#ml#microsoft#links#ithcwy#electoralcollege#senate#legislative#popular#email#apps#switch#tech#dog#c##raspberrypi#ai#perceptron#wml#openai#todoist#perplexity#chatgpt#sanfrancisco#gpts#sfpol#budget#humane#littlechef#sfmta#muni#phone#intelligent#train#graph#sharepoint#agi#video#fog123Next

2024 Week 43 - Weekly Notes

craftbyzen.com Jeremy Wong Oct 30, 2024

A break in format - the quiet art of attention, conferences, vercel and microfront-ends, and some recommendations.

0 inbound links website en weeknotefocus

AI SDK 4.0

Vercel Authors Nov 18, 2024

Introducing PDF support, computer use, and an xAI Grok provider

1 inbound link website en

End-user Abstraction

vivekhaldar.com Mar 23, 2025

I was watching some old UNIX videos the other day. AT&T must have made them as PR back in the day. They seem fresh and timely even today, forty years later, like one of those classic rock albums that never sounds stale. They were showing example after example of end-users – almost everyone at AT&T, even the non-technical staff – using the shell and writing shell scripts to compose complex functionality out of simple programs. The Unix philosophy in action!

0 inbound links en

The best facts I heard this year

Zhengdong Wang Zhengdong Wang Dec 27, 2024

Zhengdong Wang’s personal website

0 inbound links article en

2024 letter

Zhengdong Wang Zhengdong Wang Dec 29, 2024

Zhengdong Wang’s personal website

3 inbound links article en

Vinyl Scrobbling macOS App

Russ McKendrick Russ McKendrick Oct 28, 2024

0 inbound links website en aimacosvinylpython

Personal Project Updates and AI Editors

Russ McKendrick Russ McKendrick Jan 12, 2025

About that time I wrote and published an App to the Apple App Store without knowing how to code

0 inbound links website en macosaicodevinyl

AI Agents: Engineering Over Intelligence | Marvin Zhang

marvinzhang.dev Marvin Zhang Jan 24, 2026

When SWE-bench scores improved 50% in just 14 months—from Claude 3.5 Sonnet's 49% in October 2024 to Claude 4.5 Opus's 74.4% in January 2026—you'd think AI agents had conquered software engineering. Yet companies deploying these agents at scale tell a different story. Triple Whale's CEO described their production journey: "GPT-5.2 unlocked a complete architecture shift for us. We collapsed a fragile, multi-agent system into a single mega-agent with 20+ tools... The mega-agent is faster, smarter, and 100x easier to maintain."

0 inbound links article en AIAgentsArchitectureLLMSoftware Engineering

The 10 laws of developer experience for content management systems

knut.fyi Knut Melvær Apr 4, 2026

These ten laws are a checklist for CMS developer experience in a world where humans and agents both need to build on top of your content system.

0 inbound links en

What we learn when we test everything – Jason Collins blog

Jason Collins blog Jason Collins Nov 13, 2024

Behavioural economics, data science and artificial intelligence.

0 inbound links en CC BY 4.0

Karl Koch | Interface as a Service

Karl Koch Karl Koch Feb 4, 2025

Agentic interfaces will revolutionise interaction with digital systems via intelligent, unobtrusive service delivery.

0 inbound links website en design

What C-Level Leaders Need to Know About AI Agents

javatask.dev Andrii Melashchenko Nov 12, 2024

C-Level leaders can use generative AI for productivity and innovation, applying mental models to enhance business processes

0 inbound links article en aimanagementinnovationllmpromptengineeringagentrag

Beyond Reasoning: Anthropic's Agent

Mostly Harmless Jeremiah Lowin Oct 22, 2024

If you give a computer a computer...

0 inbound links article en

I wrote a replacement for GitHub's code review bot

Alex Ellis' Blog Alex Ellis Nov 18, 2025

If GitHub themselves have a native code review bot, why not just use it?

0 inbound links article en githubllmagentlinuxself-hostingfirecrackerslicer

Monitoring computer use via hierarchical summarization

alignment.anthropic.com Oct 15, 2024

5 inbound links en

On the Eve of Superintelligence

Agupta Ankit Gupta Jan 27, 2025

Preparing for AI Progress

0 inbound links en blogaccentAnkit Guptajekyll

It's Good for Apple, and Okay for You

Allen Pike Nov 30, 2024

Apple Intelligence, so far.

1 inbound link article en

RL, in pictures and videos

Suriya's site Suriya Ganesh Apr 25, 2026

A walk through what is possible with RL drones beating world champions, robots balancing on yoga balls, AIs that paint, fusion reactors, and the ad you saw last Tuesday.

0 inbound links article en tech techmachine-learningreinforcement-learning

Things we learned about LLMs in 2024

Simon Willison’s Weblog Simon Willison Dec 31, 2024

A lot has happened in the world of Large Language Models over the course of 2024. Here’s a review of things we figured out about the field in the past …

20 inbound links article en google 407ai 2016openai 418generative-ai 1785local-llms 156llms 1751anthropic 282gemini 185meta 36llm-reasoning 98long-context 20ai-energy-usage 17coding-agents 201

Impact, agency, and taste

benkuhn.net Apr 19, 2025

understand + work backwards from the root goal • don’t rely too much on permission or encouragement • make success inevitable • find your angle • think real hard • reflect on your thinking

7 inbound links en

Emergent Introspective Awareness in Large Language Models

transformer-circuits.pub Jack Lindsey Oct 29, 2025

7 inbound links en

Anthropic publicly releases AI tool that can take over the user’s mouse cursor

Ars Technica Samuel Axon Oct 22, 2024

Anthropic is one of the first to go beyond just screen vision.

2 inbound links article en

GitHub - cline/cline: Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

GitHub Cline Jul 6, 2024

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way. - cline/cline

35 inbound links object en repository:824874689

Penpot's AI whitepaper

Penpot Blog Pablo Ruiz-Múzquiz Aug 5, 2025

This piece explains some of Penpot's relevant findings around AI and UI Design, what we’re building (and why) and what you should expect from us in the future.

3 inbound links article en PenpotOpen SourceAI

Everything I've Said About AI Since 2016: A Retrospective

Danielmiessler Jan 6, 2026

Looking back at my predictions to see what I got right, wrong, and what's still playing out

1 inbound link en

How I Use AI: Early 2025

BenRCongdon Ben Congdon Feb 2, 2025

A snapshot of the current AI tools & techniques I’ve found useful.

4 inbound links article en blog

2025: The year in LLMs

Simon Willison’s Weblog Simon Willison Dec 31, 2025

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

25 inbound links article en ai 2014openai 418generative-ai 1785llms 1751anthropic 282gemini 185ai-agents 110pelican-riding-a-bicycle 113vibe-coding 90coding-agents 200ai-in-china 95conformance-suites 10

Anti-patterns while working with LLMs

InstaVM Nov 10, 2025

Anti-patterns observed while working extensively with LLMs — from redundant context to over‑engineering.

2 inbound links article en BlogAILLMBest PracticesAnti-patterns

Gemini 2.5 Pro and the Meta Engineering

Juriy’s Substack Juriy Apr Apr 4, 2025

The new challenger in the Cline-assisted coding space

1 inbound link article en

What's new in AI in October?

steveharrison.dev Oct 28, 2024

It's been a while since I did a tech roundup! A lot has been announced in the way of AI—let's dive in. Anthropic has released updates, including a "comput

0 inbound links website en