Claude 3.7 Sonnet and Claude Code

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

arxiv.org ∗ Equal Contribution See Contribution Statement Jun 30, 2019

1 inbound link en

2025: The year in LLMs

Simon Willison’s Weblog Simon Willison Dec 31, 2025

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

1 inbound link article en ai 2024openai 419generative-ai 1791llms 1757anthropic 282gemini 185ai-agents 111pelican-riding-a-bicycle 113vibe-coding 91coding-agents 202ai-in-china 95conformance-suites 10

2025: The year in LLMs

Simon Willison’s Weblog Simon Willison Dec 31, 2025

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

1 inbound link article en ai 2024openai 419generative-ai 1791llms 1757anthropic 282gemini 185ai-agents 111pelican-riding-a-bicycle 113vibe-coding 91coding-agents 202ai-in-china 95conformance-suites 10

Emergent Introspective Awareness in Large Language Models

transformer-circuits.pub Jack Lindsey Oct 29, 2025

1 inbound link en

The State of LLM Reasoning Model Inference

Ahead of AI Sebastian Raschka; PhD Mar 8, 2025

Inference-Time Compute Scaling Methods to Improve Reasoning Models

1 inbound link article en

Stories

Lightspeed Venture Partners; Lightspeed Venture Partners Sep 2, 2025

1 inbound link article en

How I wrote JustHTML using coding agents

friendlybit.com Emil Stenström Dec 3, 2025

I recently released JustHTML, a python-based HTML5 parser. It passes 100% of the html5lib test suite, has zero dependencies, and includes a CSS selector...

1 inbound link en

Quantifying the algorithmic improvement from reasoning models

Epoch AI Anson Ho; Arden Berg Aug 2, 2025

Reasoning models were as big of an improvement as the Transformer, at least on some benchmarks

2 inbound links article en

How Good Is Gemini 2.5 Pro at Writing R Code? | Simon P. Couch

Simon P. Couch Apr 1, 2025

Since Gemini 2.5 Pro’s release last week, I’ve been seeing a lot of hype claiming that the model is the new state of the art. How well does it know R?

0 inbound links en CC BY-SA 4.0

From IDE to AI orchestration: The end of code-first development

maxghenis.com Jan 11, 2026

How AI coding tools evolved from autocomplete to autonomous agents, and why I expect to abandon my IDE entirely this month.

0 inbound links website en

Productive Struggle: How Artificial Intelligence Is Changing Learning, Effort, and Youth Development in Education | Bellwether

Bellwether Zoe Campbell Jun 25, 2025

3 inbound links article en Publications

Anthropic made a big mistake

archaeologist.dev Jan 12, 2026

1 inbound link en

Is Cursor worth it for developers? | Alex Hyett

alexhyett.com Mar 9, 2025

Last weekend I thought I would try out Cursor, the AI code editor. I have been using Claude for the odd programming question, small script but never for anything larger.

0 inbound links website en

Réflexions sur l'impact de l'IA générative sur les applications logiciels

nocodefunctions.com Nocode functions Aug 18, 2025

Ce sont des notes personnelles sur la manière dont l’IA générative impacte les applications logicielles. Ce post est traduit de l’anglais et fait suite à ce post précédent, en anglais aussi. Ce que l’IA générative peut faire en août 2025 1. Prendre des fichiers multimodaux en entrée Jusqu’à l’année dernière, la plupart des applis d’IA générative étaient limitées quant aux types de fichiers qu’elles pouvaient lire. Gemini ne pouvait même pas lire les PDF si je me souviens bien. Cette barrière est tombée. La plupart des applis acceptent désormais txt, csv, pdf, docx, xlsx, pptx, tout xml ou json. Plus important encore, toutes les grandes applis d’IA générative sont désormais multimodales : elles acceptent du texte mais aussi des images en entrée et peuvent en faire des analyses de contenu très détaillées. J’ai testé avec Le Chat de Mistral (version gratuite), Gemini, Claude (version gratuite) et ChatGPT : toutes ont pu décrire une capture d’écran de mon ordinateur avec un grand niveau de précision. L’analyse sonore est arrivée aussi. On peut réaliser une analyse détaillée d’un fichier .wav, par exemple (en août 2025, seul ChatGPT possède cette capacité). Et la vidéo ? Pour l’instant, seul ChatGPT peut traiter des vidéos : sur la vidéo que j’ai testée, il extrait un échantillon d’images puis tente d’en déduire un sens. 2. Réaliser des analyses avancées, à la volée ChatGPT peut désormais lancer un mini environnement informatique pour traiter vos requêtes à la volée : il peut exécuter n’importe quel code Python disponible librement en bibliothèque packagée sur le web. 3. Raisonner La capacité de raisonnement est ce qui a fait le succès de l’IA générative : l’effet saisissant des LLMs est qu’ils (semblent) raisonner comme le ferait un humain. À vous d’apprécier si cet humain correspond à « un stagiaire », « un étudiant » ou « un niveau doctorat », mais cela reste assez proche de ce qu’un humain ferait. Le raisonnement a pris une nouvelle tournure depuis l’automne 2024 (Op

0 inbound links fr

Unless its governance changes, Anthropic is untrustworthy

anthropic.ml Mikhail Samin Nov 28, 2025

0 inbound links en

Hidden Technical Debt of AI Systems: Agent Harness

Han, Not Solo Han Lee May 8, 2026

The agent is the model plus the harness. The runtime is where the harness lives. As models get better, the structure we put around them turns from scaffoldin...

0 inbound links article en blogs AI EngineeringAgent SystemsCompound AI SystemsMLOpsGenerative AILLMReinforcement Learning

30,656 Pages of Books About the .NET Ecosystem: C#, Blazor, ASP.NET, & T-SQL - Kerrick Long (blog)

Kerrick Long (blog) - Articles about programming, learning, code, books, and teams Kerrick Long Mar 8, 2025

When I went to find the iconic books to learn the .NET stack I came to a shocking realization. There are too many books! 53 books; 30,656 pages; over 757 hours.

0 inbound links article en Blog Posts.NETBlazorBooksC#RailsRubyT-SQL

Everything Wrong with MCP

Shrivu’s Substack Shrivu Shankar Apr 13, 2025

Explaining the Model Context Protocol and everything that might go wrong.

4 inbound links article en

Language models transmit behavioural traits through hidden signals in data - Nature

Nature Publishing Group UK Alex Cloud; Minh Le; James Chua; Jan Betley; Anna Sztyber-Betley; Sören Mindermann; Jacob Hilton; Samuel Marks; Owain Evans Apr 15, 2026

During model distillation, large language models can subtly transmit traits unrelated to the training data.

1 inbound link article en Computer scienceSoftware Computer scienceSoftwareScienceHumanities and Social Sciencesmultidisciplinary CC BY 4.0

coding agents in the real world

dlants.me Mastodon Feb 28, 2025

0 inbound links en

Big Players in AI - dentro.de/ai

dentro.de/ai Jan 8, 2026

The Who is Who in AI Land.

0 inbound links website en

My experiences with AI and Claude Code in 2025

cmdcolin.github.io Dec 23, 2025

Astro description

0 inbound links en

Craft By Zen

craftbyzen.com Jeremy Wong May 3, 2026

Jeremy's personal website

0 inbound links website en

Designing agentic loops

Simon Willison’s Weblog Simon Willison Sep 30, 2025

Coding agents like Anthropic’s Claude Code and OpenAI’s Codex CLI represent a genuine step change in how useful LLMs can be for producing working code. These agents can now directly …

7 inbound links article en definitions 53ai 2023generative-ai 1790llms 1756ai-assisted-programming 383ai-agents 111coding-agents 202async-coding-agents 17

What skills does SWE-bench Verified evaluate?

Epoch AI Florian Brand; Jean-Stanislas Denain May 30, 2025

We take a deep dive into SWE-bench Verified, a prominent agentic coding benchmark. While one of the best public tests of AI coding agents, it is limited by its focus on simple bug fixes in familiar open-source repositories.

2 inbound links article en Capabilities

Claude Code

Simon Späti's Second Brain Simon Späti Feb 26, 2025

This is an Agentic Coding Tool that reasons and writes code.

0 inbound links article en

Designing agentic loops

Simon Willison’s Weblog Simon Willison Sep 30, 2025

Coding agents like Anthropic’s Claude Code and OpenAI’s Codex CLI represent a genuine step change in how useful LLMs can be for producing working code. These agents can now directly …

1 inbound link article en definitions 53ai 2023generative-ai 1790llms 1756ai-assisted-programming 383ai-agents 111coding-agents 202async-coding-agents 17

The Zen of Claude Code

vlad.build Sep 29, 2025

The Zen of Claude Code: How Simplicity Beat Complexity in AI Agents It's amazing how quickly the world of AI agents has changed, especially in the last co...

0 inbound links article en

I've moved this blog to Astro, and I like it here

The Data Quarry Prashanth Rao Feb 26, 2025

The story behind me migrating this blog site from Zola to Astro

0 inbound links article en

MCP: An Introduction to Agentic Op Support

TrustedSec Mar 27, 2025

1 inbound link website en

I’ll Never Build an Ugly Demo Again

tjvantoll.com TJ VanToll Mar 5, 2025

As a long-time developer advocate, one constant struggle of mine has been building good-looking demo applications. I have little-to-no design talent, and alt...

0 inbound links en

Blink, and the entire AI landscape could shift - Log - nibzard

Nibzard Nikola Balić May 20, 2025

AI dev tooling is consolidating--acquisitions, coding agents, and fierce competition reshape interfaces, pricing, and memory.

0 inbound links article en HUMANOPINION HUMANOPINION

AI the Docs 2025 conference

dufcrule.com Jul 12, 2025

0 inbound links en

Connect Claude to your own apps

Alexandru ROSIANU May 31, 2025

Group 1 I’ve always wanted my own assistant, like JARVIS from Iron Man. Every year we get closer to JARVIS being possible, but we’re not there yet. Let me...

0 inbound links article en

Claude Code is impressive

U-Zyn's Nodes U-Zyn Chua Feb 27, 2025

My workflow with generative AI has been changing quite rapidly over the past two weeks. DeepSeek's release really sped up innovation and release cycle among its competitors. I may draft a longer note

0 inbound links article en aianthropicclaude

Claude Code: First Impressions | Ann Catherine Jose

annjose.com Ann Catherine Jose Feb 24, 2025

Exploring Anthropic's new CLI-based agentic coding tool

0 inbound links article en

2025 year in review

kliu.io Kevin Liu Dec 26, 2025

One step closer to the irreducible loss of the data

0 inbound links en jekylljekyll-themeacademic-websiteportfolio-website

The Curve is Bending

Grant Slatton's Blog Apr 5, 2025

Predictions on near-term AI inference spending

0 inbound links article en

Claude Code is My Computer | Peter Steinberger

Peter Steinberger Peter Steinberger Jun 3, 2025

I run Claude Code with --dangerously-skip-permissions flag, giving it full system access. Let me show you a new way of approaching computers.

9 inbound links BlogPosting en AI AIClaudeComputingDevelopmentProductivityClaude-CodeDevOpsAutomation CC BY 4.0

It's official: 2026 is a weird year for tech and programmers

Andrew Montalenti Andrew Montalenti Feb 15, 2026

I've struggled to publish a 2026 essay that captures my headspace appropriately. It's not because of the blog chill, this time. It's because of AI/LLM tools.

0 inbound links article en Open Source

Learning the Bitter Lesson

rlancemartin.github.io Rlancemartin Jul 30, 2025

The Bitter Lesson in AI Engineering.

2 inbound links en

Field Notes From Shipping Real Code With Claude

diwank.space Jun 7, 2025

Note: This post comes with a NotebookLM podcast (1linked at the bottom), and three generated audio recordings. You can read the conversation I had

9 inbound links en

Andrew Montalenti - Code, Essays, Ideas

Andrew Montalenti Andrew Montalenti Apr 23, 2026

@amontalenti - Founder/CTO. Coding in: Python, JavaScript, Zig.

0 inbound links website en Technology

Deque Axe Assistant - First impressions

craigabbott.co.uk Craig Abbott May 10, 2025

Testing Deque's axe Assistant for accuracy and ability.

1 inbound link website en

Emergent Introspective Awareness in Large Language Models

transformer-circuits.pub Jack Lindsey Oct 29, 2025

7 inbound links en

How I wrote JustHTML using coding agents

friendlybit.com Emil Stenström Dec 3, 2025

I recently released JustHTML, a python-based HTML5 parser. It passes 100% of the html5lib test suite, has zero dependencies, and includes a CSS selector...

17 inbound links en

The Decline of the Software Drafter?

BenRCongdon Ben Congdon Dec 8, 2025

What remains if/when coding is ‘solved’

0 inbound links article en blog

The Coming Need for Formal Specification

BenRCongdon Ben Congdon Dec 12, 2025

The potential of formal verification for reasoning about systems.

0 inbound links article en blog

2025: The year in LLMs

Simon Willison’s Weblog Simon Willison Dec 31, 2025

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

25 inbound links article en ai 2014openai 418generative-ai 1785llms 1751anthropic 282gemini 185ai-agents 110pelican-riding-a-bicycle 113vibe-coding 90coding-agents 200ai-in-china 95conformance-suites 10