GeistHaus
log in · sign up

2025: The year in LLMs

simonwillison.net

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

24 pages link to this URL
Stream

Blog posts, photos, and micro updates

0 inbound links website en
2025年我的AI Coding使用评测

2025年里ai coding有一波非常大的能力增长,随着模型能力/agent工具/IDE整个上下游能力的变化,我的使用策略也在不断调整,所以写一篇blog记录下今年我的工具箱变化。视角很不专业,纯粹是一个使用者角度体验(belike 使用评测)。

0 inbound links article zh AI CodingVibe Coding
MCP and Skills: Why Not Both?

The community keeps framing MCP and Skills as competitors. They're not. They solve completely different problems.

0 inbound links article en Posts agentsmcpskillstools
2025: The Year AI Became a Teammate - Log - nibzard

AI became a teammate in 2025. From startups back to academia, advisory, and a summer of full-time AI experimentation.

0 inbound links article en METAYEAR-IN-REVIEWAIAGENTSREFLECTIONSTARTUPSCAREER METAYEAR-IN-REVIEWAIAGENTSREFLECTIONSTARTUPSCAREER
AI in Mineral Exploration: 2025 in Review

2025 was a blockbuster year for both AI and critical minerals! Pretty much every other week either we got news of a new discovery or governmental initiative, or an AI company releasing a new model …

0 inbound links article en
CCS: Search for Claude Code conversations

I’ve accumulated hundreds of Claude Code sessions and kept losing track of where I solved specific problems. The built-in /resume shows recent sessions, but anything older than a few days? Gone. So I built CCS (Claude Code Search) to fix that. TLDR Link to heading brew install agentic-utils/tap/ccs ccs Type to search, Enter to resume. That’s it. The problem Link to heading Claude Code stores conversations in ~/.claude/projects/ as JSONL files. Each session gets its own file:

0 inbound links article en posts ClaudeGoFzf
2025 in review: Going through the middle passage | The Man in the Arena

As I start writing this year in review, the first thing that comes to my mind is this holiday memory I have (and many other nerds have 🤣) from my youth: Going to see Lord of the Rings in theatres.

0 inbound links article en Personal Mental fitnessMidlifeTransparencyYear in review CC BY-NC-SA 4.0
2025 / SimonW Summary | John Maeda’s Blog

Source: https://simonwillison.net/2025/Dec/31/the-year-in-llms/ 2025 in LLMs (Simon Willison) — clustered 1) Capability shift: “reasoning” becomes the new default RLVR/inference-scaling drives 2025 progress: not necessarily bigger base models, but longer RL runs and “reasoning modes/dials.” The practical unlock isn’t puzzles—it’s tool-using planning, especially for research/search and debugging gnarly code. 2) Agents become…

0 inbound links article en