GeistHaus

2025 LLM Year in Review

karpathy.bearblog.dev

2025 Year in Review of LLM paradigm changes

24 pages link to this URL
A quote from Andrej Karpathy

In 2025, Reinforcement Learning from Verifiable Rewards (RLVR) emerged as the de facto new major stage to add to this mix. By training LLMs against automatically verifiable rewards across a …

1 inbound link article en definitions 53, ai 2025, andrej-karpathy 42, generative-ai 1792, llms 1758, llm-reasoning 98, deepseek 33
2025: The year in LLMs

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

1 inbound link article en ai 2024, openai 419, generative-ai 1791, llms 1757, anthropic 282, gemini 185, ai-agents 111, pelican-riding-a-bicycle 113, vibe-coding 91, coding-agents 202, ai-in-china 95, conformance-suites 10
Stream

Blog posts, photos, and micro updates

0 inbound links website en
Links, December 26, 2025

2025 LLM Year in Review - Karpathy; Chemical Hygiene - Karpathy; A Comprehensive Introduction to AI for Proteins - Tamarind Bio; https://x.com/shelbynewsad/status/2003508957155844390; AI drug designer Insilico Medicine aims to generate $300M in Hong Kong IPO - FierceBiotech; The Virtual Cell Will Be More Like GWAS Than AlphaFold - Andrew Carroll; The ML drug discovery startup trying really, really hard to not cheat (Leash Bio) - Owl Posting; Leash: Machine learning for medicinal chemistry - Substack; Recent discoveries on the acquisition of the highest levels of human performance - Science; FrontierScience: Evaluating AI’s Ability to Perform Expert-Level Scientific Tasks - OpenAI; Evaluating Large Language Models in Scientific Discovery - arXiv; MONDE·T: A Database and Interactive Webserver for Non-Canonical Amino Acids (ncAAs) in the PDB - bioRxiv; The Breath of Life - The Atlantic (on Trikafta); Eleven clinical trials that will shape medicine in 2026 - Nature Medicine

0 inbound links article en posts
2025: The Year AI Became a Teammate - Log - nibzard

AI became a teammate in 2025. From startups back to academia, advisory, and a summer of full-time AI experimentation.

0 inbound links article en META YEAR-IN-REVIEW AI AGENTS REFLECTION STARTUPS CAREER
Vibecoding explained

https://karpathy.bearblog.dev/year-in-review-2025/ In this episode @karpathy blessed us all with another blog post. While his wording is much more careful and even nuanced, there is still a lot of bullshit in it. It is far less outrageous than the bullshit in the Friedman poocast and around that time, but still. Here are some excerpts: "With vibe coding, programming is not strictly reserved for highly trained professionals, it is something anyone can do. … But not only does vibe coding empower regular people to approach programming, it empowers trained professionals to write a lot more (vibe coded) software that would otherwise never be written."

0 inbound links article en articles AI LLM bullshit vibecoding
Five Articles I Read from 2025

From 100+ articles I read closely, I picked the following 5 as highlights of the year 2025: Find Your People - Jessica Livingston; How Jensen Works; How to...

0 inbound links article en
January quick-takes

Recap of my short posts on LinkedIn and elsewhere in January. Truly Open Source AI: OLMo 3 and Nemotron 3 Nano. The past few weeks have b...

0 inbound links article en
Simon Willison on definitions

53 posts tagged ‘definitions’. Attempts to assign meaning to words and phrases.

0 inbound links website en generative-ai 1785, ai 2016, llms 1751, ai-assisted-programming 381, ai-agents 110, vibe-coding 90, ai-ethics 301, coding-agents 201, prompt-engineering 190, security 602
Episode #291: Reassessing the LLM Landscape & Summoning Ghosts – The Real Python Podcast

What are the current techniques being employed to improve the performance of LLM-based systems? How is the industry shifting from post-training towards context engineering and multi-agent orchestration? This week on the show, Jodie Burchell, data scientist and Python Advocacy Team Lead at JetBrains, returns to discuss the current AI coding landscape.

0 inbound links article en
2025: The year in LLMs

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

25 inbound links article en ai 2014, openai 418, generative-ai 1785, llms 1751, anthropic 282, gemini 185, ai-agents 110, pelican-riding-a-bicycle 113, vibe-coding 90, coding-agents 200, ai-in-china 95, conformance-suites 10