In 2025, Reinforcement Learning from Verifiable Rewards (RLVR) emerged as the de facto new major stage to add to this mix. By training LLMs against automatically verifiable rewards across a …
This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …
AI coding assistants are delivering real productivity gains and are rapidly becoming standard developer tooling. However, we’re increasingly seeing organizations measure success using superficial indicators [...]
Codebase cognitive debt is the growing gap between a system’s implementation and a team’s shared understanding of how and why it works. As AI increases [...]
Blog posts, photos, and micro updates
Key Points: The public perception of Large Language Models (LLMs) is shifting from "superintelligence" to standard software infrastructure. While the novel...
From Vibe Coding to Vibe Engineering
2025 LLM Year in Review - Karpathy; Chemical Hygiene - Karpathy; A Comprehensive Introduction to AI for Proteins - Tamarind Bio; https://x.com/shelbynewsad/status/2003508957155844390; AI drug designer Insilico Medicine aims to generate $300M in Hong Kong IPO - FierceBiotech; The Virtual Cell Will Be More Like GWAS Than AlphaFold - Andrew Carroll; The ML drug discovery startup trying really, really hard to not cheat (Leash Bio) - Owl Posting; Leash: Machine learning for medicinal chemistry - Substack; Recent discoveries on the acquisition of the highest levels of human performance - Science; FrontierScience: Evaluating AI’s Ability to Perform Expert-Level Scientific Tasks - OpenAI; Evaluating Large Language Models in Scientific Discovery - arXiv; MONDE·T: A Database and Interactive Webserver for Non-Canonical Amino Acids (ncAAs) in the PDB - bioRxiv; The Breath of Life - The Atlantic (on Trikafta); Eleven clinical trials that will shape medicine in 2026 - Nature Medicine
Random thoughts | about
Reflect on what I'm thinking and doing in this LLM era
This Technology Radar quadrant explores the techniques being used to develop and deliver software
AI became a teammate in 2025. From startups back to academia, advisory, and a summer of full-time AI experimentation.
https://karpathy.bearblog.dev/year-in-review-2025/ In this episode @karpathy blessed us all with another blogpost. While his wording is much more careful and even nuanced, there is still a lot of bullshit in it. It is way less outrageous bullshit than in the Friedman poocast and around that time, but still. Here are some excerpts: With vibe coding, programming is not strictly reserved for highly trained professionals, it is something anyone can do. … But not only does vibe coding empower regular people to approach programming, it empowers trained professionals to write a lot more (vibe coded) software that would otherwise never be written.
From 100+ articles I read closely, I picked the following 5 as highlights of the year 2025. Find Your People - Jessica Livingston How Jensen Works How to...
The personal blog of Daniel Nitsikopoulos, software engineer from Canberra, ACT
Recap of my short posts on LinkedIn and elsewhere in January Truly Open Source AI: OLMo 3 and Nemotron 3 Nano The past few weeks have b...
Why Yann LeCun thinks a cat might be smarter than ChatGPT.
53 posts tagged ‘definitions’. Attempts to assign meaning to words and phrases.
What are the current techniques being employed to improve the performance of LLM-based systems? How is the industry shifting from post-training towards context engineering and multi-agent orchestration? This week on the show, Jodie Burchell, data scientist and Python Advocacy Team Lead at JetBrains, returns to discuss the current AI coding landscape.
9 vulnerable MCP servers to learn how to pen test AI agent infra, a knowledge base of 65+ AWS IAM privilege escalation paths, Jason Haddix's open-source classification system for LLM prompt injection attacks
Engineer’s Codex is a newsletter about real-world engineering.