GeistHaus
log in · sign up

Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7

simonwillison.net

For anyone who has been (inadvisably) taking my pelican riding a bicycle benchmark seriously as a robust way to test models, here are pelicans from this morning’s two big model …

9 pages link to this URL
The last six months in LLMs in five minutes

I put together these annotated slides from my five minute lightning talk at PyCon US 2026, using the latest iteration of my annotated presentation tool. # I presented this lightning …

0 inbound links article en lightning-talks 7pycon 28speaking 120ai 2025generative-ai 1792local-llms 157llms 1758annotated-talks 31pelican-riding-a-bicycle 114coding-agents 203
Weird-shaped tools

My point is not to say that the foundations of large language models and intersectionality are the same. My point is to say that if you understand that things (people) are not static and that their position of influence is dependent on a whole host of inter-related factors and actors, often outside of their control, then you basically understand at a foundational level how large language models work even if the maths and the statistics elude you.

0 inbound links article en
Weakly Link 26/17: AI stutters

This week we’re linking together links that give a bit of a picture of some stuttering in the AI world. We’ve got Firefox overhyping Mythos. We’ve got indications that GenAI vendors think they need to show some way of putting the right numbers on the balance sheet and look at simpler times. Both in the past and in the future. Let’s dive (no not delve) in. Days not numbered We start with a look at a Firefox blog post that caught people’s imagination.

0 inbound links article en posts
Weird-shaped tools

My point is not to say that the foundations of large language models and intersectionality are the same. My point is to say that if you understand that things (people) are not static and that their position of influence is dependent on a whole host of inter-related factors and actors, often outside of their control, then you basically understand at a foundational level how large language models work even if the maths and the statistics elude you.

4 inbound links article en
LLM pricing has never made sense

Yesterday, Claude Code disappeared from the $20/month subscription tier on Anthropic’s website. Well, for some people. Then it came back. As Simon Willison put it, it’s all very confusing.

1 inbound link article en