
The AI App Experience Matters More Than Benchmarks Now

macstories.net

I was catching up on different articles after the release of Claude Opus 4.5 earlier this week, and this part from Simon Willison’s blog post about it stood out to me: I’m not saying the new model isn’t an improvement on Sonnet 4.5—but I can’t say with confidence that the challenges I posed it were
