GeistHaus
log in · sign up

Eval awareness in Claude Opus 4.6’s BrowseComp performance

anthropic.com

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

2 pages link to this URL