Found 40 bookmarks
Custom sorting
Introducing Claude Sonnet 4.5 \ Anthropic
Introducing Claude Sonnet 4.5 \ Anthropic
"Claude Sonnet 4.5 is state-of-the-art on the SWE-bench Verified evaluation, which measures real-world software coding abilities. Practically speaking, we’ve observed it maintaining focus for more than 30 hours on complex, multi-step tasks." (Next task: find managers with attention spans to match.)
·anthropic.com·
Introducing Claude Sonnet 4.5 \ Anthropic