Home SWE-bench score

SWE-bench score

OpenAI is reshaped and surpassed by Claude Opus 4’s seven hours...

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Anthropic released Claude Opus 4 and...
en_USEnglish