Anthropic's new Claude Opus 4.5 model achieved 80.9% on SWE-bench and scored higher than human candidates on a performance ...
On Tuesday, French AI startup Mistral AI released Devstral 2, a 123 billion parameter open-weights coding model designed to ...