AI is actually bad at math, ORCA shows

Theregister | 18-11-2025 10:22am |

ORCA benchmark trips up ChatGPT-5, Gemini 2.5 Flash, Claude Sonnet 4.5, Grok 4, and DeepSeek V3.2 In the world of George Orwell's 1984, two and two make five. And large language models are not much better at math....

Stay Updated with the Latest News!

Don't miss out on breaking stories and in-depth articles.