Did xAI Mislead About Grok 3’s Benchmarks? OpenAI Disputes Claims

TECHi 2025-02-23

Summary:

Debates over AI benchmarks have resurfaced following xAI’s recent claims about its latest model, Grok 3. An OpenAI employee publicly accused Elon Musk’s xAI of presenting misleading benchmark results, while xAI co-founder Igor Babushkin defended the company’s methodology. The controversy stems from a graph published by xAI showing Grok3 performance on AIME 2025, a benchmark […]

The post Did xAI Mislead About Grok 3’s Benchmarks? OpenAI Disputes Claims first appeared on TECHi and is written by Munazza Shaheen.

Link:

https://www.techi.com/xai-grok3-benchmarks-accuracy-dispute/

From feeds:

TECHi » TECHi

Authors:

Munazza Shaheen

Date tagged:

02/23/2025, 08:59

Date published:

02/23/2025, 03:59