AI benchmarks are mostly bogus, study finds
New research finds that most AI benchmarks lack scientific rigor, and that companies such as OpenAI rely on questionable testing methods to claim superiority. About half of the benchmarks measure vague concepts without clear definitions, which undermines claims about AI progress.