Categories: The Verge

OpenAI gets caught vibe graphing

During its big GPT-5 livestream on Thursday, OpenAI showed off a few charts that made the model seem quite impressive — but if you look closely, some graphs were a little bit off.

In one, ironically showing how well GPT-5 does in “deception evals across models,” the scale is all over the place. For “coding deception,” for example, GPT-5 apparently gets a 50.0 percent deception rate, but that’s compared to OpenAI’s smaller 47.4 percent o3 score which somehow has a larger bar.

Or this one, where one of GPT-5’s scores is lower than o3’s but is shown with a bigger bar. In this same chart, o3 and GPT-4o’s scores are different but shown with equally-sized bars. That chart was bad enough that CEO Sam Altman commented on it, calling it a “mega chart screwup.” An OpenAI marketing staffer also apologized for the “unintentional chart crime.”

OpenAI didn’t immediately respond to a request for comment. And while it’s unclear if OpenAI used GPT-5 to actually make the charts, it’s still not a great look for the company on its big launch day — especially when it is touting the “significant advances in reducing hallucinations” with its new model.

GPT-5’s big new feature: less lying?

Here's the tricky part about assessing an AI system's deception rate: if the system is really good at deceiving you, you might not notice. This week on The Vergecast, Adi Robertson and Alex Heath join me to discuss the launch of GPT-5 and GPT-OSS… and the strange charts we saw…

August 8, 2025

In "The Verge"

OpenAI Launches GPT-5.4 Mini and Nano to Provide Answers 2X Faster

OpenAI has officially launched GPT-5.4 mini and GPT-5.4 nano, releasing its most capable small models designed to handle high-volume, latency-sensitive workloads. The new mini iteration offers a significant performance upgrade over the previous GPT-5 mini across reasoning, coding, tool use, and multimodal understanding, while running more than twice as fast.…

March 18, 2026

In "Cyber Security News"

OpenAI says the brand-new GPT-5.1 is ‘warmer’ and has more ‘personality’ options

OpenAI is releasing GPT-5.1 today, an update to the flagship model it released in August. OpenAI calls it an “upgrade” to GPT-5 that “makes ChatGPT smarter and more enjoyable to talk to.” The new models include GPT-5.1 Instant and GPT-5.1 Thinking. The former is “warmer, more intelligent, and better at…

November 12, 2025

In "The Verge"

rssfeeds-admin

Next Louisiana Medicaid to cover doula services for pregnant women »

Previous « The Browser Company’s AI browser now has a $20 subscription

Published by

rssfeeds-admin

10 months ago

Hackers Compromised 34 Packages in npm, PyPI, and Crates in New Supply Chain Attack

New TrapDoor supply chain campaign, an active attack deploying 34 malicious packages and over 384…

1 hour ago

Indiana News

Late pass sends Felix Rosenqvist past David Malukas for the closest Indianapolis 500 win in history

INDIANAPOLIS (AP) — Felix Rosenqvist swung to the outside of David Malukas, then found a…

5 hours ago

Indiana News

This website uses cookies.

OpenAI gets caught vibe graphing

Related

GPT-5’s big new feature: less lying?

OpenAI Launches GPT-5.4 Mini and Nano to Provide Answers 2X Faster

OpenAI says the brand-new GPT-5.1 is ‘warmer’ and has more ‘personality’ options

Recent Posts

Hackers Compromised 34 Packages in npm, PyPI, and Crates in New Supply Chain Attack

Late pass sends Felix Rosenqvist past David Malukas for the closest Indianapolis 500 win in history

Late pass sends Felix Rosenqvist past David Malukas for the closest Indianapolis 500 win in history

Nicolas Cage Says Christopher Nolan Won’t ‘Call Me Back’ After Turning Down Insomnia Role

Sebastian Stan Reveals He Plays “Many Roles” in The Batman: Part II

Idris Elba Says He Was Never in the Race to Play James Bond for New 007 Movie

Top Posts & Pages