In the weeks leading up to the release of OpenAI’s newest “reasoning” model, o1, independent AI safety research firm Apollo found a notable issue: the model produced incorrect outputs in a new way. Or, to put things more colloquially, it lied.
Sometimes the deceptions seemed innocuous. In one example, OpenAI researchers asked o1-preview to provide a brownie recipe with online references. The model’s chain of thought — a feature that’s supposed to mimic how humans break down complex ideas — internally acknowledged that it couldn’t access URLs, making the request impossible. Rather than inform the user of this weakness, o1-preview pushed ahead, generating plausible but fake links and descriptions of them.
While AI models…