Driven by new technology called OpenAI o1, the chatbot can test various strategies and try to identify mistakes as it tackles complex tasks.
Online chatbots like ChatGPT from OpenAI and Gemini from Google sometimes struggle with simple math problems. The computer code they generate is often buggy and incomplete. From time to time, they even make stuff up.
On Thursday, OpenAI unveiled a new version of ChatGPT that could alleviate these flaws. The company said the chatbot, underpinned by new artificial intelligence technology called OpenAI o1, could “reason” through tasks involving math, coding and science.
“With previous models like ChatGPT, you ask them a question and they immediately start responding,” said Jakub Pachocki, OpenAI’s chief scientist. “This model can take its time. It can think through the problem — in English — and try to break it down and look for angles in an effort to provide the best answer.”
In a demonstration for The New York Times, Dr. Pachocki and Szymon Sidor, an OpenAI technical fellow, showed the chatbot solving an acrostic, a kind of word puzzle that is significantly more complex than an ordinary crossword puzzle. The chatbot also answered a Ph.D.-level chemistry question and diagnosed an illness based on a detailed report about a patient’s symptoms and history.
The new technology is part of a wider effort to build A.I. that can reason through complex tasks. Companies like Google and Meta are building similar technologies, while Microsoft and its subsidiary GitHub are working to incorporate OpenAI’s new system into their products.