Humans triumph over AI at annual math Olympiad, but the machines are catching up

Sydney — Humans beat generative AI models made by Google and OpenAI at a top international mathematics competition, but the programs reached gold-level scores for the first time, and the rate at which they are improving may be cause for some human introspection.

Neither of the AI models scored full marks — unlike five young people at the International Mathematical Olympiad (IMO), a prestigious annual competition where participants must be under 20 years old.

Google said Monday that an advanced version of its Gemini chatbot had solved five out of the six math problems set at the IMO, held in Australia’s Queensland this month.

“We can confirm that Google DeepMind has reached the much-desired milestone, earning 35 out of a possible 42 points – a gold medal score,” the U.S. tech giant cited IMO president Gregor Dolinar as saying. “Their solutions were astonishing in many respects. IMO graders found them to be clear, precise and most of them easy to follow.”

Around 10% of human contestants won gold-level medals, and five received perfect scores of 42 points.

OpenAI, the U.S. maker of ChatGPT, said its experimental reasoning model had also scored a gold-level 35 points on the test.

The result “achieved a longstanding grand challenge in AI” at “the world’s most prestigious math competition,” OpenAI researcher Alexander Wei said in a social media post.

“We evaluated our models on the 2025 IMO problems under the same rules as human contestants,” he said. “For each problem, three former IMO medalists independently graded the model’s submitted proof.”

Google achieved a silver-medal score at last year’s IMO in the city of Bath, in southwest England, solving four of the six problems.

That took two to three days of computation — far longer than this year, when its Gemini model solved the problems within the 4.5-hour time limit, it said.

The IMO said tech companies had “privately tested closed-source AI models on this year’s problems,” the same ones faced by 641 competing students from 112 countries.

“It is very exciting to see progress in the mathematical capabilities of AI models,” said IMO president Dolinar.

Contest organizers could not verify how much computing power had been used by the AI models or whether there had been human involvement, he noted.

In an interview with CBS’ 60 Minutes earlier this year, one of Google’s leading AI researchers predicted that within just five to 10 years, computers would be made that have human-level cognitive abilities — a landmark known as “artificial general intelligence.”

Google DeepMind CEO Demis Hassabis predicted that, thanks to an increase in investment, AI technology was on track within a decade to understand the world in nuanced ways, and not only to solve important problems but even to develop a sense of imagination.

“It’s moving incredibly fast,” Hassabis said. “I think we are on some kind of exponential curve of improvement. Of course, the success of the field in the last few years has attracted even more attention, more resources, more talent. So that’s adding to the, to this exponential progress.”
