Humans beat generative AI models made by Google and OpenAI at a top international mathematics competition, despite the programs reaching gold-level scores for the first time.
Neither model scored full marks, unlike five young people at the International Mathematical Olympiad (IMO), a prestigious annual competition where participants must be under 20 years of age.
Google said on Monday that an advanced version of its Gemini chatbot had solved five of the six maths problems set at the IMO, held in Queensland, Australia this month.
“We can confirm that Google DeepMind has reached the much-desired milestone, earning 35 out of a possible 42 points, a gold medal score,” the US tech giant quoted IMO president Gregor Dolinar as saying.
“Their solutions were astonishing in many respects. IMO graders found them to be clear, precise and most of them easy to follow.”
About 10 percent of human contestants won gold-level medals, and five achieved a perfect score of 42 points.
US ChatGPT maker OpenAI said that its experimental reasoning model had scored a gold-level 35 points on the test.
The result achieved “a longstanding grand challenge in AI,” OpenAI researcher Alexander Wei wrote on social media.
“We evaluated our models on the 2025 IMO problems under the same rules as human contestants,” he said.
“For each problem, three former IMO medalists independently graded the model’s submitted proof.”
Google achieved a silver-medal score at the previous year’s IMO in the British city of Bath, solving four of the six problems.
That took two to three days of computation, far longer than this year, when its Gemini model solved the problems within the 4.5-hour time limit, it said.
The IMO said that tech companies had “privately tested closed-source AI models on this year’s problems”, the same ones faced by 641 competing students from 112 countries.
IMO president Dolinar said, “It is very exciting to see progress in the mathematical capabilities of AI models.”
Competition organizers could not verify how much computing power the AI models had used or whether there had been human involvement, he cautioned.


