{"id":2681,"date":"2025-07-22T07:27:07","date_gmt":"2025-07-22T07:27:07","guid":{"rendered":"https:\/\/violethoward.com\/new\/google-deepmind-makes-ai-history-with-gold-medal-win-at-worlds-toughest-math-competition\/"},"modified":"2025-07-22T07:27:07","modified_gmt":"2025-07-22T07:27:07","slug":"google-deepmind-makes-ai-history-with-gold-medal-win-at-worlds-toughest-math-competition","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/google-deepmind-makes-ai-history-with-gold-medal-win-at-worlds-toughest-math-competition\/","title":{"rendered":"Google DeepMind makes AI history with gold medal win at world&#8217;s toughest math competition"},"content":{"rendered":" \r\n<br><div>\n\t\t\t\t<div id=\"boilerplate_2682874\" class=\"post-boilerplate boilerplate-before\">\n<p><em>Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders.<\/em> <em>Subscribe Now<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n<\/div><p>Google DeepMind announced Monday that an advanced version of its Gemini artificial intelligence model has officially achieved gold medal-level performance at the International Mathematical Olympiad, solving five of six exceptionally difficult problems and earning recognition as the first AI system to receive official gold-level grading from competition organizers.<\/p>\n\n\n\n<p>The victory advances the field of AI reasoning and puts Google ahead in the intensifying battle between tech giants building next-generation artificial intelligence. More importantly, it demonstrates that AI can now tackle complex mathematical problems using natural language understanding rather than requiring specialized programming languages.<\/p>\n\n\n\n<p>\u201cOfficial results are in \u2014 Gemini achieved gold-medal level in the International Mathematical Olympiad!\u201d Demis Hassabis, CEO of Google DeepMind, wrote on social media platform X Monday morning. \u201cAn advanced version was able to solve 5 out of 6 problems. Incredible progress.\u201d<\/p>\n\n\n\n<blockquote class=\"twitter-tweet\"><p lang=\"en\" dir=\"ltr\">Official results are in \u2013 Gemini achieved gold-medal level in the International Mathematical Olympiad! ? An advanced version was able to solve 5 out of 6 problems. Incredible progress \u2013 huge congrats to <a href=\"https:\/\/twitter.com\/lmthang?ref_src=twsrc%5Etfw\">@lmthang<\/a> and the team! https:\/\/t.co\/pp9bXF7rVj<\/p>\u2014 Demis Hassabis (@demishassabis) <a href=\"https:\/\/twitter.com\/demishassabis\/status\/1947337615054671882?ref_src=twsrc%5Etfw\">July 21, 2025<\/a><\/blockquote> \n\n\n\n<p>The International Mathematical Olympiad, held annually since 1959, is widely considered the world\u2019s most prestigious mathematics competition for pre-university students. Each participating country sends six elite young mathematicians to compete in solving six exceptionally challenging problems spanning algebra, combinatorics, geometry, and number theory. Only about 8% of human participants typically earn gold medals.<\/p>\n\n\n\n<div id=\"boilerplate_2803147\" class=\"post-boilerplate boilerplate-speedbump\">\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>The AI Impact Series Returns to San Francisco &#8211; August 5<\/strong><\/p>\n\n\n\n<p>The next phase of AI is here &#8211; are you ready? Join leaders from Block, GSK, and SAP for an exclusive look at how autonomous agents are reshaping enterprise workflows &#8211; from real-time decision-making to end-to-end automation.<\/p>\n\n\n\n<p>Secure your spot now &#8211; space is limited: https:\/\/bit.ly\/3GuuPLF<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n<\/div><h2 class=\"wp-block-heading\" id=\"h-how-google-deepmind-s-gemini-deep-think-cracked-math-s-toughest-problems\">How Google DeepMind\u2019s Gemini Deep Think cracked math\u2019s toughest problems<\/h2>\n\n\n\n<p>Google\u2019s latest success far exceeds its 2024 performance, when the company\u2019s combined AlphaProof and AlphaGeometry systems earned silver medal status by solving four of six problems. That earlier system required human experts to first translate natural language problems into domain-specific programming languages and then interpret the AI\u2019s mathematical output.<\/p>\n\n\n\n<p>This year\u2019s breakthrough came through Gemini Deep Think, an enhanced reasoning system that employs what researchers call \u201cparallel thinking.\u201d Unlike traditional AI models that follow a single chain of reasoning, Deep Think simultaneously explores multiple possible solutions before arriving at a final answer.<\/p>\n\n\n\n<p>\u201cOur model operated end-to-end in natural language, producing rigorous mathematical proofs directly from the official problem descriptions,\u201d Hassabis explained in a follow-up post on the social media site X, emphasizing that the system completed its work within the competition\u2019s standard 4.5-hour time limit.<\/p>\n\n\n\n<blockquote class=\"twitter-tweet\"><p lang=\"en\" dir=\"ltr\">We achieved this year\u2019s impressive result using an advanced version of Gemini Deep Think (an enhanced reasoning mode for complex problems). Our model operated end-to-end in natural language, producing rigorous mathematical proofs directly from the official problem descriptions \u2013\u2026<\/p>\u2014 Demis Hassabis (@demishassabis) <a href=\"https:\/\/twitter.com\/demishassabis\/status\/1947337617231778168?ref_src=twsrc%5Etfw\">July 21, 2025<\/a><\/blockquote> \n\n\n\n<p>The model achieved 35 out of a possible 42 points, comfortably exceeding the gold medal threshold. According to IMO President Prof. Dr. Gregor Dolinar, the solutions were \u201castonishing in many respects\u201d and found to be \u201cclear, precise and most of them easy to follow\u201d by competition graders.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-openai-faces-backlash-for-bypassing-official-competition-rules\">OpenAI faces backlash for bypassing official competition rules<\/h2>\n\n\n\n<p>The announcement comes amid growing tension in the AI industry over competitive practices and transparency. Google DeepMind\u2019s measured approach to releasing its results has drawn praise from the AI community, particularly in contrast to rival OpenAI\u2019s handling of similar achievements.<\/p>\n\n\n\n<p>\u201cWe didn\u2019t announce on Friday because we respected the IMO Board\u2019s original request that all AI labs share their results only after the official results had been verified by independent experts &amp; the students had rightly received the acclamation they deserved,\u201d Hassabis wrote, appearing to reference OpenAI\u2019s earlier announcement of its own olympiad performance.<\/p>\n\n\n\n<blockquote class=\"twitter-tweet\"><p lang=\"en\" dir=\"ltr\">Btw as an aside, we didn\u2019t announce on Friday because we respected the IMO Board&#8217;s original request that all AI labs share their results only after the official results had been verified by independent experts &amp; the students had rightly received the acclamation they deserved<\/p>\u2014 Demis Hassabis (@demishassabis) <a href=\"https:\/\/twitter.com\/demishassabis\/status\/1947337618787615175?ref_src=twsrc%5Etfw\">July 21, 2025<\/a><\/blockquote> \n\n\n\n<p>Social media users were quick to note the distinction. \u201cYou see? OpenAI ignored the IMO request. Shame. No class. Straight up disrespect,\u201d wrote one user. \u201cGoogle DeepMind acted with integrity, aligned with humanity.\u201d<\/p>\n\n\n\n<p>The criticism stems from OpenAI\u2019s decision to announce its own mathematical olympiad results without participating in the official IMO evaluation process. Instead, OpenAI had a panel of former IMO participants grade its AI\u2019s performance, a approach that some in the community view as lacking credibility.<\/p>\n\n\n\n<p>\u201cOpenAI is quite possibly the worst company on the planet right now,\u201d wrote one critic, while others suggested the company needs to \u201ctake things seriously\u201d and \u201cbe more credible.\u201d<\/p>\n\n\n\n<blockquote class=\"twitter-tweet\"><p lang=\"en\" dir=\"ltr\">You see?<\/p><p>OpenAI ignored the IMO request. Shame. No class. Straight up disrespect. <\/p><p>Google DeepMind acted with integrity, aligned with humanity. <\/p><p>TRVTHNUKE <a href=\"https:\/\/t.co\/8LAOak6XUE\">pic.twitter.com\/8LAOak6XUE<\/a><\/p>\u2014 NIK (@ns123abc) <a href=\"https:\/\/twitter.com\/ns123abc\/status\/1947347617131680232?ref_src=twsrc%5Etfw\">July 21, 2025<\/a><\/blockquote> \n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-inside-the-training-methods-that-powered-gemini-s-mathematical-mastery\">Inside the training methods that powered Gemini\u2019s mathematical mastery<\/h2>\n\n\n\n<p>Google DeepMind\u2019s success appears to stem from novel training techniques that go beyond traditional approaches. The team used advanced reinforcement learning methods designed to leverage multi-step reasoning, problem-solving, and theorem-proving data. The model was also provided access to a curated collection of high-quality mathematical solutions and received specific guidance on approaching IMO-style problems.<\/p>\n\n\n\n<p>The technical achievement impressed AI researchers who noted its broader implications. \u201cNot just solving math\u2026 but understanding language-described problems and applying abstract logic to novel cases,\u201d wrote AI observer Elyss Wren. \u201cThis isn\u2019t rote memory \u2014 this is emergent cognition in motion.\u201d<\/p>\n\n\n\n<p>Ethan Mollick, a professor at the Wharton School who studies AI, emphasized the significance of using a general-purpose model rather than specialized tools. \u201cIncreasing evidence of the ability of LLMs to generalize to novel problem solving,\u201d he wrote, highlighting how this differs from previous approaches that required specialized mathematical software.<\/p>\n\n\n\n<blockquote class=\"twitter-tweet\"><p lang=\"en\" dir=\"ltr\">It wasn&#8217;t just OpenAI.<\/p><p>Google also used a general purpose model to solve the very hard math problems of the International Math Olympiad in plain language. Last year they used specialized tool use<\/p><p>Increasing evidence of the ability of LLMs to generalize to novel problem solving https:\/\/t.co\/Ve72fFmx2b<\/p>\u2014 Ethan Mollick (@emollick) <a href=\"https:\/\/twitter.com\/emollick\/status\/1947356382581137867?ref_src=twsrc%5Etfw\">July 21, 2025<\/a><\/blockquote> \n\n\n\n<p>The model demonstrated particularly impressive reasoning in one problem where many human competitors applied graduate-level mathematical concepts. According to DeepMind researcher Junehyuk Jung, Gemini \u201cmade a brilliant observation and used only elementary number theory to create a self-contained proof,\u201d finding a more elegant solution than many human participants.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-what-google-deepmind-s-victory-means-for-the-200-billion-ai-race\">What Google DeepMind\u2019s victory means for the $200 billion AI race<\/h2>\n\n\n\n<p>The breakthrough comes at a critical moment in the AI industry, where companies are racing to demonstrate superior reasoning capabilities. The success has immediate practical implications: Google plans to make a version of this Deep Think model available to mathematicians for testing before rolling it out to Google AI Ultra subscribers, who pay $250 monthly for access to the company\u2019s most advanced AI models.<\/p>\n\n\n\n<p>The timing also highlights the intensifying competition between major AI laboratories. While Google celebrated its methodical, officially-verified approach, the controversy surrounding OpenAI\u2019s announcement reflects broader tensions about transparency and credibility in AI development.<\/p>\n\n\n\n<p>This competitive dynamic extends beyond just mathematical reasoning. Recent weeks have seen various AI companies announce breakthrough capabilities, though not all have been received positively. Elon Musk\u2019s xAI recently launched Grok 4, which the company claimed was the \u201csmartest AI in the world,\u201d though leaderboard scores showed it trailing behind models from Google and OpenAI. Additionally, Grok has faced criticism for controversial features including sexualized AI companions and episodes of generating antisemitic content.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-the-dawn-of-ai-that-thinks-like-humans-with-real-world-consequences\">The dawn of AI that thinks like humans\u2014with real-world consequences<\/h2>\n\n\n\n<p>The mathematical olympiad victory goes beyond competitive bragging rights. Gemini\u2019s performance demonstrates that AI systems can now match human-level reasoning in complex tasks requiring creativity, abstract thinking, and the ability to synthesize insights across multiple domains.<\/p>\n\n\n\n<p>\u201cThis is a significant advance over last year\u2019s breakthrough result,\u201d the DeepMind team noted in their technical announcement. The progression from requiring specialized formal languages to operating entirely in natural language suggests that AI systems are becoming more intuitive and accessible.<\/p>\n\n\n\n<p>For businesses, this development signals that AI may soon tackle complex analytical problems across various industries without requiring specialized programming or domain expertise. The ability to reason through intricate challenges using everyday language could democratize sophisticated analytical capabilities across organizations.<\/p>\n\n\n\n<p>However, questions persist about whether these reasoning capabilities will translate effectively to messier real-world challenges. The mathematical olympiad provides well-defined problems with clear success criteria \u2014 a far cry from the ambiguous, multifaceted decisions that define most business and scientific endeavors.<\/p>\n\n\n\n<p>Google DeepMind plans to return to next year\u2019s competition \u201cin search of a perfect score.\u201d The company believes AI systems combining natural language fluency with rigorous reasoning \u201cwill become invaluable tools for mathematicians, scientists, engineers, and researchers, helping us advance human knowledge on the path to AGI.\u201d<\/p>\n\n\n\n<p>But perhaps the most telling detail emerged from the competition itself: when faced with the contest\u2019s most difficult problem, Gemini started from an incorrect hypothesis and never recovered. Only five human students solved that problem correctly. In the end, it seems, even gold medal-winning AI still has something to learn from teenage mathematicians.<\/p>\n<div id=\"boilerplate_2660155\" class=\"post-boilerplate boilerplate-after\"><div class=\"Boilerplate__newsletter-container vb\">\n<div class=\"Boilerplate__newsletter-main\">\n<p><strong>Daily insights on business use cases with VB Daily<\/strong><\/p>\n<p class=\"copy\">If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.<\/p>\n<p class=\"Form__newsletter-legal\">Read our Privacy Policy<\/p>\n<p class=\"Form__success\" id=\"boilerplateNewsletterConfirmation\">\n\t\t\t\t\tThanks for subscribing. Check out more VB newsletters here.\n\t\t\t\t<\/p>\n<p class=\"Form__error\">An error occured.<\/p>\n<\/p><\/div>\n<div class=\"image-container\">\n\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/venturebeat.com\/wp-content\/themes\/vb-news\/brand\/img\/vb-daily-phone.png\" alt=\"\"\/>\n\t\t\t\t<\/div>\n<\/p><\/div>\n<\/div>\t\t\t<\/div><template id="Ivn9GJic03Fia2nybFGf"></template><\/script>\r\n<br>\r\n<br><a href=\"https:\/\/venturebeat.com\/ai\/google-deepmind-makes-ai-history-with-gold-medal-win-at-worlds-toughest-math-competition\/\">Source link <\/a>","protected":false},"excerpt":{"rendered":"<p>Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Google DeepMind announced Monday that an advanced version of its Gemini artificial intelligence model has officially achieved gold medal-level performance at the International Mathematical Olympiad, solving five of six [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":2682,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[33],"tags":[],"class_list":["post-2681","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-automation"],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/violethoward.com\/new\/wp-content\/uploads\/2025\/07\/nuneybits_Vector_art_of_robot_winning_medal_5be7ef30-62b2-4f25-bd3c-9480201df4b7.webp.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/2681","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/comments?post=2681"}],"version-history":[{"count":0,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/2681\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media\/2682"}],"wp:attachment":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media?parent=2681"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/categories?post=2681"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/tags?post=2681"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69e302c146fa5c92dc28ac12. Config Timestamp: 2026-04-18 04:04:16 UTC, Cached Timestamp: 2026-04-29 15:32:22 UTC -->