{"id":1908,"date":"2025-06-07T19:14:09","date_gmt":"2025-06-07T19:14:09","guid":{"rendered":"https:\/\/violethoward.com\/new\/google-claims-gemini-2-5-pro-preview-beats-deepseek-r1-and-grok-3-beta-in-coding-performance\/"},"modified":"2025-06-07T19:14:09","modified_gmt":"2025-06-07T19:14:09","slug":"google-claims-gemini-2-5-pro-preview-beats-deepseek-r1-and-grok-3-beta-in-coding-performance","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/google-claims-gemini-2-5-pro-preview-beats-deepseek-r1-and-grok-3-beta-in-coding-performance\/","title":{"rendered":"Google claims Gemini 2.5 Pro preview beats DeepSeek R1 and Grok 3 Beta in coding performance"},"content":{"rendered":" \r\n
\n\t\t\t\t
\n

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy.\u00a0Learn more<\/em><\/p>\n\n\n\n


\n<\/div>

Google has released an updated preview of\u200b\u200b Gemini 2.5 Pro, its \u201cmost intelligent\u201d model, first announced in March and upgraded in May, as a preview, intending to release the same model to general availability in a couple of weeks.\u00a0<\/p>\n\n\n\n

Enterprises can test building new applications or replace earlier versions with an updated version of the \u201cI\/O edition\u201d of Gemini 2.5 Pro that, according to a blog post by Google, is more creative in its responses and outperforms other models in coding and reasoning.\u00a0<\/p>\n\n\n\n

\n

Our latest Gemini 2.5 Pro update is now in preview.<\/p>

It\u2019s better at coding, reasoning, science + math, shows improved performance across key benchmarks (AIDER Polyglot, GPQA, HLE to name a few), and leads @lmarena_ai<\/a> with a 24pt Elo score jump since the previous version.<\/p>

We also\u2026 pic.twitter.com\/SVjdQ2k1tJ<\/a><\/p>\u2014 Sundar Pichai (@sundarpichai) June 5, 2025<\/a><\/blockquote>\n<\/div><\/figure>\n\n\n\n

During its annual I\/O developer conference in May, Google announced that it updated Gemini 2.5 Pro to be better than its earlier iteration, which it quietly released. Google DeepMind CEO Demis Hassabis said the I\/O edition is the company\u2019s best coding model yet.\u00a0<\/p>\n\n\n\n

But this new preview, called Gemini 2.5 Pro Preview 06-05 Thinking, is even better than the I\/O edition. The stable version Google plans to release publicly is \u201cready for enterprise-scale capabilities.\u201d<\/p>\n\n\n\n

The I\/O edition, or gemini-2.5-pro-preview-05-06, was first made available to developers and enterprises in May through Google AI Studio and Vertex AI. Gemini 2.5 Pro Preview 06-05 Thinking can be accessed via the same platforms.\u00a0<\/p>\n\n\n\n

Performance metrics<\/h2>\n\n\n\n

This new version of Gemini 2.5 Pro performs even better than the first release.\u00a0<\/p>\n\n\n\n

Google said the new version of Gemini 2.5 Pro improved by 24 points in LMArena and by 35 points in WebDevArena, where it currently tops the leaderboard. The company\u2019s benchmark tests showed that the model outscored competitors like OpenAI\u2019s o3, o3-mini, and o4-mini, Anthropic\u2019s Claude 4 Opus, Grok 3 Beta from xAI and DeepSeek R1.\u00a0<\/p>\n\n\n\n

\u201cWe\u2019ve also addressed feedback from our previous 2.5 Pro releases, improving its style and structure \u2014 it can be more creative with better-formatted responses,\u201d Google said in the blog post.\u00a0<\/p>\n\n\n\n

\"\"\/<\/figure>\n\n\n\n

What enterprises can expect<\/h2>\n\n\n\n

Google\u2019s continuous improvement of Gemini 2.5 Pro might be confusing for many, but Google previously framed these as a response to community feedback. Pricing for the new version is $1.25 per million tokens without caching for inputs and $10 for the output price.\u00a0<\/p>\n\n\n\n

When the very first version of Gemini 2.5 Pro launched in March, VentureBeat\u2019s Matt Marshall called it \u201cthe smartest model you\u2019re not using.\u201d Since then, Google has integrated the model into many of its new applications and services, including \u201cDeep Think,\u201d where Gemini considers multiple hypotheses before responding.\u00a0<\/p>\n\n\n\n

The release of Gemini 2.5 Pro, and its two upgraded versions, revived Google\u2019s place in the large language model space after competitors like DeepSeek and OpenAI diverted the industry\u2019s attention to their reasoning models.\u00a0<\/p>\n\n\n\n

In just a few hours of announcing the updated Gemini 2.5 Pro, developers have already begun playing around with it. While many found the update to live up to Google\u2019s promise of being faster, the jury is still out if this latest Gemini 2.5 Pro does actually perform better.\u00a0<\/p>\n\n\n\n

\n

First hour with “Gemini 2.5 Pro Preview 06-05”<\/p>

Positives:<\/p>

\u2013 It’s faster\u2013 It produces more output\u2013 It has a better macro play (multi file edits, better overview)\u2013 Output structure is better (readable)\u2013 It’s more concise and LESS APOLOGETIC!!<\/p>

Before: “You are absolutely\u2026<\/p>\u2014 Patrick Bade (@nishffx) June 5, 2025<\/a><\/blockquote>\n<\/div><\/figure>\n\n\n\n

\n

you guys cooked, really enjoying the app builder. <\/p>

made a game and tested it out, it was using imagen to build assets on the fly ? and it’s up, hosted, easy to share. Really the best no-experience no-code builder yet.<\/p>

keep building out the vibe app marketplace, this could\u2026<\/p>\u2014 bone (@boneGPT) June 5, 2025<\/a><\/blockquote>\n<\/div><\/figure>\n\n\n\n

\n

Gemini 2.5 Pro Preview is pretty good.. used it yesterday for deep research and the results are better than some of the big names..<\/p>\u2014 Janak (@janaks09) June 5, 2025<\/a><\/blockquote>\n<\/div><\/figure>\n\n\n\n\n