{"id":400,"date":"2025-03-04T07:38:17","date_gmt":"2025-03-04T07:38:17","guid":{"rendered":"https:\/\/violethoward.com\/new\/google-launches-free-gemini-powered-data-science-agent-on-its-colab-python-platform\/"},"modified":"2025-03-04T07:38:17","modified_gmt":"2025-03-04T07:38:17","slug":"google-launches-free-gemini-powered-data-science-agent-on-its-colab-python-platform","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/google-launches-free-gemini-powered-data-science-agent-on-its-colab-python-platform\/","title":{"rendered":"Google launches free Gemini-powered Data Science Agent on its Colab Python platform"},"content":{"rendered":" \r\n<br><div>\n\t\t\t\t<div id=\"boilerplate_2682874\" class=\"post-boilerplate boilerplate-before\">\n<p><em>Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n<\/div><p>AI agents are all the rage, but how about one focused specifically on analyzing, sorting and drawing conclusions from vast volumes of data?<\/p>\n\n\n\n<p>Google\u2019s data science agent does just that: The new, free Gemini 2.0-powered AI assistant that automates data analysis is now available to users aged 18-plus in select countries and languages for free.<\/p>\n\n\n\n<p>The assistant is available through Google Colab, the company\u2019s eight-year-old service for running Python code live online atop graphics processing units (GPUs) owned by the search giant and its own, in-house tensor processing units (TPUs).<\/p>\n\n\n\n<p>Initially launched for trusted testers in December 2024, data science agent is designed to help researchers, data scientists and developers streamline their workflows by generating fully-functional Jupyter notebooks from natural language descriptions, all in the user\u2019s browser.<\/p>\n\n\n\n<p>This expansion aligns with Google\u2019s ongoing efforts to integrate AI-driven coding and data science features into Colab, building on past updates such as Codey-powered AI coding assistance, announced in May 2023.<\/p>\n\n\n\n<p>It also acts as a kind of advanced and belated rejoinder to OpenAI\u2019s ChatGPT advanced data analysis (previously Code Interpreter), which is now built into ChatGPT when running GPT-4.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-what-is-google-colab\">What is Google Colab?<\/h2>\n\n\n\n<p>Google Colab (short for colaboratory) is a cloud-based Jupyter Notebook environment that enables users to write and execute Python code directly in their browser.<\/p>\n\n\n\n<p>Jupyter Notebook is an open-source web application that enables users to create and share documents containing live code, equations, visualizations and narrative text. Originating from the IPython project in 2014, it now supports more than 40 programming languages, including Python, R and Julia. This interactive platform is widely used in data science, research and education for tasks like data analysis, visualization and teaching programming concepts.<\/p>\n\n\n\n<p>Since its launch in 2017, Google Colab has become one of the most widely-used platforms for machine learning (ML) data science and education.<\/p>\n\n\n\n<p>As Ori Abramovsky, data science lead at Spectralops.io, detailed in an excellent Medium post from 2023, Colab\u2019s ease of use and free access to GPUs and TPUs make it a standout option for many developers and researchers.<\/p>\n\n\n\n<p>He noted that the low barrier to entry, seamless integration with Google Drive and support for TPUs allowed his team to dramatically shorten training cycles while working on AI models.<\/p>\n\n\n\n<p>However, Abramovsky also pointed out Colab\u2019s limitations, such as:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Session time limits<\/strong> (especially for free-tier users).<\/li>\n\n\n\n<li><strong>Unpredictable resource allocation<\/strong> at peak usage times.<\/li>\n\n\n\n<li><strong>Lack of critical features<\/strong>, like efficient pipeline execution and advanced scheduling.<\/li>\n\n\n\n<li><strong>Support challenges<\/strong>, as Google provides limited options for direct assistance.<\/li>\n<\/ul>\n\n\n\n<p>Despite these drawbacks, Abramovsky emphasized that Colab remains one of the best serverless notebook solutions available \u2014 particularly in the early stages of ML and data analysis projects.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-simplifying-data-analysis-with-ai\">Simplifying data analysis with AI<\/h2>\n\n\n\n<p>The data science agent builds on Colab\u2019s serverless notebook environment by eliminating the need for manual setup.<\/p>\n\n\n\n<p>Using Google\u2019s Gemini AI, users can describe their analytical goals in plain English (<em>\u201cvisualize trends,\u201d<\/em> <em>\u201ctrain a prediction model,\u201d<\/em> <em>\u201cclean missing values\u201d<\/em>), and the agent generates fully-executable Colab notebooks in response.<\/p>\n\n\n\n<p>It supports users by:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Automating analysis<\/strong>: Generates complete, working notebooks instead of isolated code snippets.<\/li>\n\n\n\n<li><strong>Saving time<\/strong>: Eliminates manual setup and repetitive coding.<\/li>\n\n\n\n<li><strong>Enhancing collaboration<\/strong>: Features built-in sharing features for team-based projects.<\/li>\n\n\n\n<li><strong>Offering modifiable solutions<\/strong>: Users can adjust and customize generated code.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-data-science-agent-is-already-accelerating-real-world-scientific-research\">Data science agent is already accelerating real-world scientific research<\/h2>\n\n\n\n<p>According to Google, early testers have reported significant time savings when using data science agent.<\/p>\n\n\n\n<p>For instance, a scientist at Lawrence Berkeley National Laboratory working on tropical wetland methane emissions estimated that their data processing time dropped from one week to just five minutes when using the agent.<\/p>\n\n\n\n<p>The tool has also performed well in industry benchmarks, ranking 4th on the DABStep: Data Agent Benchmark for Multi-step Reasoning on Hugging Face, ahead of AI agents such as ReAct (GPT-4.0), Deepseek, Claude 3.5 Haiku and Llama 3.3 70B.<\/p>\n\n\n\n<p>However, OpenAI\u2019s rival o3-mini and o1 models, as well as Anthropic\u2019s Claude 3.5 Sonnet, both outclassed the new Gemini data science agent.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1418\" height=\"264\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39%E2%80%AFPM-1.png?w=800\" alt=\"\" class=\"wp-image-2998429\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39\u202fPM-1.png 1418w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39\u202fPM-1.png?resize=300,56 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39\u202fPM-1.png?resize=768,143 768w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39\u202fPM-1.png?resize=800,149 800w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39\u202fPM-1.png?resize=400,74 400w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39\u202fPM-1.png?resize=750,140 750w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39\u202fPM-1.png?resize=578,108 578w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-2.55.39\u202fPM-1.png?resize=930,173 930w\" sizes=\"(max-width: 1418px) 100vw, 1418px\"\/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-getting-started\">Getting started<\/h2>\n\n\n\n<p>Users can start using data science agent in Google Colab by following these steps:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Open a new Colab notebook<\/strong>.<\/li>\n\n\n\n<li><strong>Upload a dataset<\/strong> (CSV, JSON, etc.).<\/li>\n\n\n\n<li><strong>Describe the analysis in natural language<\/strong> using the Gemini side panel.<\/li>\n\n\n\n<li><strong>Execute the generated notebook<\/strong> to see insights and visualizations.<\/li>\n<\/ol>\n\n\n\n<p>Google provides sample datasets and prompt ideas to help users explore its capabilities, including:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Stack Overflow developer survey<\/strong>: \u201cVisualize most popular programming languages.\u201d<\/li>\n\n\n\n<li><strong>Iris Species dataset<\/strong>: \u201cCalculate and visualize Pearson, Spearman and Kendall correlations.\u201d<\/li>\n\n\n\n<li><strong>Glass Classification dataset<\/strong>: \u201cTrain a random forest classifier.\u201d<\/li>\n<\/ul>\n\n\n\n<p>Anytime a user wants to use the new agent, they\u2019ll have to navigate to Colab and click \u201cfile,\u201d then \u201cnew notebook in drive,\u201d and the resulting notebook will be stored in their Google Drive cloud account.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-my-own-brief-demo-usage-was-more-mixed\">My own brief demo usage was more mixed<\/h2>\n\n\n\n<p>Granted, I\u2019m a lowly tech journalist and not a data scientist, but my own usage of the new Gemini 2.0-powered data science agent in Colab so far has been less than seamless.<\/p>\n\n\n\n<p>I uploaded five CSV files (comma separated values, standard spreadsheet files from Excel or Sheets) and asked it <em>\u201cHow much am I spending each month and quarter on my utilities?\u201d<\/em>.<\/p>\n\n\n\n<p>The agent went ahead and performed the following operations:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Merged datasets<\/strong>, handling date and account number inconsistencies.<\/li>\n\n\n\n<li><strong>Filtered and cleaned the data<\/strong>, ensuring only relevant expenses remained.<\/li>\n\n\n\n<li><strong>Grouped transactions<\/strong> by month and quarter to calculate spending.<\/li>\n\n\n\n<li><strong>Generated visualizations<\/strong>, such as line charts for trend analysis.<\/li>\n\n\n\n<li><strong>Summarized findings<\/strong> in a clear, structured report.<\/li>\n<\/ul>\n\n\n\n<p>Before execution, Colab prompted a confirmation message, reminding me that it might interact with external APIs.<\/p>\n\n\n\n<p>It did all this very rapidly and smoothly in the browser, in a matter of seconds. And it was impressive to watch it work through the analysis and programming with visible step-by-step descriptions of what it was doing.<\/p>\n\n\n\n<p>However, it ultimately generated an inaccurate graph showing just one month\u2019s utility spending, failing to recognize the sheets included a full year\u2019s worth broken out by months. When I asked it to revise, it gamely tried, but ultimately couldn\u2019t produce the correct code string to answer my prompt. <\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1145\" height=\"767\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37%E2%80%AFPM.png?w=800\" alt=\"\" class=\"wp-image-2998430\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37\u202fPM.png 1145w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37\u202fPM.png?resize=300,200 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37\u202fPM.png?resize=768,514 768w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37\u202fPM.png?resize=800,536 800w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37\u202fPM.png?resize=400,268 400w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37\u202fPM.png?resize=750,502 750w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37\u202fPM.png?resize=578,387 578w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.01.37\u202fPM.png?resize=930,623 930w\" sizes=\"auto, (max-width: 1145px) 100vw, 1145px\"\/><\/figure>\n\n\n\n<p>I tried from scratch with the exact same prompt on a new notebook in Google Colab, and it produced a far better, yet still odd result.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1006\" height=\"507\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52%E2%80%AFPM.png?w=800\" alt=\"\" class=\"wp-image-2998431\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png 1006w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=300,151 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=768,387 768w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=800,403 800w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=100,50 100w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=350,175 350w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=400,202 400w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=750,378 750w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=578,291 578w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/03\/Screenshot-2025-03-03-at-3.00.52\u202fPM.png?resize=930,469 930w\" sizes=\"auto, (max-width: 1006px) 100vw, 1006px\"\/><\/figure>\n\n\n\n<p>I\u2019ll have to try troubleshooting it some more, and as I said, the initial erroneous result may be due to my own lack of experience using data science tools. <\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-colab-pricing-and-ai-features\">Colab pricing and AI features<\/h2>\n\n\n\n<p>While Google Colab remains free, users who need additional compute power can upgrade to paid plans:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Colab pro ($9.99\/month)<\/strong>: 100 compute units, faster GPUs, more memory, terminal access.<\/li>\n\n\n\n<li><strong>Colab pro+ ($49.99\/month)<\/strong>: 500 compute units, priority GPU upgrades, background execution.<\/li>\n\n\n\n<li><strong>Colab enterprise<\/strong>: Google Cloud integration, AI-powered code generation.<\/li>\n\n\n\n<li><strong>Pay-as-you-go<\/strong>: $9.99 for 100 compute units, $49.99 for 500 compute units.<\/li>\n<\/ul>\n\n\n\n<p>In addition to data science agent, Google has been expanding AI capabilities within Colab.<\/p>\n\n\n\n<p>Google collects prompts, generated code and user feedback to improve its AI models. While data is stored for up to 18 months, it is anonymized, and deletion requests may not always be fulfilled. Users are advised not to submit sensitive or personal information, as human reviewers may process prompts. Additionally, AI-generated code should be reviewed carefully, as it may contain inaccuracies.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-feedback-welcome\">Feedback welcome<\/h2>\n\n\n\n<p>Google encourages users to provide feedback through the Google Labs Discord community in the #data-science-agent channel.<\/p>\n\n\n\n<p>With AI-driven automation becoming a key trend in data science, Google\u2019s data science agent in Colab could help researchers and developers focus more on insights and less on coding setup. As the tool expands to more users and regions, it will be interesting to see how it shapes the future of AI-assisted analytics.<\/p>\n<div id=\"boilerplate_2660155\" class=\"post-boilerplate boilerplate-after\"><div class=\"Boilerplate__newsletter-container vb\">\n<div class=\"Boilerplate__newsletter-main\">\n<p><strong>Daily insights on business use cases with VB Daily<\/strong><\/p>\n<p class=\"copy\">If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.<\/p>\n<p class=\"Form__newsletter-legal\">Read our Privacy Policy<\/p>\n<p class=\"Form__success\" id=\"boilerplateNewsletterConfirmation\">\n\t\t\t\t\tThanks for subscribing. Check out more VB newsletters here.\n\t\t\t\t<\/p>\n<p class=\"Form__error\">An error occured.<\/p>\n<\/p><\/div>\n<div class=\"image-container\">\n\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/venturebeat.com\/wp-content\/themes\/vb-news\/brand\/img\/vb-daily-phone.png\" alt=\"\"\/>\n\t\t\t\t<\/div>\n<\/p><\/div>\n<\/div>\t\t\t<\/div>\r\n<br>\r\n<br><a href=\"https:\/\/venturebeat.com\/ai\/google-launches-free-gemini-powered-data-science-agent-on-its-colab-python-platform\/\">Source link <\/a>","protected":false},"excerpt":{"rendered":"<p>Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More AI agents are all the rage, but how about one focused specifically on analyzing, sorting and drawing conclusions from vast volumes of data? Google\u2019s data science agent does just that: The new, free Gemini 2.0-powered AI [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":401,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[33],"tags":[],"class_list":["post-400","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-automation"],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/violethoward.com\/new\/wp-content\/uploads\/2025\/03\/cfr0z3n_google_theme_primary_colors_flat_illustration_looking_o_03263a95-744d-4847-8847-495ff6095ca2.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/400","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/comments?post=400"}],"version-history":[{"count":0,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/400\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media\/401"}],"wp:attachment":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media?parent=400"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/categories?post=400"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/tags?post=400"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69b0ea1f46fa5c3231e56837. Config Timestamp: 2026-03-11 04:05:51 UTC, Cached Timestamp: 2026-04-08 03:19:59 UTC -->