{"id":1526,"date":"2025-05-11T15:42:57","date_gmt":"2025-05-11T15:42:57","guid":{"rendered":"https:\/\/violethoward.com\/new\/alibabas-zerosearch-lets-ai-learn-to-google-itself-slashing-training-costs-by-88-percent\/"},"modified":"2025-05-11T15:42:57","modified_gmt":"2025-05-11T15:42:57","slug":"alibabas-zerosearch-lets-ai-learn-to-google-itself-slashing-training-costs-by-88-percent","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/alibabas-zerosearch-lets-ai-learn-to-google-itself-slashing-training-costs-by-88-percent\/","title":{"rendered":"Alibaba\u2019s \u2018ZeroSearch\u2019 lets AI learn to google itself \u2014 slashing training costs by 88 percent"},"content":{"rendered":" \r\n<br><div>\n\t\t\t\t<div id=\"boilerplate_2682874\" class=\"post-boilerplate boilerplate-before\">\n<p><em>Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n<\/div><p>Researchers at Alibaba Group have developed a novel approach that could dramatically reduce the cost and complexity of training AI systems to search for information, eliminating the need for expensive commercial search engine APIs altogether.<\/p>\n\n\n\n<p>The technique, called \u201cZeroSearch,\u201d allows large language models (LLMs) to develop advanced search capabilities through a simulation approach rather than interacting with real search engines during the training process. This innovation could save companies significant API expenses while offering better control over how AI systems learn to retrieve information.<\/p>\n\n\n\n<p>\u201cReinforcement learning [RL] training requires frequent rollouts, potentially involving hundreds of thousands of search requests, which incur substantial API expenses and severely constrain scalability,\u201d write the researchers in their paper published on arXiv this week. \u201cTo address these challenges, we introduce ZeroSearch, a reinforcement learning framework that incentivizes the search capabilities of LLMs without interacting with real search engines.\u201d<\/p>\n\n\n\n<blockquote class=\"twitter-tweet\"><p lang=\"en\" dir=\"ltr\">Alibaba just dropped ZeroSearch on Hugging Face<\/p><p>Incentivize the Search Capability of LLMs without Searching <a href=\"https:\/\/t.co\/QfniJNO3LH\">pic.twitter.com\/QfniJNO3LH<\/a><\/p>\u2014 AK (@_akhaliq) <a href=\"https:\/\/twitter.com\/_akhaliq\/status\/1920397374007984516?ref_src=twsrc%5Etfw\">May 8, 2025<\/a><\/blockquote> \n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-how-zerosearch-trains-ai-to-search-without-search-engines\">How ZeroSearch trains AI to search without search engines<\/h2>\n\n\n\n<p>The problem that ZeroSearch solves is significant. Companies developing AI assistants that can autonomously search for information face two major challenges: the unpredictable quality of documents returned by search engines during training, and the prohibitively high costs of making hundreds of thousands of API calls to commercial search engines like Google.<\/p>\n\n\n\n<p>Alibaba\u2019s approach begins with a lightweight supervised fine-tuning process to transform an LLM into a retrieval module capable of generating both relevant and irrelevant documents in response to a query. During reinforcement learning training, the system employs what the researchers call a \u201ccurriculum-based rollout strategy\u201d that gradually degrades the quality of generated documents.<\/p>\n\n\n\n<p>\u201cOur key insight is that LLMs have acquired extensive world knowledge during large-scale pretraining and are capable of generating relevant documents given a search query,\u201d the researchers explain. \u201cThe primary difference between a real search engine and a simulation LLM lies in the textual style of the returned content.\u201d<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-outperforming-google-at-a-fraction-of-the-cost\">Outperforming Google at a fraction of the cost<\/h2>\n\n\n\n<p>In comprehensive experiments across seven question-answering datasets, ZeroSearch not only matched but often surpassed the performance of models trained with real search engines. Remarkably, a 7B-parameter retrieval module achieved performance comparable to Google Search, while a 14B-parameter module even outperformed it.<\/p>\n\n\n\n<p>The cost savings are substantial. According to the researchers\u2019 analysis, training with approximately 64,000 search queries using Google Search via SerpAPI would cost about $586.70, while using a 14B-parameter simulation LLM on four A100 GPUs costs only $70.80 \u2014 an 88% reduction.<\/p>\n\n\n\n<p>\u201cThis demonstrates the feasibility of using a well-trained LLM as a substitute for real search engines in reinforcement learning setups,\u201d the paper notes.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-what-this-means-for-the-future-of-ai-development\">What this means for the future of AI development<\/h2>\n\n\n\n<p>This breakthrough is a major shift in how AI systems can be trained. ZeroSearch shows that AI can improve without depending on external tools like search engines.<\/p>\n\n\n\n<p>The impact could be substantial for the AI industry. Until now, training advanced AI systems often required expensive API calls to services controlled by big tech companies. ZeroSearch changes this equation by allowing AI to simulate search instead of using actual search engines.<\/p>\n\n\n\n<p>For smaller AI companies and startups with limited budgets, this approach could level the playing field. The high costs of API calls have been a major barrier to entry in developing sophisticated AI assistants. By cutting these costs by nearly 90%, ZeroSearch makes advanced AI training more accessible.<\/p>\n\n\n\n<p>Beyond cost savings, this technique gives developers more control over the training process. When using real search engines, the quality of returned documents is unpredictable. With simulated search, developers can precisely control what information the AI sees during training.<\/p>\n\n\n\n<p>The technique works across multiple model families, including Qwen-2.5 and LLaMA-3.2, and with both base and instruction-tuned variants. The researchers have made their code, datasets, and pre-trained models available on GitHub and Hugging Face, allowing other researchers and companies to implement the approach.<\/p>\n\n\n\n<p>As large language models continue to evolve, techniques like ZeroSearch suggest a future where AI systems can develop increasingly sophisticated capabilities through self-simulation rather than relying on external services \u2014 potentially changing the economics of AI development and reducing dependencies on large technology platforms.<\/p>\n\n\n\n<p>The irony is clear: in teaching AI to search without search engines, Alibaba may have created a technology that makes traditional search engines less necessary for AI development. As these systems become more self-sufficient, the technology landscape could look very different in just a few years.<\/p>\n\n\n\n\n<div id=\"boilerplate_2660155\" class=\"post-boilerplate boilerplate-after\"><div class=\"Boilerplate__newsletter-container vb\">\n<div class=\"Boilerplate__newsletter-main\">\n<p><strong>Daily insights on business use cases with VB Daily<\/strong><\/p>\n<p class=\"copy\">If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.<\/p>\n<p class=\"Form__newsletter-legal\">Read our Privacy Policy<\/p>\n<p class=\"Form__success\" id=\"boilerplateNewsletterConfirmation\">\n\t\t\t\t\tThanks for subscribing. Check out more VB newsletters here.\n\t\t\t\t<\/p>\n<p class=\"Form__error\">An error occured.<\/p>\n<\/p><\/div>\n<div class=\"image-container\">\n\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/venturebeat.com\/wp-content\/themes\/vb-news\/brand\/img\/vb-daily-phone.png\" alt=\"\"\/>\n\t\t\t\t<\/div>\n<\/p><\/div>\n<\/div>\t\t\t<\/div><template id="3UIMpDPko9fRPzqWBc3p"></template><\/script>\r\n<br>\r\n<br><a href=\"https:\/\/venturebeat.com\/ai\/alibabas-zerosearch-lets-ai-learn-to-google-itself-slashing-training-costs-by-88-percent\/\">Source link <\/a>","protected":false},"excerpt":{"rendered":"<p>Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Researchers at Alibaba Group have developed a novel approach that could dramatically reduce the cost and complexity of training AI systems to search for information, eliminating the need for expensive commercial search engine APIs altogether. The [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":1527,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[33],"tags":[],"class_list":["post-1526","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-automation"],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/violethoward.com\/new\/wp-content\/uploads\/2025\/05\/nuneybits_Vector_art_of_web_search_b324c6ac-bcd1-4106-8ed1-dd4969cc9e5b.webp.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/1526","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/comments?post=1526"}],"version-history":[{"count":0,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/1526\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media\/1527"}],"wp:attachment":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media?parent=1526"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/categories?post=1526"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/tags?post=1526"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69e302c146fa5c92dc28ac12. Config Timestamp: 2026-04-18 04:04:16 UTC, Cached Timestamp: 2026-04-29 05:55:28 UTC -->