{"id":3016,"date":"2025-08-06T17:56:06","date_gmt":"2025-08-06T17:56:06","guid":{"rendered":"https:\/\/violethoward.com\/new\/googles-new-diffusion-ai-agent-mimics-human-writing-to-improve-enterprise-research\/"},"modified":"2025-08-06T17:56:06","modified_gmt":"2025-08-06T17:56:06","slug":"googles-new-diffusion-ai-agent-mimics-human-writing-to-improve-enterprise-research","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/googles-new-diffusion-ai-agent-mimics-human-writing-to-improve-enterprise-research\/","title":{"rendered":"Google\u2019s new diffusion AI agent mimics human writing to improve enterprise research"},"content":{"rendered":" \r\n<br><div>\n\t\t\t\t<div id=\"boilerplate_2682874\" class=\"post-boilerplate boilerplate-before\">\n<p><em>Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders.<\/em> <em>Subscribe Now<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n<\/div><p><strong>Google researchers<\/strong> have developed a <strong>new framework for AI research agents that outperforms leading systems from rivals OpenAI, Perplexity, and others<\/strong> on key benchmarks. <\/p>\n\n\n\n<p>The new agent, called Test-Time Diffusion Deep Researcher (TTD-DR), is inspired by the way humans write by going through a process of drafting, searching for information, and making iterative revisions.<\/p>\n\n\n\n<p><strong>The system uses diffusion mechanisms and evolutionary algorithms to produce more comprehensive and accurate research on complex topics. <\/strong><\/p>\n\n\n\n<p>For enterprises, this framework<strong> could power a new generation of bespoke research assistants for high-value tasks <\/strong>that standard retrieval augmented generation (RAG) systems struggle with, such as generating a competitive analysis or a market entry report. <\/p>\n\n\n\n<div id=\"boilerplate_2803147\" class=\"post-boilerplate boilerplate-speedbump\">\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong\/><strong>AI Scaling Hits Its Limits<\/strong><\/p>\n\n\n\n<p>Power caps, rising token costs, and inference delays are reshaping enterprise AI. Join our exclusive salon to discover how top teams are:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Turning energy into a strategic advantage<\/li>\n\n\n\n<li>Architecting efficient inference for real throughput gains<\/li>\n\n\n\n<li>Unlocking competitive ROI with sustainable AI systems<\/li>\n<\/ul>\n\n\n\n<p><strong>Secure your spot to stay ahead<\/strong>: https:\/\/bit.ly\/4mwGngO<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n<\/div><p>According to the paper\u2019s authors, these real-world business use cases were the primary target for the system.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-the-limits-of-current-deep-research-agents\">The limits of current deep research agents<\/h2>\n\n\n\n<p>Deep research (DR) agents are designed to tackle complex queries that go beyond a simple search. They use large language models (LLMs) to plan, use tools like web search to gather information, and then synthesize the findings into a detailed report with the help of test-time scaling techniques such as chain-of-thought (CoT), best-of-N sampling, and Monte-Carlo Tree Search.<\/p>\n\n\n\n<p>However, many of these systems have fundamental design limitations. Most publicly available DR agents apply test-time algorithms and tools without a structure that mirrors human cognitive behavior. Open-source agents often follow a rigid linear or parallel process of planning, searching, and generating content, <strong>making it difficult for the different phases of the research to interact with and correct each other.<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img fetchpriority=\"high\" decoding=\"async\" width=\"547\" height=\"151\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_0237d0.png\" alt=\"\" class=\"wp-image-3015057\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_0237d0.png 547w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_0237d0.png?resize=300,83 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_0237d0.png?resize=400,110 400w\" sizes=\"(max-width: 547px) 100vw, 547px\"\/><figcaption class=\"wp-element-caption\">Example of linear research agent (source: arXiv)<\/figcaption><\/figure><\/div>\n\n\n<p>This can cause the agent to lose the global context of the research and miss critical connections between different pieces of information. <\/p>\n\n\n\n<p>As the paper\u2019s authors note, \u201cThis indicates a fundamental limitation in current DR agent work and highlights the need for a more cohesive, purpose-built framework for DR agents that imitates or surpasses human research capabilities.\u201d<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-a-new-approach-inspired-by-human-writing-and-diffusion\">A new approach inspired by human writing and diffusion<\/h2>\n\n\n\n<p><strong>Unlike the linear process of most AI agents, human researchers work iteratively<\/strong>. They typically start with a <strong>high-level plan, create an initial draft, and then engage in multiple revision cycles<\/strong>. During these revisions, they search for new information to strengthen their arguments and fill in gaps.<\/p>\n\n\n\n<p>The Google researchers observed that this <strong>human process could be emulated with the mechanism of a diffusion model <\/strong>augmented with a retrieval component. (Diffusion models are often used in image generation. They begin with a noisy image and gradually refine it until it becomes a detailed image.)<\/p>\n\n\n\n<p>As the researchers explain, \u201cIn this analogy, a trained diffusion model initially generates a noisy draft, and the denoising module, aided by retrieval tools, revises this draft into higher-quality (or higher-resolution) outputs.\u201d<\/p>\n\n\n\n<p>TTD-DR is built on this blueprint. <strong>The framework treats the creation of a research report as a diffusion process, where an initial, \u201cnoisy\u201d draft is progressively refined into a polished final report.<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"162\" width=\"800\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png?w=800\" alt=\"\" class=\"wp-image-3015058\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png 990w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png?resize=300,61 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png?resize=768,156 768w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png?resize=800,162 800w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png?resize=400,81 400w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png?resize=750,152 750w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png?resize=578,117 578w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_a86202.png?resize=930,189 930w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\"\/><figcaption class=\"wp-element-caption\">TTD-DR uses an iterative approach to refine its initial research plan (source: arXiv)<\/figcaption><\/figure><\/div>\n\n\n<p>This is achieved through two core mechanisms. The first, which the researchers call \u201cDenoising with Retrieval,\u201d starts with a preliminary draft and iteratively improves it. In each step, the agent uses the current draft to formulate new search queries, retrieves external information, and integrates it to \u201cdenoise\u201d the report by correcting inaccuracies and adding detail.<\/p>\n\n\n\n<p>The second mechanism, \u201cSelf-Evolution,\u201d ensures that each component of the agent (the planner, the question generator, and the answer synthesizer) independently optimizes its own performance. In comments to VentureBeat, Rujun Han, research scientist at Google and co-author of the paper, explained that this component-level evolution is crucial because it makes the \u201creport denoising more effective.\u201d This is akin to an evolutionary process where each part of the system gets progressively better at its specific task, providing higher-quality context for the main revision process.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"282\" width=\"800\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png?w=800\" alt=\"\" class=\"wp-image-3015059\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png 1210w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png?resize=300,106 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png?resize=768,270 768w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png?resize=800,282 800w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png?resize=400,141 400w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png?resize=750,264 750w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png?resize=578,203 578w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image.png?resize=930,327 930w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\"\/><figcaption class=\"wp-element-caption\">Each of the components in TTD-DR use evolutionary algorithms to sample and refine multiple responses in parallel and finally combine them to create a final answer (source: arXiv)<\/figcaption><\/figure>\n\n\n\n<p>\u201cThe intricate interplay and synergistic combination of these two algorithms are crucial for achieving high quality research outcomes,\u201d the authors state. This iterative process directly results in reports that are not just more accurate, but also more logically coherent. As Han notes, since the model was evaluated on helpfulness, which includes fluency and coherence, the performance gains are a direct measure of its ability to produce well-structured business documents.<\/p>\n\n\n\n<p>According to the paper,<strong> the resulting research companion is \u201ccapable of generating helpful and comprehensive reports for complex research questions across diverse industry domains,<\/strong> including finance, biomedical, recreation, and technology,\u201d putting it in the same class as deep research products from OpenAI, Perplexity, and Grok.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-ttd-dr-in-action\">TTD-DR in action<\/h2>\n\n\n\n<p>To build and test their framework, the researchers used Google\u2019s Agent Development Kit (ADK), an extensible platform for orchestrating complex AI workflows, with Gemini 2.5 Pro as the core LLM (though you can swap it for other models). <\/p>\n\n\n\n<p>They benchmarked TTD-DR against leading commercial and open-source systems, including OpenAI Deep Research, Perplexity Deep Research, Grok DeepSearch, and the open source GPT-Researcher.\u00a0<\/p>\n\n\n\n<p>The evaluation focused on two main areas. For generating long-form comprehensive reports, they used the DeepConsult benchmark, a collection of business and consulting-related prompts, alongside their own LongForm Research dataset. For answering multi-hop questions that require extensive search and reasoning, they tested the agent on challenging academic and real-world benchmarks like Humanity\u2019s Last Exam (HLE) and GAIA.<\/p>\n\n\n\n<p><strong>The results showed TTD-DR consistently outperforming its competitors. <\/strong>In side-by-side comparisons with OpenAI Deep Research on long-form report generation, TTD-DR achieved win rates of 69.1% and 74.5% on two different datasets. It also surpassed OpenAI\u2019s system on three separate benchmarks that required multi-hop reasoning to find concise answers, with performance gains of 4.8%, 7.7%, and 1.7%.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"183\" width=\"800\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png?w=800\" alt=\"\" class=\"wp-image-3015060\" srcset=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png 1462w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png?resize=300,69 300w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png?resize=768,175 768w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png?resize=800,183 800w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png?resize=400,91 400w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png?resize=750,171 750w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png?resize=578,132 578w, https:\/\/venturebeat.com\/wp-content\/uploads\/2025\/08\/image_40608f.png?resize=930,212 930w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\"\/><figcaption class=\"wp-element-caption\">TTD-DR outperforms other deep research agents on key benchmarks (source: arXiv)<\/figcaption><\/figure><\/div>\n\n\n<h2 class=\"wp-block-heading\" id=\"h-the-future-of-test-time-diffusion\">The Future of Test-Time Diffusion<\/h2>\n\n\n\n<p>While the current research focuses on text-based reports using web search, the framework is designed to be highly adaptable. Han confirmed that the team plans to extend the work to incorporate more tools for complex enterprise tasks.<\/p>\n\n\n\n<p>A <strong>similar \u201ctest-time diffusion\u201d process could be used to generate complex software code<\/strong>, <strong>create a detailed financial model<\/strong>, or <strong>design a multi-stage marketing campaign<\/strong>, where an initial \u201cdraft\u201d of the project is<strong> iteratively refined with new information<\/strong> and feedback from various specialized tools.<\/p>\n\n\n\n<p>\u201cAll of these tools can be naturally incorporated in our framework,\u201d Han said, suggesting that this draft-centric approach could become a foundational architecture for a wide range of complex, multi-step AI agents.<\/p>\n<div id=\"boilerplate_2660155\" class=\"post-boilerplate boilerplate-after\"><div class=\"Boilerplate__newsletter-container vb\">\n<div class=\"Boilerplate__newsletter-main\">\n<p><strong>Daily insights on business use cases with VB Daily<\/strong><\/p>\n<p class=\"copy\">If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.<\/p>\n<p class=\"Form__newsletter-legal\">Read our Privacy Policy<\/p>\n<p class=\"Form__success\" id=\"boilerplateNewsletterConfirmation\">\n\t\t\t\t\tThanks for subscribing. Check out more VB newsletters here.\n\t\t\t\t<\/p>\n<p class=\"Form__error\">An error occured.<\/p>\n<\/p><\/div>\n<div class=\"image-container\">\n\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/venturebeat.com\/wp-content\/themes\/vb-news\/brand\/img\/vb-daily-phone.png\" alt=\"\"\/>\n\t\t\t\t<\/div>\n<\/p><\/div>\n<\/div>\t\t\t<\/div>\r\n<br>\r\n<br><a href=\"https:\/\/venturebeat.com\/ai\/googles-new-diffusion-ai-agent-mimics-human-writing-to-improve-enterprise-research\/\">Source link <\/a>","protected":false},"excerpt":{"rendered":"<p>Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Google researchers have developed a new framework for AI research agents that outperforms leading systems from rivals OpenAI, Perplexity, and others on key benchmarks. The new agent, called Test-Time [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":3017,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[33],"tags":[],"class_list":["post-3016","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-automation"],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/violethoward.com\/new\/wp-content\/uploads\/2025\/08\/deep-research-agent.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/3016","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/comments?post=3016"}],"version-history":[{"count":0,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/3016\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media\/3017"}],"wp:attachment":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media?parent=3016"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/categories?post=3016"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/tags?post=3016"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69e302c146fa5c92dc28ac12. Config Timestamp: 2026-04-18 04:04:16 UTC, Cached Timestamp: 2026-04-29 18:01:04 UTC -->