{"id":4765,"date":"2025-12-09T07:50:24","date_gmt":"2025-12-09T07:50:24","guid":{"rendered":"https:\/\/violethoward.com\/new\/booking-coms-agent-strategy-disciplined-modular-and-already-delivering-2x-accuracy\/"},"modified":"2025-12-09T07:50:24","modified_gmt":"2025-12-09T07:50:24","slug":"booking-coms-agent-strategy-disciplined-modular-and-already-delivering-2x-accuracy","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/booking-coms-agent-strategy-disciplined-modular-and-already-delivering-2x-accuracy\/","title":{"rendered":"Booking.com\u2019s agent strategy: Disciplined, modular and already delivering 2\u00d7 accuracy"},"content":{"rendered":"<p> <br \/>\n<br \/><img decoding=\"async\" src=\"https:\/\/images.ctfassets.net\/jdtwqhzvc2n1\/2t04hE5kMFifED9FQ12PcA\/86ec4e3895472b25edb77273488d1c7a\/Booking.png?w=300&amp;q=30\" \/><\/p>\n<p>When many enterprises weren\u2019t even thinking about agentic behaviors or infrastructures, Booking.com had already \u201cstumbled\u201d into them with its homegrown conversational recommendation system. <\/p>\n<p>This early experimentation has allowed the company to take a step back and avoid getting swept up in the frantic AI agent hype. Instead, it is taking a disciplined, layered, modular approach to model development: small, travel-specific models for cheap, fast inference; larger large language models (LLMs) for reasoning and understanding; and domain-tuned evaluations built in-house when precision is critical. <\/p>\n<p>With this hybrid strategy \u2014 combined with selective collaboration with OpenAI \u2014 Booking.com has seen accuracy double across key retrieval, ranking and customer-interaction tasks.<\/p>\n<p>As Pranav Pathak, Booking.com\u2019s AI product development lead, posed to VentureBeat in a new podcast: \u201cDo you build it very, very specialized and bespoke and then have an army of a hundred agents? Or do you keep it general enough and have five agents that are good at generalized tasks, but then you have to orchestrate a lot around them? That&#x27;s a balance that I think we&#x27;re still trying to figure out, as is the rest of the industry.\u201d<\/p>\n<p>Check out the new <i>Beyond the Pilot<\/i> podcast here, and continue reading for highlights. <\/p>\n<h3>Moving from guessing to deep personalization without being \u2018creepy\u2019<\/h3>\n<p>Recommendation systems are core to Booking.com\u2019s customer-facing platforms; however, traditional recommendation tools have been less about recommendation and more about guessing, Pathak conceded. So, from the start, he and his team vowed to avoid generic tools: As he put it, the price and recommendation should be based on customer context. <\/p>\n<p>Booking.com\u2019s initial pre-gen AI tooling for intent and topic detection was a small language model, what Pathak described as \u201cthe scale and size of BERT.\u201d The model ingested the customer\u2019s inputs around their problem to determine whether it could be solved through self-service or bumped to a human agent. <\/p>\n<p>\u201cWe started with an architecture of \u2018you have to call a tool if this is the intent you detect and this is how you&#x27;ve parsed the structure,\u201d Pathak explained. \u201cThat was very, very similar to the first few agentic architectures that came out in terms of reason and defining a tool call.\u201d <\/p>\n<p>His team has since built out that architecture to include an LLM orchestrator that classifies queries, triggers retrieval-augmented generation (RAG) and calls APIs or smaller, specialized language models. \u201cWe&#x27;ve been able to scale that system quite well because it was so close in architecture that, with a few tweaks, we now have a full agentic stack,\u201d said Pathak. <\/p>\n<p>As a result, Booking.com is seeing a 2X increase in topic detection, which in turn is freeing up human agents\u2019 bandwidth by 1.5 to 1.7X. More topics, even complicated ones previously identified as \u2018other\u2019 and requiring escalation, are being automated. <\/p>\n<p>Ultimately, this supports more self-service, freeing human agents to focus on customers with uniquely-specific problems that the platform doesn\u2019t have a dedicated tool flow for \u2014 say, a family that is unable to access its hotel room at 2 a.m. when the front desk is closed. <\/p>\n<p>That not only \u201creally starts to compound,\u201d but has a direct, long-term impact on customer retention, Pathak noted. \u201cOne of the things we&#x27;ve seen is, the better we are at customer service, the more loyal our customers are.\u201d<\/p>\n<p>Another recent rollout is personalized filtering. Booking.com has between 200 and 250 search filters on its website \u2014 an unrealistic amount for any human to sift through, Pathak pointed out. So, his team introduced a free text box that users can type into to immediately receive tailored filters. <\/p>\n<p>\u201cThat becomes such an important cue for personalization in terms of what you&#x27;re looking for in your own words rather than a clickstream,\u201d said Pathak. <\/p>\n<p>In turn, it cues Booking.com into what customers actually want. For instance, hot tubs \u2014 when filter personalization first rolled out, jacuzzi\u2019s were one of the most popular requests. That wasn\u2019t even a consideration previously; there wasn\u2019t even a filter. Now that filter is live. <\/p>\n<p>\u201cI had no idea,\u201d Pathak noted. \u201cI had never searched for a hot tub in my room honestly.\u201d<\/p>\n<p>When it comes to personalization, though, there is a fine line; memory remains complicated, Pathak emphasized. While it\u2019s important to have long-term memories and evolving threads with customers \u2014 retaining information like their typical budgets, preferred hotel star ratings or whether they need disability access \u2014 it must be on their terms and protective of their privacy. <\/p>\n<p>Booking.com is extremely mindful with memory, seeking consent so as to not be \u201ccreepy\u201d when collecting customer information. <\/p>\n<p>\u201cManaging memory is much harder than actually building memory,\u201d said Pathak. \u201cThe tech is out there, we have the technical chops to build it. We want to make sure we don&#x27;t launch a memory object that doesn&#x27;t respect customer consent, that doesn&#x27;t feel very natural.\u201d<\/p>\n<h3>Finding a balance of build versus buy<\/h3>\n<p>\nAs agents mature, Booking.com is navigating a central question facing the entire industry: How narrow should agents become? <\/p>\n<p>Instead of committing to either a swarm of highly specialized agents or a few generalized ones, the company aims for reversible decisions and avoids \u201cone-way doors\u201d that lock its architecture into long-term, costly paths. Pathak\u2019s strategy is: Generalize where possible, specialize where necessary and keep agent design flexible to help ensure resiliency. <\/p>\n<p>Pathak and his team are \u201cvery mindful\u201d of use cases, evaluating where to build more generalized, reusable agents or more task-specific ones. They strive to use the smallest model possible, with the highest level of accuracy and output quality, for each use case. Whatever can be generalized is. <\/p>\n<p>Latency is another important consideration. When factual accuracy and avoiding hallucinations is paramount, his team will use a larger, much slower model; but with search and recommendations, user expectations set speed. (Pathak noted: \u201cNo one\u2019s patient.\u201d)<\/p>\n<p>\u201cWe would, for example, never use something as heavy as GPT-5 for just topic detection or for entity extraction,\u201d he said. <\/p>\n<p>Booking.com takes a similarly elastic tack when it comes to monitoring and evaluations: If it&#x27;s general-purpose monitoring that someone else is better at building and has horizontal capability, they\u2019ll buy it. But if it\u2019s instances where brand guidelines must be enforced, they\u2019ll build their own evals. <\/p>\n<p>Ultimately, Booking.com has leaned into being \u201csuper anticipatory,\u201d agile and flexible. \u201cAt this point with everything that&#x27;s happening with AI, we are a little bit averse to walking through one way doors,\u201d said Pathak. \u201cWe want as many of our decisions to be reversible as possible. We don&#x27;t want to get locked into a decision that we cannot reverse two years from now.\u201d<\/p>\n<h3>What other builders can learn from Booking.com\u2019s AI journey<\/h3>\n<p>Booking.com\u2019s AI journey can serve as an important blueprint for other enterprises. <\/p>\n<p>Looking back, Pathak acknowledged that they started out with a \u201cpretty complicated\u201d tech stack. They\u2019re now in a good place with that, \u201cbut we probably could have started something much simpler and seen how customers interacted with it.\u201d<\/p>\n<p>Given that, he offered this valuable advice: If you\u2019re just starting out with LLMs or agents, out-of-the-box APIs will do just fine. \u201cThere&#x27;s enough customization with APIs that you can already get a lot of leverage before you decide you want to go do more.\u201d <\/p>\n<p>On the other hand, if a use case requires customization not available through a standard API call, that makes a case for in-house tools. <\/p>\n<p>Still, he emphasized: Don&#x27;t start with the complicated stuff. Tackle the \u201csimplest, most painful problem you can find and the simplest, most obvious solution to that.\u201d <\/p>\n<p>Identify the product market fit, then investigate the ecosystems, he advised \u2014 but don\u2019t just rip out old infrastructures because a new use case demands something specific (like moving an entire cloud strategy from AWS to Azure just to use the OpenAI endpoint). <\/p>\n<p>Ultimately: \u201cDon&#x27;t lock yourself in too early,\u201d Pathak noted. \u201cDon&#x27;t make decisions that are one-way doors until you are very confident that that&#x27;s the solution that you want to go with.\u201d<\/p>\n<p><br \/>\n<br \/><a href=\"https:\/\/venturebeat.com\/ai\/booking-coms-agent-strategy-disciplined-modular-and-already-delivering-2\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>When many enterprises weren\u2019t even thinking about agentic behaviors or infrastructures, Booking.com had already \u201cstumbled\u201d into them with its homegrown conversational recommendation system. This early experimentation has allowed the company to take a step back and avoid getting swept up in the frantic AI agent hype. Instead, it is taking a disciplined, layered, modular approach [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":4766,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[33],"tags":[],"class_list":["post-4765","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-automation"],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/violethoward.com\/new\/wp-content\/uploads\/2025\/12\/Booking.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/4765","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/comments?post=4765"}],"version-history":[{"count":0,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/4765\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media\/4766"}],"wp:attachment":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media?parent=4765"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/categories?post=4765"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/tags?post=4765"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69d79d7d46fa5cbf45858bd1. Config Timestamp: 2026-04-09 12:37:16 UTC, Cached Timestamp: 2026-04-29 18:35:49 UTC -->