{"id":766,"date":"2025-03-23T07:51:43","date_gmt":"2025-03-23T07:51:43","guid":{"rendered":"https:\/\/violethoward.com\/new\/nvidia-debuts-llama-nemotron-open-reasoning-models-in-a-bid-to-advance-agentic-ai\/"},"modified":"2025-03-23T07:51:43","modified_gmt":"2025-03-23T07:51:43","slug":"nvidia-debuts-llama-nemotron-open-reasoning-models-in-a-bid-to-advance-agentic-ai","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/nvidia-debuts-llama-nemotron-open-reasoning-models-in-a-bid-to-advance-agentic-ai\/","title":{"rendered":"Nvidia debuts Llama Nemotron open reasoning models in a bid to advance agentic AI"},"content":{"rendered":" \r\n<br><div>\n\t\t\t\t<div id=\"boilerplate_2682874\" class=\"post-boilerplate boilerplate-before\">\n<p><em>Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More<\/em><\/p>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity is-style-wide\"\/>\n<\/div><p>Nvidia is getting into the open source reasoning model market.<\/p>\n\n\n\n<p>At the Nvidia GTC event today, the AI giant made a series of hardware and software announcements. Buried amidst the big silicon announcements, the company announced a new set of open source Llama Nemotron reasoning models to help accelerate agentic AI workloads. The new models are an extension of the Nvidia Nemotron models that were first announced in January at the Consumer Electronics Show (CES).<\/p>\n\n\n\n<p>The new\u00a0Llama Nemotron reasoning models are in part a response to the dramatic rise of reasoning models in 2025. Nvidia (and its stock price) were rocked to the core earlier this year when DeepSeek R1 came out, offering the promise of an open source reasoning model and superior performance.<\/p>\n\n\n\n<p>The Llama Nemotron family models are competitive with DeepSeek offering business-ready AI reasoning models for advanced agents.\u00a0<\/p>\n\n\n\n<p>\u201cAgents are autonomous software systems designed to reason, plan, act and critique their work,\u201d Kari Briski, vice president of Generative AI Software Product Managements at Nvidia said during a GTC pre-briefing with press. \u201cJust like humans, agents need to understand context to breakdown complex requests, understand the user\u2019s intent, and adapt in real time.\u201d<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-what-s-inside-llama-nemotron-for-agentic-ai\">What\u2019s inside Llama Nemotron for agentic AI<\/h2>\n\n\n\n<p>As the name implies Llama Nemotron is based on Meta\u2019s open source Llama models.<\/p>\n\n\n\n<p>With Llama as the foundation,\u00a0Briski said that Nvidia algorithmically pruned the model to optimize compute requirements while maintaining accuracy.<\/p>\n\n\n\n<p>Nvidia also applied sophisticated post-training techniques using synthetic data. The training involved  360,000 H100 inference hours and 45,000 human annotation hours to enhance reasoning capabilities. All that training results in models that have exceptional reasoning capabilities across key benchmarks for math, tool calling, instruction following and conversational tasks, according to Nvidia.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-the-llama-nemotron-family-has-three-different-models\">The Llama Nemotron family has three different models<\/h2>\n\n\n\n<p>The family includes three models targeting different deployment scenarios:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Nemotron Nano<\/strong>: Optimized for edge and smaller deployments while maintaining high reasoning accuracy.<\/li>\n\n\n\n<li><strong>Nemotron Super<\/strong>: Balanced for optimal throughput and accuracy on single data center GPUs.<\/li>\n\n\n\n<li><strong>Nemotron Ultra<\/strong>: Designed for maximum \u201cagentic accuracy\u201d in multi-GPU data center environments.<\/li>\n<\/ul>\n\n\n\n<p>For availability, Nano and Super are now available at NIM micro services and can be downloaded at AI.NVIDIA.com. Ultra is coming soon.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-hybrid-reasoning-helps-to-advance-agentic-ai-workloads\">Hybrid reasoning helps to advance agentic AI workloads<\/h2>\n\n\n\n<p>One of the key features in Nvidia Llama Nemotron is the ability to toggle reasoning on or off.<\/p>\n\n\n\n<p>The ability to toggle reasoning is an emerging capability in the AI market. Anthropic Claude 3.7 has a somewhat similar functionality, though that model is a closed proprietary model. In the open source space IBM Granite 3.2 also has a reasoning toggle that IBM refers to as \u2013 conditional reasoning.<\/p>\n\n\n\n<p>The promise of hybrid or conditional reasoning is that it allows systems to bypass computationally expensive reasoning steps for simple queries. In a demonstration, Nvidia showed how the model could engage complex reasoning when solving a combinatorial problem but switch to direct response mode for simple factual queries.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-nvidia-agent-ai-q-blueprint-provides-an-enterprise-integration-layer\">Nvidia Agent AI-Q blueprint provides an enterprise integration layer<\/h2>\n\n\n\n<p>Recognizing that models alone aren\u2019t sufficient for enterprise deployment, Nvidia also\u00a0 announced the Agent AI-Q blueprint, an open-source framework for connecting AI agents to enterprise systems and data sources.<\/p>\n\n\n\n<p>\u201cAI-Q is a new blueprint that enables agents to query multiple data types\u2014text, images, video\u2014and leverage external tools like web search and other agents,\u201d Briski said. \u201cFor teams of connected agents, the blueprint provides observability and transparency into agent activity, allowing developers to improve the system over time.\u201d<\/p>\n\n\n\n<p>The AI-Q blueprint is set to become available in April<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-why-this-matters-for-enterprise-ai-adoption\">Why this matters for enterprise AI adoption<\/h2>\n\n\n\n<p>For enterprises considering advanced AI agent deployments, Nvidia\u2019s\u00a0announcements address several key challenges.<\/p>\n\n\n\n<p>The open nature of Llama Nemotron models allows businesses to deploy reasoning-capable AI within their own infrastructure. That\u2019s important as it can address data sovereignty and privacy concerns that can have limited adoption of cloud-only solutions. By building the new models as NIMs, Nvidia is also making it easier for organizations to deploy and manage deployments, whether on-premises or in the cloud.<\/p>\n\n\n\n<p>The hybrid, conditional reasoning approach is also important to note as it provides organizations with another option to choose from for this type of emerging capability. Hybrid reasoning allows enterprises to optimize for either thoroughness or speed, saving on latency and compute for simpler tasks while still enabling complex reasoning when needed.<\/p>\n\n\n\n<p>As enterprise AI moves beyond simple applications to more complex reasoning tasks, Nvidia\u2019s combined offering of efficient reasoning models and integration frameworks positions companies to deploy more sophisticated AI agents that can handle multi-step logical problems while maintaining deployment flexibility and cost efficiency.<\/p>\n<div id=\"boilerplate_2660155\" class=\"post-boilerplate boilerplate-after\"><div class=\"Boilerplate__newsletter-container vb\">\n<div class=\"Boilerplate__newsletter-main\">\n<p><strong>Daily insights on business use cases with VB Daily<\/strong><\/p>\n<p class=\"copy\">If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.<\/p>\n<p class=\"Form__newsletter-legal\">Read our Privacy Policy<\/p>\n<p class=\"Form__success\" id=\"boilerplateNewsletterConfirmation\">\n\t\t\t\t\tThanks for subscribing. Check out more VB newsletters here.\n\t\t\t\t<\/p>\n<p class=\"Form__error\">An error occured.<\/p>\n<\/p><\/div>\n<div class=\"image-container\">\n\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/venturebeat.com\/wp-content\/themes\/vb-news\/brand\/img\/vb-daily-phone.png\" alt=\"\"\/>\n\t\t\t\t<\/div>\n<\/p><\/div>\n<\/div>\t\t\t<\/div>\r\n<br>\r\n<br><a href=\"https:\/\/venturebeat.com\/ai\/nvidia-debuts-llama-nemotron-open-reasoning-models-in-a-bid-to-advance-agentic-ai\/\">Source link <\/a>","protected":false},"excerpt":{"rendered":"<p>Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Nvidia is getting into the open source reasoning model market. At the Nvidia GTC event today, the AI giant made a series of hardware and software announcements. Buried amidst the big silicon announcements, the company announced [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":767,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[33],"tags":[],"class_list":["post-766","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-automation"],"aioseo_notices":[],"jetpack_featured_media_url":"https:\/\/violethoward.com\/new\/wp-content\/uploads\/2025\/03\/ai_generated_code_nvidia-smk.jpg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/766","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/comments?post=766"}],"version-history":[{"count":0,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/posts\/766\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media\/767"}],"wp:attachment":[{"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/media?parent=766"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/categories?post=766"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/violethoward.com\/new\/wp-json\/wp\/v2\/tags?post=766"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}<!-- This website is optimized by Airlift. Learn more: https://airlift.net. Template:. Learn more: https://airlift.net. Template: 69e302c146fa5c92dc28ac12. Config Timestamp: 2026-04-18 04:04:16 UTC, Cached Timestamp: 2026-04-28 23:16:36 UTC -->