Post-Trained by NVIDIA, New Llama Nemotron Reasoning Models Provide Business-Ready Foundation for Agentic AI
NVIDIA has unveiled the open Llama Nemotron family of models, engineered with advanced reasoning capabilities to empower developers and enterprises in building AI-driven agents. These models are designed to function autonomously or collaboratively, addressing complex tasks with precision and efficiency.
Enhanced AI Reasoning for Business Applications
Built upon the Llama model architecture, the NVIDIA Llama Nemotron reasoning family introduces on-demand AI reasoning capabilities. NVIDIA has enhanced these models through post-training refinements, significantly improving their proficiency in multistep mathematics, coding, reasoning, and complex decision-making.
Through this optimization, model accuracy has increased by up to 20% compared to the base version, while inference speed has been boosted by five times relative to other leading open reasoning models. These advancements enable enterprises to tackle sophisticated reasoning tasks, improve decision-making processes, and minimize operational costs.
Several leading AI platform innovators—including Accenture, Amdocs, Atlassian, Box, Cadence, CrowdStrike, Deloitte, IQVIA, Microsoft, SAP, and ServiceNow—are collaborating with NVIDIA to integrate these reasoning models into their software solutions.
“Reasoning and agentic AI adoption is incredible,” said Jensen Huang, founder and CEO of NVIDIA. “NVIDIA’s open reasoning models, software, and tools give developers and enterprises everywhere the building blocks to create an accelerated agentic AI workforce.”
Optimized AI Models for Enterprise-Grade Deployment
To meet diverse deployment needs, the Llama Nemotron model family is available as NVIDIA NIM™ microservices in three tiers:
- Nano Model: Optimized for high-accuracy performance on PCs and edge devices.
- Super Model: Delivers superior accuracy and high throughput on a single GPU.
- Ultra Model: Provides maximum accuracy for AI agents running on multi-GPU servers.
These models have undergone rigorous post-training on NVIDIA DGX™ Cloud using high-quality synthetic datasets generated by NVIDIA Nemotron™ and other open models. NVIDIA also collaborated on additional curated datasets to ensure optimal performance.
By making these tools, datasets, and optimization techniques openly accessible, NVIDIA offers enterprises the flexibility to tailor custom reasoning models to their specific needs.
Industry Leaders Leverage AI for Enhanced Enterprise Solutions
AI platform leaders are incorporating NVIDIA’s Llama Nemotron reasoning models into their solutions to enhance business intelligence and automation.
Microsoft is integrating these models into Microsoft Azure AI Foundry, expanding its model catalog and enhancing services like Azure AI Agent Service for Microsoft 365.
SAP is leveraging Llama Nemotron models to enhance its SAP Business AI solutions and Joule, its AI copilot. By utilizing NVIDIA NIM and NVIDIA NeMo™ microservices, SAP aims to improve code completion accuracy for SAP ABAP programming.
“We are collaborating with NVIDIA to integrate Llama Nemotron reasoning models into Joule to enhance our AI agents, making them more intuitive, accurate, and cost-effective,” said Walter Sun, global head of AI at SAP. “These advanced reasoning models will refine and rewrite user queries, enabling our AI to better understand inquiries and deliver smarter, more efficient AI-powered experiences that drive business innovation.”
ServiceNow is leveraging these models to develop AI agents that drive efficiency and productivity across industries.
Accenture has incorporated Llama Nemotron models into its AI Refinery platform, offering industry-specific AI agent solutions that accelerate enterprise transformation.
Deloitte plans to integrate Llama Nemotron models into Zora AI, its recently launched agentic AI platform designed to support human decision-making and business operations with transparent, industry-specific AI agents.
Also Read: Quantexa Unveils Next-Gen AI Decision Intelligence at QuanCon25
NVIDIA AI Enterprise: The Foundation for Agentic AI
To facilitate enterprise adoption of agentic AI, NVIDIA has introduced a suite of AI tools within the NVIDIA AI Enterprise platform:
- NVIDIA AI-Q Blueprint: Enables enterprises to build AI agents with autonomous reasoning, multimodal search, and structured data integration.
- NVIDIA AI Data Platform: Provides a customizable reference infrastructure for AI query agents using the AI-Q Blueprint.
- New NVIDIA NIM Microservices: Optimize inference for complex AI applications, allowing continuous learning and real-time adaptation.
- NVIDIA NeMo Microservices: Support enterprise AI data strategies by enabling agents to learn dynamically from user feedback, enhancing adaptability and performance.
These tools collectively establish a robust foundation for businesses seeking to integrate AI-driven decision-making and automation into their operations.
With the introduction of the Llama Nemotron reasoning models, NVIDIA is empowering organizations to harness the full potential of AI, driving innovation and efficiency in an increasingly complex digital landscape.