An AI agent is only as accurate, relevant and timely as the data that powers it.
Now generally available, NVIDIA NeMo microservices are helping enterprise IT quickly build AI teammates that tap into data flywheels to scale employee productivity. The microservices provide an end-to-end developer platform for creating state-of-the-art agentic AI systems and continually optimizing them with data flywheels informed by inference and business data, as well as user preferences.
With a data flywheel, enterprise IT can onboard AI agents as digital teammates. These agents can tap into user interactions and data generated during AI inference to continuously improve model performance, turning usage into insight and insight into action.
Building Powerful Data Flywheels for Agentic AI
Without a constant stream of high-quality inputs from databases, user interactions or real-world signals, an agent’s understanding can weaken, making responses less reliable and agents less productive.
Maintaining and improving the models that power AI agents in production requires three types of data: inference data to gather insights and adapt to evolving data patterns, up-to-date business data to provide intelligence, and user feedback data to indicate whether the model and application are performing as expected. NeMo microservices help developers tap into these three data types.
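As a rough illustration of how inference data and user feedback might be combined in a flywheel loop, here is a minimal Python sketch. Every name in it (`InferenceRecord`, `approval_rate`, `select_finetuning_examples`, `needs_review`) is hypothetical and chosen for this example only; none of it is part of a NeMo microservice API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class InferenceRecord:
    """One logged agent interaction (hypothetical schema)."""
    prompt: str
    response: str
    # User feedback: +1 = helpful, -1 = unhelpful, None = no rating given.
    feedback: Optional[int] = None


def approval_rate(records: List[InferenceRecord]) -> Optional[float]:
    """Share of rated interactions that users marked helpful."""
    rated = [r for r in records if r.feedback is not None]
    if not rated:
        return None  # no feedback yet, so no signal
    return sum(1 for r in rated if r.feedback == 1) / len(rated)


def select_finetuning_examples(records: List[InferenceRecord]) -> List[InferenceRecord]:
    """Keep positively rated interactions as candidate training data."""
    return [r for r in records if r.feedback == 1]


def needs_review(records: List[InferenceRecord]) -> List[InferenceRecord]:
    """Collect negatively rated interactions for correction or evaluation."""
    return [r for r in records if r.feedback == -1]


if __name__ == "__main__":
    logs = [
        InferenceRecord("Reset my password", "Go to Settings > Security.", feedback=1),
        InferenceRecord("What is our PTO policy?", "I am not sure.", feedback=-1),
        InferenceRecord("Summarize this ticket", "Customer reports a login loop."),
    ]
    print("approval rate:", approval_rate(logs))
    print("fine-tuning candidates:", len(select_finetuning_examples(logs)))
    print("flagged for review:", len(needs_review(logs)))
```

In a real deployment the selected examples would be handed to a fine-tuning job and the flagged ones to an evaluation step. The sketch stops at the selection logic because the downstream calls depend on the platform in use.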
NeMo microservices speed AI agent development with end-to-end tools for curating, customizing, evaluating and guardrailing the models that drive those agents.
NVIDIA NeMo microservices, including NeMo Customizer, NeMo Evaluator and NeMo Guardrails, can be used alongside NeMo Retriever and NeMo Curator to make it easier for enterprises to build, optimize and scale AI agents through custom enterprise data flywheels. For example:
- NeMo Customizer accelerates large language model fine-tuning, delivering up to 1.8x higher training throughput. This high-performance, scalable microservice uses popular post-training techniques including supervised fine-tuning and low-rank adaptation.
- NeMo Evaluator simplifies the evaluation of AI models and workflows on custom and industry benchmarks with just five application programming interface (API) calls.
- NeMo Guardrails improves compliance protection by up to 1.4x with only half a second of additional latency, helping organizations implement robust safety and security measures that align with organizational policies and guidelines.
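To give a sense of why low-rank adaptation (LoRA) is a popular post-training technique, the following Python sketch compares the number of trainable parameters for a single weight matrix under full fine-tuning and under LoRA. It illustrates the general technique only and does not describe how NeMo Customizer implements it. The layer size (4096 x 4096) and rank (8) are assumed values picked for the arithmetic.

```python
def full_finetune_params(d_out: int, d_in: int) -> int:
    """Trainable parameters when the whole d_out x d_in matrix is updated."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Trainable parameters for a LoRA update W + B @ A,
    where B is d_out x rank and A is rank x d_in."""
    return rank * (d_out + d_in)


if __name__ == "__main__":
    d_out, d_in, rank = 4096, 4096, 8  # assumed example sizes
    full = full_finetune_params(d_out, d_in)
    lora = lora_params(d_out, d_in, rank)
    print(f"full fine-tuning: {full:,} parameters")   # 16,777,216
    print(f"LoRA (rank {rank}): {lora:,} parameters")  # 65,536
    print(f"ratio: {lora / full:.4%}")                 # 0.3906%
```

Under these assumptions the LoRA update trains 1/256 of the parameters of the full matrix, which is why adapters are cheaper to train and store than fully fine-tuned copies of a model.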
With NeMo microservices, developers can build data flywheels that boost AI agent accuracy and efficiency. Deployed through the NVIDIA AI Enterprise software platform, NeMo microservices are easy to operate and can run on any accelerated computing infrastructure, on premises or in the cloud, with enterprise-grade security, stability and support.
The microservices have become generally available at a time when enterprises are building large-scale multi-agent systems, in which hundreds of specialized agents with distinct goals and workflows collaborate to tackle complex tasks as digital teammates, working alongside employees to assist, augment and accelerate work across functions.
This enterprise-wide impact positions AI agents as a trillion-dollar opportunity, with applications spanning automated fraud detection, shopping assistants, predictive machine maintenance and document review. It also underscores the critical role data flywheels play in transforming business data into actionable insights.

Industry Pioneers Boost AI Agent Accuracy With NeMo Microservices
NVIDIA partners and industry pioneers are using NeMo microservices to build responsive AI agent platforms so that digital teammates can help get more done.
Working with Arize and Quantiphi, AT&T has built an advanced AI-powered agent using NVIDIA NeMo, designed to process a knowledge base of nearly 10,000 documents, refreshed weekly. The scalable, high-performance AI agent is fine-tuned for three key business priorities: speed, cost efficiency and accuracy, all of which become more critical as adoption scales.
AT&T boosted AI agent accuracy by up to 40% using NeMo Customizer and Evaluator by fine-tuning a Mistral 7B model to help deliver personalized services, prevent fraud and optimize network performance.
BlackRock is working with NeMo microservices for agentic AI capabilities in its Aladdin tech platform, which unifies the investment management process through a common data language.
Teaming with Galileo, Cisco’s Outshift team is using NVIDIA NeMo microservices to power a coding assistant that delivers 40% fewer tool selection errors and achieves up to 10x faster response times.
Nasdaq is accelerating its Nasdaq Gen AI Platform with NeMo Retriever microservices and NVIDIA NIM microservices. NeMo Retriever enhanced the platform’s search capabilities, leading to up to 30% improved accuracy and response times, in addition to cost savings.
Broad Model and Partner Ecosystem Support for NeMo Microservices
NeMo microservices support a broad range of popular open models, including Llama, the Microsoft Phi family of small language models, Google Gemma, Mistral and Llama Nemotron Ultra, currently the top open model on scientific reasoning, coding and complex math benchmarks.
Meta has tapped NVIDIA NeMo microservices through new connectors for Meta Llamastack. Users can access the same capabilities, including Customizer, Evaluator and Guardrails, through APIs, enabling them to run the full suite of agent-building workflows within their environment.
“With Llamastack integration, agent builders can implement data flywheels powered by NeMo microservices,” said Raghotham Murthy, software engineer, GenAI, at Meta. “This allows them to continuously optimize models to improve accuracy, boost efficiency and reduce total cost of ownership.”
Leading AI software providers such as Cloudera, Datadog, Dataiku, DataRobot, DataStax, SuperAnnotate, Weights & Biases and more have integrated NeMo microservices into their platforms. Developers can use NeMo microservices in popular AI frameworks including CrewAI, Haystack by deepset, LangChain, LlamaIndex and Llamastack.
Enterprises can build data flywheels with NeMo Retriever microservices using NVIDIA AI Data Platform offerings from NVIDIA-Certified Storage partners including DDN, Dell Technologies, Hewlett Packard Enterprise, Hitachi Vantara, IBM, NetApp, Nutanix, Pure Storage, VAST Data and WEKA.
Leading enterprise platforms including Amdocs, Cadence, Cohesity, SAP, ServiceNow and Synopsys are using NeMo Retriever microservices in their AI agent solutions.
Enterprises can run AI agents on NVIDIA-accelerated infrastructure, networking and software from leading system providers including Cisco, Dell, Hewlett Packard Enterprise and Lenovo.
Consulting giants including Accenture, Deloitte and EY are building AI agent platforms for enterprises using NeMo microservices.
Developers can download NeMo microservices from the NVIDIA NGC catalog. The microservices can be deployed as part of NVIDIA AI Enterprise with extended-life software branches for API stability, proactive security remediation and enterprise-grade support.