EU and US could reach trade deal this weekend - Reuters
SAN JOSE, Calif. - NVIDIA (NVDA), the semiconductor giant with a market capitalization of $2.83 trillion and a perfect Piotroski Score of 9 according to InvestingPro, has introduced NVIDIA Dynamo, a new open-source software designed to enhance the efficiency and scalability of AI reasoning models in AI factories. Announced today, this platform is poised to help service providers grow and increase revenue by optimizing AI inference requests across extensive GPU networks.
NVIDIA Dynamo, which succeeds the NVIDIA Triton Inference Server™, focuses on maximizing the utilization of GPU resources. It achieves this by orchestrating inference communication across thousands of GPUs and employing disaggregated serving to independently optimize processing and generation phases of large language models (LLMs) on different GPUs.
Jensen Huang, NVIDIA’s CEO, emphasized the importance of industries training AI models to think and learn in various ways. With the company achieving an impressive 114.2% revenue growth and maintaining a robust 75% gross profit margin in the last twelve months, he stated that NVIDIA Dynamo serves these models at scale, driving cost savings and enhancing efficiencies across AI factories.
The software’s intelligent inference optimizations are reported to double the performance and revenue of AI factories using the same number of GPUs on the NVIDIA Hopper™ platform. For instance, running the DeepSeek-R1 model on a cluster of GB200 NVL72 racks, NVIDIA Dynamo increased the number of tokens generated per GPU by over 30 times.
NVIDIA Dynamo’s features include dynamic GPU allocation in response to changing request volumes, the ability to route queries to specific GPUs to minimize response computations, and offloading inference data to more affordable memory and storage devices. These features collectively aim to increase throughput and reduce costs.
The open-source nature of NVIDIA Dynamo supports various frameworks, including PyTorch and NVIDIA TensorRT™-LLM, facilitating the development and optimization of serving AI models across disaggregated inference. Companies like AWS, Cohere, CoreWeave, Dell, and Google Cloud are expected to accelerate their AI inference adoption with NVIDIA Dynamo.
Denis Yarats, CTO of Perplexity AI, expressed anticipation for Dynamo’s distributed serving capabilities to boost inference-serving efficiencies. Similarly, Cohere’s SVP of Engineering, Saurabh Baji, anticipates that NVIDIA Dynamo will enhance their enterprise customer experience.
NVIDIA Dynamo’s innovations include a GPU Planner for dynamic GPU management, a Smart Router for efficient request distribution, a low-latency communication library for rapid GPU-to-GPU data transfer, and a Memory Manager for cost-effective data offloading.
The software will be available in NVIDIA NIM™ microservices and is set to be supported by the NVIDIA AI Enterprise software platform in a future release. This announcement was made during the NVIDIA GTC keynote, and the software’s capabilities are further detailed in a related blog and sessions at the conference, which continues through March 21.
This information is based on a press release statement from NVIDIA. The company’s strong financial position and growth trajectory have caught analysts’ attention, with 25 analysts recently revising their earnings estimates upward. Investors seeking deeper insights into NVIDIA’s financial health and growth prospects can access comprehensive analysis through InvestingPro, which offers exclusive access to over 30 additional key metrics and expert insights not covered in this article.
In other recent news, NVIDIA has announced collaborations with major telecom companies, including T-Mobile, MITRE, Cisco, ODC, and Booz Allen Hamilton, to advance AI-native 6G network infrastructure. This partnership aims to enhance connectivity for various devices by integrating AI into next-generation wireless networks, focusing on improved spectral efficiency and performance. Additionally, Truist Securities has maintained a Buy rating on NVIDIA with a price target of $205, expressing confidence in the company’s prospects ahead of its GTC event. The firm highlighted NVIDIA’s potential to boost investor confidence by demonstrating medium-term visibility into customer spending commitments.
Similarly, UBS has reaffirmed its Buy rating on NVIDIA, setting a price target of $185. The firm noted adjustments in NVIDIA’s product mix due to changes in TSMC’s expansion plans but maintained its projection for GPU shipments. UBS has set revenue estimates for NVIDIA’s first fiscal quarter at approximately $46 billion, with EPS projections for 2025 and 2026 at $5.27 and $6.22, respectively. Meanwhile, expectations for NVIDIA’s GTC 2025 conference suggest a focus on AI servers, with potential announcements about the B300 AI chip and data center networking solutions. Investors are keenly anticipating NVIDIA’s AI conference, hoping for insights that could drive a new wave of optimism and momentum for the company.
This article was generated with the support of AI and reviewed by an editor. For more information see our T&C.