NVIDIA TensorRT Inference Server

Nvidia launches NIM to make it smoother to deploy AI models into production

At its GTC conference, Nvidia today announced Nvidia NIM, a new software platform designed to streamline the deployment of custom and pre-trained AI models into production environments. NIM takes the ...

SDxCentral

Nvidia rack-scale Blackwell systems lead new AI inference benchmark

Nvidia’s rack-scale Blackwell systems topped a new benchmark of AI inference performance, with the tech giant’s networking technologies playing a key role in the results. The InferenceMAX v1 ...

VentureBeat

Nvidia’s Vera Rubin is months away — Blackwell is getting faster right now

The big news this week from Nvidia, splashed in headlines across all forms of media, was the company's announcement about its Vera Rubin GPU. This week, Nvidia CEO Jensen Huang used his CES keynote to ...

Network World

Nvidia claims 10x cost savings with open-source inference models

Nvidia has released analysis showing a 4X to 10X reduction in cost per token for AI inferencing by switching to open source models. The cost discounts required combining Blackwell hardware with two ...

The Next Platform

The First AI Benchmarks Pitting AMD Against Nvidia

Rated horsepower for a compute engine is an interesting intellectual exercise, but it is where the rubber hits the road that really matters. We finally have the first benchmarks from MLCommons, the ...

BGR

NVIDIA Is Helping Apple Build A Faster And Better AI Experience

Apple and NVIDIA shared details of a collaboration to improve the performance of LLMs with a new text generation technique for AI. Cupertino writes: Accelerating LLM inference is an important ML ...
