
NVIDIA smashes MLPerf benchmarks for AI

08 Nov 2019

NVIDIA has achieved its fastest results yet for AI inference workloads in data centers and at the edge.

MLPerf Inference 0.5 is the industry’s first independent suite of AI benchmarks for inference. The benchmarks cover a range of form factors and inferencing scenarios for AI operations such as image classification, object detection, and translation.

NVIDIA Turing GPUs for data centers and NVIDIA Xavier system-on-a-chip for edge computing topped all five MLPerf benchmark tests, the company reports.

NVIDIA was the only AI platform company to submit results across all five MLPerf benchmarks.

Turing GPUs reportedly provided the highest performance per processor among commercially available entries, while Xavier ranked highest among commercially available edge and mobile SoCs in both edge-focused scenarios (single-stream and multi-stream).

All of NVIDIA’s MLPerf results were achieved using NVIDIA TensorRT 6, high-performance deep learning inference software that optimizes AI applications and deploys them in production from the data center to the edge. New TensorRT optimizations are also available as open source on GitHub.

NVIDIA’s general manager and vice president of accelerated computing, Ian Buck, says AI is now at a tipping point as it moves from research to large-scale deployment for real applications.

“AI inference is a tremendous computational challenge. Combining the industry’s most advanced programmable accelerator, the CUDA-X suite of AI algorithms and our deep expertise in AI computing, NVIDIA can help data centers deploy their large and growing body of complex AI models.”

NVIDIA says that GPUs accelerate large-scale inference workloads in the world’s largest cloud infrastructures, including Alibaba Cloud, AWS, Google Cloud Platform, Microsoft Azure and Tencent. AI is now moving to the edge at the point of action and data creation.

NVIDIA also announced Jetson Xavier NX, a small and powerful AI supercomputer for robotic and embedded computing devices at the edge. It joins other solutions in the Jetson family, including the Jetson Nano, the Jetson AGX Xavier series and the Jetson TX2 series.

The Xavier NX is designed to help create embedded edge computing devices that demand increased performance but are constrained by size, weight, power budgets or cost. These include small commercial robots, drones, intelligent high-resolution sensors for factory logistics and production lines, optical inspection, network video recorders, portable medical devices and other industrial IoT systems.

The Jetson Xavier NX module will be available in March from NVIDIA’s distribution channels for companies looking to create high-volume production edge systems.
