AMAX AI Factory Solutions Accelerated by NVIDIA Vera Rubin Platforms

AMAX Engineering delivers turnkey AI factory systems based on NVIDIA DGX platforms using the Vera Rubin architecture. These systems are engineered for predictable performance, controlled scale-out growth, and operational consistency under sustained production workloads.

NVIDIA DGX™ Vera Rubin NVL72

The foundation for gigascale AI training and inference. NVIDIA DGX™ Vera Rubin NVL72 is a turnkey AI infrastructure platform designed to provide leading-edge performance with significantly greater energy efficiency. It offers industry-leading performance per watt and tokens per watt across any AI workload, enabling enterprises to scale intelligence while improving the economics of token generation.

NVIDIA DGX™ Vera Rubin NVL72 Specifications

GPU and CPU: 72x NVIDIA Rubin GPUs, 36x NVIDIA Vera CPUs
Total Fast Memory: 75 TB
Performance:
• NVFP4 Inference: 3,600 PFLOPS
• NVFP4 Training: 2,520 PFLOPS*
• FP8/FP6 Training: 1,260 PFLOPS*
Networking:
• 144x OSFP single-port NVIDIA® ConnectX®-9 VPI with 800 Gb/s NVIDIA InfiniBand and Ethernet
• 18x dual-port NVIDIA BlueField®-4 VPI with 400 Gb/s NVIDIA InfiniBand and Ethernet
NVIDIA NVLink™ Switch System: 9x L1 NVIDIA NVLink Switches
Management Network: Host baseboard management controller (BMC) with RJ45
Software: NVIDIA Mission Control, NVIDIA AI Enterprise, NVIDIA DGX OS
Enterprise Support: Three years of enterprise business-standard support for hardware and software

Specifications subject to change.
*Dense specification.


NVIDIA DGX™ Rubin NVL8

Supercharged performance for agentic AI.

NVIDIA DGX™ Rubin NVL8 serves as a proven foundation for success in the era of agentic AI. Built on the NVIDIA Rubin architecture, DGX Rubin NVL8 is a turnkey AI infrastructure solution purpose-built to accelerate any AI workload and deliver intelligence at scale.


NVIDIA DGX™ Rubin NVL8 Specifications

GPU: 8x NVIDIA Rubin GPUs
Total GPU Memory | Bandwidth: 2.3 TB | 160 TB/s
Performance:
• NVFP4 Inference: 400 PFLOPS
• NVFP4 Training: 280 PFLOPS*
• FP8/FP6 Training: 140 PFLOPS*
CPU: 2x Intel® Xeon® 6776P processors
NVIDIA NVLink Switch System: 4x
NVIDIA NVLink Bandwidth: 28.8 TB/s total bandwidth
System Power Usage: ~24 kW
Networking:
• 8x OSFP ports serving 8x single-port NVIDIA® ConnectX®-9 VPI, up to 800 Gb/s NVIDIA InfiniBand and Ethernet
• 2x 400G QSFP112 NVIDIA BlueField®-4 DPUs, up to 800 Gb/s NVIDIA InfiniBand and Ethernet
Software: NVIDIA DGX OS, Ubuntu, Red Hat Enterprise Linux, Rocky Linux

Specifications subject to change.
*Dense specification.
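
As a rough illustration of how the DGX Rubin NVL8 figures above relate to each other, the sketch below divides the listed peak NVFP4 throughput by the listed approximate system power. The function name is hypothetical, and the result is a ratio of peak specification values, not a measured efficiency for any real workload.

```python
# Figures copied from the DGX Rubin NVL8 specifications above.
NVFP4_INFERENCE_PFLOPS = 400.0   # peak NVFP4 inference
NVFP4_TRAINING_PFLOPS = 280.0    # peak NVFP4 training (dense)
SYSTEM_POWER_KW = 24.0           # listed as "~24 kW", so approximate


def pflops_per_kw(peak_pflops: float, system_kw: float) -> float:
    """Peak throughput per kilowatt (hypothetical helper, spec-sheet values only)."""
    if system_kw <= 0:
        raise ValueError("system_kw must be positive")
    return peak_pflops / system_kw


if __name__ == "__main__":
    for label, peak in [
        ("NVFP4 inference", NVFP4_INFERENCE_PFLOPS),
        ("NVFP4 training", NVFP4_TRAINING_PFLOPS),
    ]:
        ratio = pflops_per_kw(peak, SYSTEM_POWER_KW)
        print(f"{label}: {ratio:.1f} peak PFLOPS per kW")
```

Because both inputs are preliminary "up to" values, the ratio is best treated as an upper bound for comparing configurations on paper.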

NVIDIA Vera Rubin NVL72

Building the next frontier of AI.

NVIDIA Vera Rubin NVL72 unifies leading-edge technologies from NVIDIA: 72 Rubin GPUs, 36 Vera CPUs, ConnectX®-9 SuperNIC™s, and BlueField®-4 DPUs. It scales up intelligence in a rack-scale platform with the NVIDIA NVLink™ 6 switch and scales out with NVIDIA Quantum-X800 InfiniBand and Spectrum-X™ Ethernet to power the AI industrial revolution at scale.

Vera Rubin NVL72 delivers AI training with one-fourth the GPUs and AI inference at one-tenth the cost per million tokens versus NVIDIA Blackwell.
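
Taken at face value, the one-fourth and one-tenth ratios above reduce to simple planning arithmetic. The sketch below is illustrative only: the function names and the example Blackwell baseline are hypothetical, and the ratios are the claim as stated, which presumably depends on model and workload.

```python
import math

GPUS_PER_NVL72_RACK = 72       # Rubin GPUs per Vera Rubin NVL72 rack
TRAINING_GPU_RATIO = 1 / 4     # "one-fourth the GPUs", taken at face value
COST_PER_MTOK_RATIO = 1 / 10   # "one-tenth the cost per million tokens"


def rubin_racks_for(blackwell_gpus: int) -> int:
    """NVL72 racks needed if a Blackwell training job shrinks to 1/4 the GPUs."""
    if blackwell_gpus < 0:
        raise ValueError("blackwell_gpus must be non-negative")
    rubin_gpus = math.ceil(blackwell_gpus * TRAINING_GPU_RATIO)
    return math.ceil(rubin_gpus / GPUS_PER_NVL72_RACK)


def rubin_cost_per_mtok(blackwell_cost_per_mtok: float) -> float:
    """Inference cost per million tokens under the stated one-tenth ratio."""
    return blackwell_cost_per_mtok * COST_PER_MTOK_RATIO


if __name__ == "__main__":
    baseline_gpus = 1152   # hypothetical Blackwell GPU count, not from the source
    baseline_cost = 2.00   # hypothetical cost per million tokens, not from the source
    print(rubin_racks_for(baseline_gpus), "NVL72 rack(s)")
    print(f"{rubin_cost_per_mtok(baseline_cost):.2f} per million tokens")
```

Rounding up at both steps keeps the estimate conservative, since racks are deployed whole.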

NVIDIA Vera Rubin NVL72 Specifications

| Specification | NVIDIA Vera Rubin NVL72 | NVIDIA Vera Rubin Superchip | NVIDIA Rubin GPU |
| --- | --- | --- | --- |
| Configuration | 72 NVIDIA Rubin GPUs, 36 NVIDIA Vera CPUs | 2 NVIDIA Rubin GPUs, 1 NVIDIA Vera CPU | 1 NVIDIA Rubin GPU |
| NVFP4 Inference | 3,600 PFLOPS | 100 PFLOPS | 50 PFLOPS |
| NVFP4 Training² | 2,520 PFLOPS | 70 PFLOPS | 35 PFLOPS |
| FP8/FP6 Training² | 1,260 PFLOPS | 35 PFLOPS | 17.5 PFLOPS |
| INT8² | 18 POPS | 0.5 POPS | 0.25 POPS |
| FP16/BF16² | 288 PFLOPS | 8 PFLOPS | 4 PFLOPS |
| TF32² | 144 PFLOPS | 4 PFLOPS | 2 PFLOPS |
| FP32 | 9,360 TFLOPS | 260 TFLOPS | 130 TFLOPS |
| FP64 | 2,400 TFLOPS | 67 TFLOPS | 33 TFLOPS |
| FP32 SGEMM³ | 28,800 TFLOPS | 800 TFLOPS | 400 TFLOPS |
| FP64 DGEMM³ | 14,400 TFLOPS | 400 TFLOPS | 200 TFLOPS |
| GPU Memory / Bandwidth | 20.7 TB HBM4 / 1,580 TB/s | 576 GB HBM4 / 44 TB/s | 288 GB HBM4 / 22 TB/s |
| NVLink Bandwidth | 260 TB/s | 7.2 TB/s | 3.6 TB/s |
| NVLink-C2C Bandwidth | 65 TB/s | 1.8 TB/s | - |
| CPU Core Count | 3,168 custom NVIDIA Olympus cores (Arm compatible) | 88 custom NVIDIA Olympus cores (Arm compatible) | - |
| CPU Memory | 54 TB LPDDR5X | 1.5 TB LPDDR5X | - |
| Total NVIDIA + HBM4 Chips | 1,296 | 30 | 12 |

1. Preliminary information. All values are "up to" (maximum) figures and are subject to change.
2. Dense specification.
3. Peak performance using Tensor Core-based emulation algorithms.
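
The rack-level column in the comparison table appears to be the per-GPU column multiplied by 72, with some values rounded. The sketch below checks that reading against a subset of rows. The dictionary layout and function name are hypothetical; the numbers are copied from the table.

```python
GPUS_PER_RACK = 72

# metric: (per-GPU value, published NVL72 value, unit), copied from the table.
SPECS = {
    "NVFP4 inference": (50.0, 3600.0, "PFLOPS"),
    "NVFP4 training": (35.0, 2520.0, "PFLOPS"),
    "FP8/FP6 training": (17.5, 1260.0, "PFLOPS"),
    "FP32": (130.0, 9360.0, "TFLOPS"),
    "FP64": (33.0, 2400.0, "TFLOPS"),
    "HBM4 capacity": (0.288, 20.7, "TB"),
    "HBM4 bandwidth": (22.0, 1580.0, "TB/s"),
    "NVLink bandwidth": (3.6, 260.0, "TB/s"),
}


def relative_gap(per_gpu: float, rack: float, gpus: int = GPUS_PER_RACK) -> float:
    """Relative difference between per_gpu * gpus and the published rack value."""
    return abs(per_gpu * gpus - rack) / rack


if __name__ == "__main__":
    for name, (per_gpu, rack, unit) in SPECS.items():
        scaled = per_gpu * GPUS_PER_RACK
        gap = relative_gap(per_gpu, rack)
        print(f"{name:18s} {scaled:10.1f} vs {rack:10.1f} {unit:7s} gap {gap:.2%}")
```

Small non-zero gaps (for example on FP64 and the bandwidth rows) are consistent with rounding in the published figures rather than a different scaling rule.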

NVIDIA HGX Rubin NVL8

Supercharging AI and high-performance computing for every data center.
The NVIDIA HGX™ Rubin NVL8 integrates eight NVIDIA Rubin GPUs with sixth-generation high-speed NVLink interconnects to propel the data center into a new era of accelerated computing and generative AI.

NVIDIA Rubin NVL8 Specifications

HGX Rubin NVL8*
Form Factor: 8x NVIDIA Rubin SXM
NVFP4 Inference: 400 PFLOPS
NVFP4 Training: 280 PFLOPS
FP8/FP6 Training: 140 PFLOPS
INT8 Tensor Core: 2 POPS
FP16/BF16 Tensor Core: 32 PFLOPS
TF32 Tensor Core: 16 PFLOPS
FP32: 1,040 TFLOPS
FP64 / FP64 Tensor Core: 264 TFLOPS
FP32 SGEMM | FP64 DGEMM: 3,200 TFLOPS | 1,600 TFLOPS
Total Memory: 2.3 TB
NVIDIA NVLink: Sixth generation
NVIDIA NVLink Switch: NVLink 6 Switch
NVLink GPU-to-GPU Bandwidth: 3.6 TB/s
Total NVLink Switch Bandwidth: 28.8 TB/s
Networking Bandwidth: 1.6 TB/s

1. Preliminary information. All values are "up to" (maximum) figures and are subject to change.
2. Dense specification.
3. Peak performance using Tensor Core-based emulation algorithms.
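
The two NVLink rows above are related by the GPU count: 8 GPUs at 3.6 TB/s each gives the 28.8 TB/s total. The sketch below shows that arithmetic, plus an idealized lower bound on the time to move a given amount of data over one GPU's NVLink. Function names are hypothetical, and the bound ignores protocol overhead, topology, and whether the peak figure is bidirectional.

```python
GPUS = 8
NVLINK_PER_GPU_TBPS = 3.6   # "NVLink GPU-to-GPU Bandwidth"
NVLINK_TOTAL_TBPS = 28.8    # "Total NVLink Switch Bandwidth"


def aggregate_bandwidth(per_gpu_tbps: float, gpus: int = GPUS) -> float:
    """Sum of per-GPU NVLink bandwidth across all GPUs, in TB/s."""
    return per_gpu_tbps * gpus


def ideal_transfer_seconds(size_tb: float, bandwidth_tbps: float) -> float:
    """Lower bound on transfer time: data size divided by peak bandwidth."""
    if bandwidth_tbps <= 0:
        raise ValueError("bandwidth_tbps must be positive")
    return size_tb / bandwidth_tbps


if __name__ == "__main__":
    total = aggregate_bandwidth(NVLINK_PER_GPU_TBPS)
    print(f"aggregate NVLink bandwidth: {total:.1f} TB/s")
    # Example payload: one GPU's 288 GB of HBM4, expressed in TB.
    seconds = ideal_transfer_seconds(0.288, NVLINK_PER_GPU_TBPS)
    print(f"ideal time to move 288 GB over one GPU's NVLink: {seconds * 1000:.0f} ms")
```

Real transfers will take longer than this bound, so it is mainly useful for ruling out designs that cannot work even at peak rates.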

Ready to build your next-generation AI Factory?

Talk to AMAX about deploying NVIDIA Vera Rubin Solutions