Most machine learning (ML) engineers develop their models in the single-precision (FP32) datatype. TensorFloat32 (TF32) has recently become popular as a drop-in replacement for these FP32-based models. However, there is a pressing need to deliver additional performance gains for these models by using faster datatypes, such as BFloat16 (BF16), without requiring additional code changes.
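To make the contrast concrete, the sketch below (a minimal example assuming PyTorch on a CUDA device; the flags and `autocast` context shown are standard PyTorch APIs, not the mechanism proposed here) illustrates that TF32 can be enabled globally with no model changes, whereas BF16 today typically requires at least a small code change such as an autocast region.

```python
import torch
import torch.nn as nn

# Illustrative model and input; assumes a CUDA-capable GPU is available.
model = nn.Linear(1024, 1024).cuda()
x = torch.randn(8, 1024, device="cuda")

# TF32 acts as a drop-in replacement for FP32 matmuls/convolutions:
# flipping these global flags requires no changes to the model code.
torch.backends.cuda.matmul.allow_tf32 = True
torch.backends.cudnn.allow_tf32 = True
y_tf32 = model(x)

# BF16, by contrast, usually requires wrapping compute in an autocast
# region, i.e. an explicit (if small) code change.
with torch.autocast(device_type="cuda", dtype=torch.bfloat16):
    y_bf16 = model(x)
```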