NVIDIA Releases Pascal GP100 GPU Die Shot at Hot Chips
NVIDIA's GP100 GPU is a big, green monster of a chip.
NVIDIA has just released the first die shot of their biggest chip to date, the Pascal-based GP100 GPU. Unveiled back at GTC 2016, the GP100 GPU powers NVIDIA's fastest hyperscale accelerator, the Tesla P100. The chip is the first to use the HBM2 and NVLink interfaces, which deliver increased bandwidth and interconnect speeds.
NVIDIA GP100 GPU Die Shot Released at Hot Chips - The Biggest FinFET Product To Date Featuring HBM2
The GP100 is the big daddy of the Pascal GPU lineup. In fact, this is the only chip in the lineup that hasn't made its way to consumers yet, because it is solely dedicated to the HPC market. NVIDIA has been shipping GP100-based Tesla P100 units to the data center market since June 2016. The GPU is specifically designed to handle tasks for HPC environments and comes with a range of features that aren't available to consumers.
Before we get into the technical details of Pascal GP100, let's take a look at the fabulous shot of the die which was posted by Anandtech. The chip is visibly very dense in design. The die houses a total of 15.3 billion transistors. The NVLink interface is situated on the right side of the die, while four 1024-bit buses can be found on the top and bottom. The chip as a whole measures 610mm², and that's without including the HBM2 memory on the same package.
The NVIDIA GP100 GPU Technical Details - A Recap of The Big Green Pascal Chip
Like previous Tesla GPUs, GP100 is composed of an array of Graphics Processing Clusters (GPCs), Streaming Multiprocessors (SMs), and memory controllers. The chip achieves its colossal throughput by providing six GPCs, up to 60 SMs, and eight 512-bit memory controllers (4096 bits total).
The Pascal architecture's computational prowess is more than just brute force: it increases performance not only by adding more SMs than previous GPUs, but also by making each SM more efficient. Each SM has 64 CUDA cores and four texture units, for a total of 3840 CUDA cores and 240 texture units. These SMs are arranged into 30 TPCs, each comprising two SMs.
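The headline totals above fall straight out of the hierarchy. A quick sketch of the arithmetic (the per-GPC SM count is inferred from 60 SMs across six GPCs; the rest of the figures are from the article):

```python
# GP100 unit hierarchy: GPCs -> TPCs -> SMs -> CUDA cores / texture units.
GPCS = 6
SMS_PER_GPC = 10           # inferred: 60 SMs total / 6 GPCs
CUDA_CORES_PER_SM = 64
TEXTURE_UNITS_PER_SM = 4
SMS_PER_TPC = 2

total_sms = GPCS * SMS_PER_GPC                  # 60 SMs
total_cores = total_sms * CUDA_CORES_PER_SM     # 3840 CUDA cores
total_tex = total_sms * TEXTURE_UNITS_PER_SM    # 240 texture units
total_tpcs = total_sms // SMS_PER_TPC           # 30 TPCs

print(total_sms, total_cores, total_tex, total_tpcs)  # 60 3840 240 30
```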
The block diagram of NVIDIA'south GP100 supercomputing chip.
Because of the importance of high-precision computation for technical computing and HPC codes, a key design goal for Tesla P100 is high double-precision performance. Each GP100 SM has 32 FP64 units, providing a 2:1 ratio of single- to double-precision throughput. Compared to the 3:1 ratio in Kepler GK110 GPUs, this allows Tesla P100 to process FP64 workloads more efficiently.
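The 2:1 ratio translates directly into peak-throughput figures. A back-of-the-envelope sketch, using the shipping Tesla P100's enabled core count (3584 of the die's 3840) and its ~1.48 GHz boost clock, both of which are the retail product's specs rather than numbers quoted in this article:

```python
# Peak FLOPs estimate: each CUDA core does one FMA (2 ops) per cycle.
CUDA_CORES = 3584          # cores enabled on the shipping Tesla P100
BOOST_CLOCK_GHZ = 1.48     # assumed boost clock of the retail part
FP32_TO_FP64_RATIO = 2     # GP100's 2:1 single- to double-precision ratio

fp32_tflops = 2 * CUDA_CORES * BOOST_CLOCK_GHZ / 1000
fp64_tflops = fp32_tflops / FP32_TO_FP64_RATIO

print(round(fp32_tflops, 1), round(fp64_tflops, 1))  # ~10.6 and ~5.3 TFLOPs
```

With all 3840 cores enabled and a slightly higher clock, the same formula lands near the ~12/~6 TFLOPs ballpark quoted in the table below.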
A close-up shot of the NVIDIA Pascal chip alongside the HBM2 packages. (Image Credits: Anandtech)
The GPU also packs four stacks of HBM2 memory. The total VRAM featured on this chip is 16 GB, which will be upgraded to 32 GB once HBM2 hits volume production in 2017. The chip features 720 GB/s of bandwidth. For more information and details on the GP100 GPU, you can read our article here. You can also find performance benchmarks of the GP100 GPU here.
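The 720 GB/s figure follows from the four 1024-bit HBM2 stacks. A minimal sketch, assuming an effective per-pin data rate of 1.4 Gb/s (the rate that reproduces the quoted number; the bus width comes straight from the article):

```python
# Memory bandwidth = aggregate bus width x per-pin data rate.
STACKS = 4
BUS_BITS_PER_STACK = 1024  # each HBM2 stack has a 1024-bit interface
DATA_RATE_GBPS = 1.4       # assumed effective Gb/s per pin

total_bus_bits = STACKS * BUS_BITS_PER_STACK          # 4096-bit bus
bandwidth_gbs = total_bus_bits * DATA_RATE_GBPS / 8   # bits -> bytes

print(bandwidth_gbs)  # 716.8, marketed as ~720 GB/s
```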
GPU Architecture | NVIDIA Fermi | NVIDIA Kepler | NVIDIA Maxwell | NVIDIA Pascal |
---|---|---|---|---|
GPU Process | 40nm | 28nm | 28nm | 16nm (TSMC FinFET) |
Flagship Chip | GF110 | GK210 | GM200 | GP100 |
GPU Design | SM (Streaming Multiprocessor) | SMX (Streaming Multiprocessor) | SMM (Streaming Multiprocessor Maxwell) | SMP (Streaming Multiprocessor Pascal) |
Maximum Transistors | 3.00 Billion | 7.08 Billion | 8.00 Billion | 15.3 Billion |
Maximum Die Size | 520mm² | 561mm² | 601mm² | 610mm² |
Stream Processors Per Compute Unit | 32 SPs | 192 SPs | 128 SPs | 64 SPs |
Maximum CUDA Cores | 512 CCs (16 CUs) | 2880 CCs (15 CUs) | 3072 CCs (24 CUs) | 3840 CCs (60 CUs) |
FP32 Compute | 1.33 TFLOPs (Tesla) | 5.10 TFLOPs (Tesla) | 6.10 TFLOPs (Tesla) | ~12 TFLOPs (Tesla) |
FP64 Compute | 0.66 TFLOPs (Tesla) | 1.43 TFLOPs (Tesla) | 0.20 TFLOPs (Tesla) | ~6 TFLOPs (Tesla) |
Maximum VRAM | 1.5 GB GDDR5 | 6 GB GDDR5 | 12 GB GDDR5 | 16 / 32 GB HBM2 |
Maximum Bandwidth | 192 GB/s | 336 GB/s | 336 GB/s | 720 GB/s - 1 TB/s |
Maximum TDP | 244W | 250W | 250W | 300W |
Launch Year | 2010 (GTX 580) | 2014 (GTX Titan Black) | 2015 (GTX Titan X) | 2016 |
Source: https://wccftech.com/nvidia-pascal-gp100-gpu-hot-chips/