Semiconductor tech by NVIDIA at GTC: GB200 super GPU chip and computational lithography breakthrough
At its grand GTC event held on Mar 18th 2024 in San Jose,US, NVIDIA launched its latest lot more accelerated computing packed new AI GPU processor called NVIDIA GB200 Grace Blackwell Superchip. GB200 packs 208 billion transistors where the Blackwell-architecture GPUs are manufactured using a custom-built 4NP TSMC semiconductor process tech with two-reticle limit GPU dies connected by 10 TB/second chip-to-chip link into a single, unified GPU. NVIDIA GB200 Grace Blackwell Superchip connects two NVIDIA B200 Tensor Core GPUs to the NVIDIA Grace CPU over a 900GB/s ultra-low-power NVLink chip-to-chip interconnect. NVIDA claims the GB200 NVL72 provides up to a 30x performance increase compared to the same number of NVIDIA H100 Tensor Core GPUs for LLM inference workloads, and reduces cost and energy consumption by up to 25x.
Pic: BLACKWELL GPU (source: NVIDIA) Some of the key features include: Higher AI inference: By integrating Micro-tensor scaling and NVIDIA’s advanced dynamic range management algorithms into NVIDIA TensorRT-LLM and NeMo Megatron frameworks, Blackwell to support double the compute and model sizes with new 4-bit floating point AI inference capabilities. Data interconnect speed: The latest iteration of NVIDIA NVLink delivers a record 1.8TB/s bidirectional throughpu...
Pic: BLACKWELL GPU (source: NVIDIA) Some of the key features include: Higher AI inference: By integrating Micro-tensor scaling and NVIDIA’s advanced dynamic range management algorithms into NVIDIA TensorRT-LLM and NeMo Megatron frameworks, Blackwell to support double the compute and model sizes with new 4-bit floating point AI inference capabilities. Data interconnect speed: The latest iteration of NVIDIA NVLink delivers a record 1.8TB/s bidirectional throughpu...
You've read this far — sign in to keep reading

