AMD Instinct MI300X Generative AI Accelerator and Platform Architecture
Alan Smith, Sr. Fellow, Instinct Lead SoC Architect
Vamsi Alla, Fellow, Instinct Chief Engineer
Hot Chips 2024

2 | AMD INSTINCT MI300X GENERATIVE AI ACCELERATOR AND PLATFORM ARCHITECTURE | AUGUST 2024

Agenda
AMD Instinct MI300X Accelerator Overview
AMD CDNA 3 Architecture
Memory System Overview
Spatial Partitioning
4th Gen Infinity Architecture
System Architecture
AMD Instinct MI300X Platform
Application Performance

The AMD Instinct Accelerator Journey
Multiple generations of architecture focused on advancing HPC & AI compute

MI100, AMD CDNA: ECOSYSTEM GROWTH
First purpose-built GPU architecture to accelerate FP64 and FP32 HPC workloads

MI200, AMD CDNA 2: DRIVING HPC AND AI TO A NEW FRONTIER
Denser compute architecture with leading memory capacity/bandwidth

MI300 (MI300A, MI300X), AMD CDNA 3: DATA CENTER APU & DISCRETE GPU
Focused improvements on unified memory, AI data format performance, and in-node networking

Timeline axis: 2020 to 2023

AMD Instinct MI300X Multi-chiplet Accelerator
153 Billion Transistors in TSMC 5nm | 6nm FinFET
4th Generation Infinity Fabric: 896 GB/s
Infinity Fabric AP: 6 TB/s bisection
Infinity Fabric Advanced Package (AP): 4.8 TB/s bisection

[Block diagram. External links labeled: 7 x Infinity Fabric link, 1 x PCI Express Gen5 link. Repeated XCD blocks, each labeled with four shader engines and L2.]