1、Technical journey to develop an exploit the advantages of an all-photonic switch for AI/ML cluster designs.Re-Configurable Photonic Interconnects for AI and MLWilliam Koss,CEO of Drut TechnologiesRe-Configurable Photonic Interconnects for AI and MLSPECIAL FOCUS:OPTICSOPTICSA scalable photonic fabric
2、 for GPU-to-GPU connectivity under software control is an alterative architecture to legacy electrical packet switched networks that never have enough buffers,bandwidth and accelerators to overcome congestion controlsAll photonic fabrics provide significant advantages in operating costs,power consum
3、ption,latency,resource efficiency,network hop reduction and multi-vendor opennessBy using a system level architecture designed to exploit an all-optical fabric,a far better overall TCO for accelerator era can be realized by providing dynamic reconfigurability,finer fidelity of resource upgrades and
4、workload resource slicingWHY TRANSITION TO A PHOTONIC FABRICDynamic set of wires that can be changed through software-ReconfigurabilityData rate/format agnostic Extensible across multiple generations long useful lifeLow latency/low power per portLarge,extensible radix-flat network fabric,fewer layer
5、sSlower switching time,but offset by software control and fabric topology slicing150-175W optical switches versus 1500-1700W electrical packet switchesSupports multiple generations of future fabric speedsDeterministic traffic patterns Users fit the fabric around the workload,rather than the workload
6、 trying to fit around the fabricPhotonic Fabric versus Electrical Packet SwitchedDYNAMIC PHOTONIC INTERCONNECTS NEEDEDAI(Tensorflow)such as Large Language Model(LLM)creation need to move large amounts of data between GPUsGPU to CPU memory swap due to GPU memory page sizesML models such as BBN,Isolat