《使用 NVIDIA Lepton on OCI 释放您的 GPU 性能 [LRN3343].pdf》由会员分享,可在线阅读,更多相关《使用 NVIDIA Lepton on OCI 释放您的 GPU 性能 [LRN3343].pdf(13页珍藏版)》请在三个皮匠报告上搜索。
1、Oracle AI World 2025Jake Bloom(NVIDIA)DGX Cloud Product ManagementTaylor Newill(Oracle)OCI Strategic InitiativesAI Developers ChallengesScaling workloads across multiple regions and clouds is complexChallenging to discover GPUs based on region,cost,and performanceAdministration across multiple Cloud
2、s is hardFragmented Experience for developing,customizing and deploying appsDGX Cloud LeptonConnecting Developers to GPU Compute An AI platform that connects developers with GPU compute across a network of cloud providers Access Global GPU ComputePlatform for Developing,Customizing&Deploying Applica
3、tionsNVIDIA CONFIDENTIAL.DO NOT DISTRIBUTE.InferenceDeveloper ToolsTrainingDesigned for Developers Unified experience to easily access all services Train and Scale Across Clouds With The DGX Cloud Lepton AI PlatformDevelopment Run interactive development sessions,SSH,Jupyter notebooks,VS CodeInferen
4、ce and Endpoints Fast and scalable inference across multiple clusters and regions powered by NVIDIA Cloud Functions(NVCF).Easily create NVIDIA NIM endpoints Training and Fine Tuning Run distributed training or batch processing jobs,with high performance interconnects and accelerated storageGPU-Bare
5、Metal or VM InstanceNetworkingNVIDIA Cloud Partners StorageNVIDIA AI EnterpriseAI&Data Science Development&Deployment ToolsDGX Cloud Lepton AI StackBatch JobsAI Infrastructure ManagementGPU Cloud ProvidersDev PodsEndpointsHealth MonitoringObservabilityResilience Compute Resource Management The DGX C
6、loud Lepton stackEnterprise grade stack to support your AI workloadsBYO ComputeNvidia Managed CapacityOn-Demand*Monitor SystemMonitor overall system metrics(CPU,memory,disk).Monitor MetricsMonitor critical GPU and GPU fabric metrics(power,temperature)Report StatusReports GPU and GPU fabric status(nv