当前位置:首页 > 报告详情

为什么可观测性在人工智能应用中更加重要?.pdf

上传人: 竿*** 编号:981794 2025-11-29 45页 3.51MB

1、Why Observability Matters(More!)with AI ApplicationsMonitoring vLLM and Llamastack in Kubernetes,with OpenTelemetry,Prometheus,Tempo,and GrafanaSally OMalleyPrincipal Software EngineerRed Hat“Im Sally OMalley,Principal SWE within Red Hats Emerging Technologies,Office of the CTO.I aim to help custome

2、rs run enterprise AI applications in production,on OpenShift and RHEL,with a focus on observability.”3Where were atObservability for LLMs4Reliable&Transparent:LLMLLMs are moving from research labs into business-critical enterprise applicationsIncrease of LLM use-casesOptimize PerformanceObservabilit

3、y for LLMs5LLMs must run not just efficiently,but reliably and with full transparency into their runtime behaviourReliable&Transparent:LLMLLMs are moving from research labs into business-critical enterprise applicationsIncrease of LLM use-casesOptimize PerformanceComplex PipelinesObservability for L

4、LMs6LLMs must run not just efficiently,but reliably and with full transparency into their runtime behaviourReliable&Transparent:LLMLLMs are moving from research labs into business-critical enterprise applicationsIncrease of LLM use-casesOptimize PerformanceComplex PipelinesEssential to debug multipl

5、e components and phases:retrieval,prompting,generation,distributed inferenceObservability for LLMs7LLMs must run not just efficiently,but reliably and with full transparency into their runtime behaviourReliable&Transparent:LLMLLMs are moving from research labs into business-critical enterprise appli

6、cationsIncrease of LLM use-casesOptimize PerformanceComplex PipelinesEssential to debug multiple components and phases:retrieval,prompting,generation,distributed inferenceTracking GPU usage,token throughput,and inference latency for fast and responsive outputWhy,How,What,Where?Why do LLMs pose uniqu

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据《Why Observability Matters(More!) with AI Applications》的内容,以下是全文关键点的概括: 1. **LLM的重要性**:大型语言模型(LLM)正从研究实验室转向企业级应用,其使用案例增加,需要高效、可靠和透明的运行。 2. **可观测性挑战**:LLM的复杂管道需要全面的可观测性来调试检索、提示、生成和分布式推理等组件。 3. **可观测性解决方案**:使用开源工具如Prometheus、OpenTelemetry、Tempo和Grafana构建可观测性栈。 4. **监控内容**:监控LLM的性能、质量和成本,包括响应时间、准确性、资源利用等。 5. **部署示例**:通过llm-d-deployer部署vLLM,使用vLLM远程后端部署Llama Stack。 6. **资源**:提供MiniKube设置、llm-d、llm-d-deployer、Llama Stack等资源链接。
"LLM可观测性挑战" "开源监控栈部署" "监控vLLM与Llama Stack"
客服
商务合作
小程序
服务号
折叠