Driving vLLM Inference Performance on Intel Xeon
Presenters: Tony Wu, Jiang Li
Intel China
April 2024

LEGAL DISCLAIMERS

Intel technologies' features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at .

For more complete information about performance and benchmark results, visit and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.

For more complete information visit document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit

Cost reduction scenarios described are intended as examples of how a given Intel-based product, in the specified circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction.

Results have been estimated or simulated using internal Intel analysis or architecture simulation or modeling, and provided to you for informational purposes. Any differen