《Lakeflow 声明式管道的最佳性能和成本优化.pdf》由会员分享,可在线阅读,更多相关《Lakeflow 声明式管道的最佳性能和成本优化.pdf(59页珍藏版)》请在三个皮匠报告上搜索。
1、Top Performance&Cost OptimizationsFor Lakeflow DLTSteven YuPrincipal Solutions Architect,DatabricksJune 2025Forward-looking StatementThis presentation has been prepared for informational purposes only.The information set forth herein does not purport to be complete or contain all relevant informatio
2、n.Statements contained herein are made as of the date of this presentation unless stated otherwise.This presentation and the accompanying oral commentary may contain forward-looking statements.In some cases,forward-looking statements can be identified by terms such as“may”,“will”,“should”,“expects”,
3、“plans”,“anticipates”,“could”,“intends”,“projects”,“believes”,“estimates”,“predicts”,or“continue”,or the negative of these words or other similar terms or expressions that concern Databricks expectations,strategy,plans,or intentions.Forward-looking statements are based on information available at th
4、e time those statements are made and are inherently subject to risks and uncertainties that could cause actual results to differ materially from those expressed in or suggested by the forward-looking statements.Forward-looking statements should not be read as a guarantee of future performance or out
5、comes.Except as required by law,Databricks does not undertake any obligation to publicly update or revise any forward-looking statement,whether as a result of new information,future developments or otherwise.3I wanna go fast.-Ricky Bobby(Will Ferrell)AgendaKey Factors that Affect PerformanceCost Opt
6、imization TradeoffsWhats built into Lakeflow DLT(including Serverless!)Now is your chance to escapePerformanceKey Factors that AffectBack to BasicsUnderstanding the WorkloadBack to BasicsStep 1:Parallelize the Readspark.sql.files.maxPartitionBytesReading the DataBack to BasicsStep 2:Do Important Thi