《扩展数据工程管道:为机器学习准备信用卡交易数据.pdf》由会员分享,可在线阅读,更多相关《扩展数据工程管道:为机器学习准备信用卡交易数据.pdf(27页珍藏版)》请在三个皮匠报告上搜索。
1、Scaling Data Engineering PipelinesPreparing Credit Card Transactions for Machine LearningBrandon DeShonLuke GarziaForward-looking StatementThis presentation has been prepared for informational purposes only.The information set forth herein does not purport to be complete or contain all relevant info
2、rmation.Statements contained herein are made as of the date of this presentation unless stated otherwise.This presentation and the accompanying oral commentary may contain forward-looking statements.In some cases,forward-looking statements can be identified by terms such as“may”,“will”,“should”,“exp
3、ects”,“plans”,“anticipates”,“could”,“intends”,“projects”,“believes”,“estimates”,“predicts”,or“continue”,or the negative of these words or other similar terms or expressions that concern Databricks expectations,strategy,plans,or intentions.Forward-looking statements are based on information available
4、 at the time those statements are made and are inherently subject to risks and uncertainties that could cause actual results to differ materially from those expressed in or suggested by the forward-looking statements.Forward-looking statements should not be read as a guarantee of future performance
5、or outcomes.Except as required by law,Databricks does not undertake any obligation to publicly update or revise any forward-looking statement,whether as a result of new information,future developments or otherwise.3Everything works-until you scale.Focus of TalkScaling:Optimize+Elastic ComputeCluster
6、 TuneCode OptimizationDelta LakeFor EachEnabling Petabyte Analytics with Delta LakeFirst a Little StoryPreparing for Croatia tripMessi jersey!Where is my Messi jersey?-My sonOur Problem:oTouch every draweroEvery draw has mix of clothingoEvery piece of clothing needs touched7Our Solution-CubesOrganiz