《Apache Spark™ 流式处理和增量实时表加速毕马威客户实时物联网洞察.pdf》由会员分享,可在线阅读,更多相关《Apache Spark™ 流式处理和增量实时表加速毕马威客户实时物联网洞察.pdf(26页珍藏版)》请在三个皮匠报告上搜索。
1、Spark Streaming and Delta Live tables accelerates KPMG clients for real time IoT insightsMacGregor WinegardAssociate,Data EngineerJune 20232 2023 KPMG LLP,a Delaware limited liability partnership and a member firm of the KPMG global organization of independent member firms affiliated with KPMG Inter
2、national Limited,a private English company limited by guarantee.All rights reserved.NDP460732-4B Databricks Certified Data Engineer with experience orchestrating cloud infrastructure and building data pipelines Passion for writing reliable,efficient and elegant code Integrated S&P Global Data with i
3、nternal Sustainable Finance Disclosure Regulation tool using Delta Sharing Architected Terraform Implementation of Energy Reporting tool in Databricks that feeds over 2 billion rows into a Power BI Dashboard Led Databricks development of market forecasting tool to develop models of firm revenueWith
4、You TodayI understand the importance of the data architecture and support Databricks development effortsMacGregor WinegardAssociate,Data EngineerKPMG3 2023 KPMG LLP,a Delaware limited liability partnership and a member firm of the KPMG global organization of independent member firms affiliated with
5、KPMG International Limited,a private English company limited by guarantee.All rights reserved.NDP460732-4BWhy do you need a modern data platform?Poor data qualityIssues in scalingIncorrect integrationReal-time dataLack of expertiseSecurity and privacyOrganizational resistanceData verificationHeavy e
6、xpensesWide technologies varietyModern layered architecture3Business vision for data value1Commitment to change journey6Industrialized delivery5Enterprise data management2Data governance and literacy4Key success factors for data modernizationDataChallenges4 2023 KPMG LLP,a Delaware limited liability