State Reader API
Craig Lukasik (Sr. Specialist Solutions Architect)
Databricks
2024 Databricks Inc. All rights reserved

State Reader API - DBR 14.3+
spark.read.format("state-metadata") and spark.read.format("statestore")
- New capability to access and analyze Structured Streaming's internal state data.
- Aimed at facilitating development, debugging, and troubleshooting of stateful Structured Streaming workloads.
- To be included in Apache Spark 4.0.0, expected later this year.

Demo
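A minimal sketch of the two readers named on the State Reader API slide. It assumes an existing `spark` session (as provided in a Databricks notebook on DBR 14.3+) and the checkpoint location of a stateful streaming query. The helper names `read_state_metadata` and `read_state_store` and the checkpoint path are hypothetical, chosen only for illustration.

```python
# Sketch only: `spark` is an existing SparkSession (predefined in Databricks
# notebooks) and the checkpoint path below is hypothetical.

def read_state_metadata(spark, checkpoint_path):
    """High-level view: describes the stateful operators recorded in the checkpoint."""
    return spark.read.format("state-metadata").load(checkpoint_path)


def read_state_store(spark, checkpoint_path, operator_id=0):
    """Low-level view: the state rows held by one stateful operator."""
    return (
        spark.read.format("statestore")
        .option("operatorId", operator_id)  # which stateful operator to inspect
        .load(checkpoint_path)
    )


# Example usage in a notebook (hypothetical path):
# checkpoint = "/tmp/checkpoints/my_stream"
# read_state_metadata(spark, checkpoint).show(truncate=False)
# read_state_store(spark, checkpoint, operator_id=0).show(truncate=False)
```

Both helpers return ordinary batch DataFrames, so the state can be filtered, joined, or asserted on in unit tests like any other table.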
Development Challenges Addressed
Debugging complexity and difficulty of proper (unit) testing
- Excessive logging for debugging due to challenges in understanding the state store.
- Slower project progress from difficulties in development.
- Complexity in handling event time leads to unreliable tests.
- Bypassing crucial unit tests due to testing challenges.

Production Challenges Addressed
Troubleshooting complexity, slowing down issue resolution
- Analysts face data inconsistencies and access limitations.
- Time-consuming coding workarounds needed to resolve urgent issues.

state-metadata
High-level API
Column       Description
operatorId   The integer ID of the stateful streaming operator.
opera