English SDK for Apache Spark
Boosting Development with LLMs
Gengliang Wang, Allison Wang, Amanda Liu

About Us
The Spark team at
- Gengliang Wang (GitHub: gengliangwang)
- Allison Wang (GitHub: allisonwang-db)
- Amanda Liu (GitHub: asl3)

Why We're Excited
01 Story Behind English SDK
02 SDK Features (With Demo)
03 Future Work

Story Behind English SDK
- 100+ data sources
- 1+ billion annual downloads
- 100K+ Stack Overflow questions
- 40K+ commits
- 3600+ GitHub PR authors
- 208 countries and regions downloaded PySpark in 2022

Apache Spark: Power & Complexity
- Apache Spark: a robust analytics engine for large-scale data processing.
- Its rich feature set provides great capability, but takes time to master.
- Print length: 951 pages

LLMs and Apache Spark: A Powerful Synergy
- LLMs have extensive resources from which to learn Apache Spark:
  - Over 37,000 commits on GitHub
  - Over 120,000 questions on Stack Overflow
- LLMs understand Apache Spark.

The Challenge for Spark Development
GitHub Copilot
- Requires understanding of complex code.
- Limited to editors; not usable in notebooks.
- Suggestions for Spark development can be inconsistent.
- ERROR: A column or function parameter with name dept_id cannot be resolved

LangChain
- Facilitates the creation of LLM-powered applications.
- Challenges for Spark development: returns strings instead of PySpark objects like DataFrame, so code integration is less seamless.

Can we have an easy-to-use tool that integrates seamlessly with Spark?
What if we use English as code?

English as code
Design Elements: Integration
Make AI the chauffeur and we take the luxury backseat, instead of AI as the cop