LanceDB:用于服务生产规模 AI 应用的完整搜索和分析存储.pdf

编号:718706 PDF 21页 963.59KB 下载积分:VIP专享
下载报告请您先登录!

LanceDB:用于服务生产规模 AI 应用的完整搜索和分析存储.pdf

1、Lakehouse Architecture for AI DataSearch,Analytics,Processing,TrainingChang She and Zhidong Qu2025-06-10Chang SheChang SheCEO/CofounderZhidong QuZhidong QuSr Software EngineerWho we areCEO/Co-founder,LanceDBco-author of pandas2 decades building data toolsBuilding LanceDB for all the AI data that doe

2、snt fit neatly into pandas dataframesSr.Software Engineer,DatabricksFounding engineer-Mosaic AI Vector Search and Feature StoreProject Lead-Storage Optimized Mosaic AI Vector SearchChang SheZhidong(Zero)Qu3We Multimodal Lakehouse for AI dataNew data infrastructure challengesLance formatDiverse workl

3、oads:analytics,processing,training4What modern architecture for AI data infrastructure looks likeStorage-optimized Mosaic AI Vector SearchCloud-native Vector Search at Massive Scale using Lakehouse ArchitectureChallenges with first-gen Vector DBsCoupled Storage&ComputeCompute nodes and disk attached

4、 to them act as persistent storage layerStateful systemDifficult to operate at scaleVector indexes are memory residentFull precision embeddings are huge!Extremely high serving costScatter-gather queriesInherited from traditional search architectureVector indexes are coupled with immutable data fragm

5、entsQuery performance drops significantly as number of data segments scaleMerging segments involves expensive operation to rebuild the index6Cloud-native Vector SearchDecoupled Storage&ComputeVector indexes and raw data fragments live in durable cloud object storageQuery nodes are stateless and only

6、 cache data in local SSD/memory when neededDecoupled Ingestion&Serving ComputeIngestion runs on a fully distributed,in-house vector indexing engine built on SparkQuery runs on lightweight Rust serversCloud-native Vector Search ArchitectureUnparallel scalability at far lower cost by leveraging cloud

友情提示

1、下载报告失败解决办法
2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
4、本站报告下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。

本文(LanceDB:用于服务生产规模 AI 应用的完整搜索和分析存储.pdf)为本站 (Flechazo) 主动上传,三个皮匠报告文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知三个皮匠报告文库(点击联系客服),我们立即给予删除!

温馨提示:如果因为网速或其他原因下载失败请重新下载,重复下载不扣分。
客服
商务合作
小程序
服务号
折叠