《GPU 在向量搜索中的应用.pdf》由会员分享,可在线阅读,更多相关《GPU 在向量搜索中的应用.pdf(25页珍藏版)》请在三个皮匠报告上搜索。
1、GPU在向量搜索中的应用金海ZillizZILLI2820207#page#Speaker bio金海ZILLI2R&D DirectorProject LeaderCareer historyEducation2020 Milvus.All righhts reserved#page#Unlock the treasure of unstructured dataAlalgorithms transform image,video,voice,naturallanguageinto vectors,and enable understandingandutilization ofunstru
2、ctured data at scale.mUnstructured dataDeep learning modelsEmbedding vectorsKnowledge,insight,$#page#The flow-based Al applicationsVideoThe most popularwayVoiceExtractExtractFlexibleframcsmodeltagsEasytocomposeweb-based UISample pipelinesImageThe challengeData fragmentationVsual modelVGG.gEmbeddings
3、EmbeddingsAttributesVisualVoice2020 Milvus.All righhts reserved#page#The unstructured data service (UDS) for AlUnstructured Dataimage,video,voice,naturallanguageSearchInsertModel Inference RuntimeInference LayerStoreTensorRT.ONNX RT.TFRT.ctc.InsertSearchMilvusObject URIObjectVectorsAttributesData Se
4、rvice LayerStorageHigh dense +SparsexperimentalVector IDMultimodalScoringon roadmapon roadmapOutputResult Setimage,video,voice,natural language#page#Milvus:The big pictureQueryProcessing EngineBuffer PoolSchedulerANNSCollaborative QuenIndexM-FAISS,Mi-Annoytag/structureddataSDK/ Web APIResultFiestop-
5、K resultReducerScoringMultimodalapp specificSegmentSegmentMetadataquery objSelectioninsert objX86:supporisSSE4.2,AVX2,AVX512GPU:Pareorlaler.CUDA10.0imieroarehlX86ArmrlateNewIndexArm:requirosaarch64IndexFilesFileGPUOtherVarious ProcessorsStorage Tier#page#Vectors are differentNumbersVectorsArithmetic
6、 operationSimilarity(cg.Euclidcan distance)-+d(4B)=a-a)OperatiSimilaritycomparisonNumber comparisonTopK(A)=argmin(d(AB)91-106-101-5123891045672020 Milvus.All rights reserve#page#Faiss-GPUTraining data0.32,0.21.0.510.52,0.11.0.60IVF Flatf.0.91,0.33.0.13CentroidsClustersGPUVector data0.32,0.21.0.510.5