Building a Large Language Model Assistant for Enterprise Users
Li Li (李粒), Head of PingCAP AI Lab

Contents
Part 1 - Introduction
Part 2 - First Attempt
Part 3 - Optimization

Part 1: Introduction

Large language model (LLM)
Bringing in private or enterprise data

Knowledge insertion paradigms
Pre-training: build a transformer model with 1 billion to 100 billion parameters
TiDB is an open-source NewSQL database that supports Hybrid Transactional and Analytical Processing (HTAP) workloads. It is MySQL compatible and can provide horizontal scalability, strong consistency, and high availability. It is developed and supported primarily by PingCAP and licensed under Apache 2.0, though it is also available as a paid product. TiDB drew its initial design inspiration from Google's Spanner and F1 papers.
GPU, Dataset, Parallel, Optimizer, RL

Knowledge insertion paradigms
Fine-tuning: fold the knowledge into the weights of the deep neural network
[The same TiDB passage as above]
FFT, PEFT, LoRA

Knowledge insertion paradigms
In-context learning or retrieval-augmented generation: put the context into the prompt
[The same TiDB passage as above]
Prompt
Some facts:
- You are a professional assistant named TiDB Bot which can answer customer questions related to TiDB and TiDB Cloud.
The document fragments:
TiDB is an open-source
Given the context, answer the following questions:
question_from_user

Knowledge insertion paradigms
Category | Data required | Implementation time
Pre-training | 45 TB | At least 3 months
Fine-tuning (Full Fine-Tuning) | More than 100k samples | Days
P