当前位置:首页 > 报告详情

Learning Substructure Invariance for Out-of-Distribution Molecular Representations.pdf

上传人: 2*** 编号:144890 2023-10-28 25页 2.83MB

1、DataFunSummitDataFunSummit#20232023Learning Substructure Invariance for Out-of-Distribution Molecular Representations Presented by Nianzu Yang,PhD candidate SJTU-ReThinkLabFormulation:denotes the support of environments,is the prediction model and represents a loss function.The risk function under a

2、 given environment e:Background-OoD1MoleOODNianzu Yang Out-of-Distribution Generalization:Assume that there is a potential environment variable accounting for the distribution shift between the training and testing data.In general cases the goal is to predict the target label given the associated in

3、put .Background-Invariant Learning2MoleOODNianzu Yang Invariant Learning is an emerging line for solving the OOD generalization problem.These methods propose to find an invariant predictor that could uncover invariant relationships between inputs and targets across all environments.The invariant pre

4、dictor aims to learn an invariant representation satisfying such a invariance principle.Invariance Principle:1)sufficiency:shows sufficient predictive power for the target2)invariance:contributes to equal performance for the downstream tasks across all environmentsA molecular graph can be represente

5、d as ,where is the graphs node set corresponding to atoms constituting the molecule and denotes the graphs edge sets corresponding to chemical bonds.Background-MRL3MoleOODNianzu Yang Molecular Representation Learning(MRL)aims at embedding a molecule into a vector in latent space as a foundation mode

6、l,on top of which the learned representations could be used for a variety of downstream tasks.SMILES-based methodsStructure-based methodsOoD Molecular Represention Learning4MoleOODNianzu Yang OOD General Formulation:OoD on MRL:Motivating Examples5MoleOODNianzu YangKey Observation:the(bio)chemical pr

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
本文介绍了在分子表示学习领域中的两项研究。第一项研究关注于学习具有不变性的分子表示,以应对训练和测试数据之间的分布偏移。研究者提出了一种变分推断方法,通过最小化不同环境中的风险,学习一个不变的预测器。理论定理证明了该方法可以使得分子在所有环境中的下游任务表现平等。实验结果表明,该方法在四个现实世界数据集上取得了显著的性能提升。 第二项研究针对药物组合推荐问题,提出了一种考虑药物分子结构信息的 substructure-aware 方法。研究者引入了一个自适应权重调整策略,以处理药物推荐中的约束优化问题,同时考虑准确性和安全性。实验表明,该方法在 MIMIC-III 数据集上,无论是准确性还是安全性,都显著优于现有方法。
"分子表示如何应对环境变化?" "药物组合推荐如何确保准确性和安全性?" "如何通过不变学习解决分子表示的分布偏移问题?"
客服
商务合作
小程序
服务号
折叠