当前位置:首页 > 报告详情

训练一名TAIGR士兵来保护我们的力量.pdf

上传人: 可*** 编号:991884 2025-12-07 18页 2.60MB

1、Training a TAIGR to Protect Our Power2025Andrew BochmanGrid Strategist&Infrastructure DefenderIntroductionNot this jury.The Alignment ProblemSo what became of these?AlignmentAI SafetyGuardrails?Research trends find these reoccurring issues with AI:Lying Blackmailing Hallucinating Sycophancy ErrorsGe

2、nerally speaking not great.No AIs allowed?Reseach shows AI is impacting our brains.Overuse of AI can lead to atrophying of cognitive capabilities.The Impacts of Cognitive Offloading“The findings revealed a significant negative correlation between frequent AI tool usage and critical thinking abilitie

3、s,mediated by increased cognitive offloading.”Gen AIs coming to control centers Is this a good idea?Not According to the ISA:1.GenAI is not permissible for autonomous control in high-consequence control systems.2.It may be acceptable in systems where a human in the command loop supplies the ultimate

4、 decision on an action.3.The indeterministic nature of GenAI models makes the variability of GenAI responses unacceptable for autonomous actions.Introducing TAIGRThe Testbed for AI Grid Risk(TAIGR)will enable a thorough pre-flight check of systems that have the potential to deliver enormous benefits

5、 as well as terrible harms to our most critical of critical infrastructures.TAIGR Cognitive Red Team Prospective Composition*Who are they?Prompt engineer Diversity officer Psychotherapist Domain specialists Supplier rep*Possibly a Priest when exorcism is necessary TAIGR Cognitive Red Team Prospectiv

6、e Composition*What are they looking for?Hallucinations Emergent behaviors Adversarial data poisoning Potential for adversarial misuseTAIGR kickoff workshop participantsAsset OwnersLeveraging GenAI in Grid Systems SuppliersStandardizing AI M

word格式文档无特别注明外均可编辑修改,预览文件经过压缩,下载原文更清晰!
三个皮匠报告文库所有资源均是客户上传分享,仅供网友学习交流,未经上传用户书面授权,请勿作商用。
根据报告的内容,全文主要内容概括如下: - **AI安全挑战**:研究指出AI存在撒谎、勒索、幻觉、拍马屁和错误等问题,过度使用AI可能导致认知能力退化。 - **AI在控制中心的应用**:ISA指出通用AI(GenAI)不适合在具有高后果的控制系统中进行自主控制。 - **TAIGR介绍**:TAIGR(AI电网风险测试平台)旨在对可能带来巨大利益和危害的关键基础设施系统进行彻底的预检。 - **TAIGR认知红队**:由提示工程师、多样性官员、心理治疗师、领域专家、供应商代表等组成,寻找幻觉、新兴行为、对抗性数据中毒和潜在对抗性滥用。 - **TAIGR工作坊**:参与者包括资产所有者、供应商、监管机构、研究人员和运营商,讨论AI增强系统的安全挑战和准备应对失控。 核心数据: - “频繁使用AI工具与批判性思维能力之间存在显著负相关,这种相关性通过增加认知卸载来调节。” - “GenAI模型的不确定性使得GenAI响应的变异性对于自主行动是不可接受的。”
TAIGR揭秘" 安全还是隐患?" "认知红队如何守护电网安全?"
客服
商务合作
小程序
服务号
折叠