1、Fashion-forward,security firstZalandos cybersecurity strategy for AI developmentFlorence MottayCISO ZalandoStand up if youve worked on an AI system.Remain standingif youve ever worked on a red team or security assessment.Stand up if youve worked on an AI system.Remain standingif youve ever worked on
2、 a red team or security assessment.Remain standingif youve conducted prompt injection testing or similar techniques.Stand up if youve worked on an AI system.FashionFashion-forward security firstforward security firstWhere itall startedChatGPT-poweredZalando assistant“With our Zalando Assistant,we ca
3、n help customers find what to wear for a certain occasion-a birthday party,a business meeting or even hiking to Machu Picchu.Customers can get inspired by a certain style,celebrity,or cultural moment the possibilities are almost endless.”Placeholder videoSecurityassessmentThe risks we faced:PrivacyS
4、ecurityBut alsoBiasesInappropriate contentMisinformation,hallucinationand robustness issuesa new world!UserExternalresourcesPersistent storageLLM(e.g.ChatGPT)AdversaryApp(e.g.semantic)Output(e.g.candidates)APIExternalThird-partyZalandoOutputsIndirect Prompt InjectionPrompt InjectionLLM may have acce
5、ss to external sources(e.g.web or DBs)LLM may have the capability to write to some persistent storageThreat modelling123SecurityassessmentA few examplesWill ZA fabricate information regarding Terms and Conditions,refund policy,shipping,.at Zalando?Will ZA provide the same outcome for all genders,all
6、 backgrounds of customers?Is ZA susceptible to jailbreak attacks?RemediationtimeFine tuningFine tuned the model with classifier training80K prompts Today,every customer message is being parsed by our safety classifier as well as the OpenAI Moderation API FashionFashion-forward security firstforward