An open-source toolkit for debugging AI models of all data typesTechnical noteKevin Eykholt and Taesung Lee08 Sep 2023Adversarial Robustness and PrivacyAI TestingData and AI Security
Did an AI write that? If so, which one? Introducing the new field of AI forensicsExplainerKim Martineau24 Jul 2023Adversarial Robustness and PrivacyAIExplainable AIFoundation ModelsGenerative AITrustworthy AI
Manipulating stock prices with an adversarial tweetResearchKim Martineau13 Jul 2022Adversarial Robustness and PrivacyTrustworthy AI
Securing AI systems with adversarial robustnessDeep DivePin-Yu Chen15 Dec 20218 minute readAdversarial Robustness and PrivacyAIData and AI Security
VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data SynthesisChia-yi HsuJia You Chenet al.2025ICASSP 2025
Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language ModelsXiaomeng XuPin-Yu Chenet al.2025AAAI 2025
Retention Score: Quantifying Jailbreak Risks for Vision Language ModelsZhaitang LiPin-Yu Chenet al.2025AAAI 2025
Is My Data in Your Retrieval Database? Membership Inference Attacks Against Retrieval Augmented GenerationMaya AndersonGuy Amitet al.2025ICISSP 2025
The Inherent Adversarial Robustness of Analog In-Memory ComputingCorey Liam LammieJulian Büchelet al.2025Nature Communications