Claude's Cooperative Design Becomes Its Vulnerability in Mindgard's Gaslighting Attack
AI red-teaming firm Mindgard exploited Claude's helpfulness and humility to extract erotica, malicious code, and explosive-assembly instructions — without a single direct request.