The Anthropic Safeguards Research Team introduced Constitutional Classifiers to protect AI models from universal jailbreaks. In extensive attack simulations the method proved resilient, cutting the jailbreak success rate from 86% to 4.4% while keeping over-refusal rates low. Despite its effectiveness, the researchers advise combining it with other defenses as jailbreaking techniques evolve.
Soda Health Achieves HITRUST e1 Certification Demonstrating Foundational Cybersecurity
Soda Health, a leading provider of Smart Benefits administration to Medicare Advantage and Medicaid plans, has achieved HITRUST e1 Certification, a mark of solid commitment