cognitive cybersecurity intelligence

News and Analysis

New LLM jailbreak uses models’ evaluation skills against them

Researchers from Palo Alto Networks have discovered a method of exploiting large language models (LLMs) to generate harmful content, including malware or harassment, calling it the “Bad Likert Judge”. It succeeded with an attack rate of 71.6% across six models, a significant improvement compared to single-turn attacks. The method works by encouraging the model to score prompts based on the amount of harmful content and then generate examples. Measures to counter the exploit include applying content filters to evaluate input and output.

Source: www.scworld.com –

Subscribe to newsletter

Subscribe to HEAL Security Dispatch for the latest healthcare cybersecurity news and analysis.

More Posts

UK’s Legal Aid Agency Experiences Cyberattack

UK’s Legal Aid Agency Experiences Cyberattack

A UK Ministry of Justice executive agency was targeted in a cyberattack, compromising its systems. The incident highlights vulnerabilities in cybersecurity, prompting a review of

HIMSSCast: Help with the labor shortage and more can come from tech dealmaking

HIMSSCast: Help with the labor shortage and more can come from tech dealmaking

Berkeley Research Group (BRG) launched its “2025 U.S. Healthcare & Life Sciences Transactions Outlook,” revealing that providers are considering deal activities to tackle AI, cybersecurity,

How telemedicine bridges the rural maternity care gap

How telemedicine bridges the rural maternity care gap

The content centers on “Enterprise Taxonomy” related to telehealth, focusing on patient access for rural and underserved communities, as well as population and public health.

LockBit ransomware group falls victim to hackers itself

LockBit ransomware group falls victim to hackers itself

A data leak has disclosed information about negotiations with victims, Bitcoin wallet addresses, affiliate accounts, and details of attacks.