A new study suggests that the advanced reasoning powering today’s AI models can weaken their safety systems.
AIM Intelligence's red team breached Anthropic's Claude Opus 4.6 in just 30 minutes, exposing major security gaps as ...
Anthropic has long warned about these risks—so much so that in 2023, the company pledged not to release certain models ...
In this podcast, Michael Stiefel spoke with ...
Microsoft is warning users of a newly discovered AI jailbreak attack that can cause a generative AI model to ignore its guardrails and return malicious or unsanctioned responses to user prompts. The ...
Researchers from Germany have successfully performed a ‘jailbreak’ on a Tesla Model 3, thereby gaining free access to in-car features normally reserved for paid upgrades. The white hat hackers, three ...
What if the most advanced AI model of our time could break its own rules on day one? The release of Grok 4, an innovative AI system, has ignited both excitement and controversy, thanks to its new ...