
Scientists Unveil “Universal” Jailbreak for Most AIs—Prepare for a Mind-Bending Explanation!


Recent research reveals serious flaws in AI models’ safety training, showing they remain susceptible to “jailbreaking” techniques, including a surprising method the authors call “adversarial poetry.” Researchers from DEXAI and Sapienza University found that rephrasing harmful prompts as poetry could trick AI chatbots into ignoring their safety protocols, with success rates exceeding 90% in some cases. The study, which is still pending peer review, examined 25 AI models, including Google’s Gemini 2.5 Pro and OpenAI’s GPT-5, and found that even simple verse could mislead them. Handcrafted poems proved more effective than AI-generated ones, and success rates varied notably across models. Smaller models, such as GPT-5 Nano, were more resistant to the manipulation, suggesting that their larger counterparts may be overconfident when interpreting ambiguous prompts. The findings underscore the inadequacy of current AI safety mechanisms and point to a need for improved alignment and evaluation strategies to prevent misuse.
