Wednesday, December 10, 2025

ADL Warns: Complex Prompts Can Manipulate Bots into Antisemitism

A recent study by the Anti-Defamation League (ADL) finds that open-source AI models are alarmingly susceptible to manipulation, generating antisemitic content when given complex, carefully constructed prompts. The study examined 17 open-source models, including Google’s Gemma-3 and Microsoft’s Phi-4, using extreme hypotheticals designed to probe their biases. The findings showed significant anti-Jewish bias: 68% of the models produced harmful content, and 44% generated dangerous responses to prompts about synagogues and gun stores. The study highlights a critical vulnerability in the AI ecosystem, noting that open-source models lack the safety measures found in closed-source options such as OpenAI’s GPT. ADL CEO Jonathan Greenblatt called for stronger safety regulations and enforcement mechanisms to prevent misuse. The report underscores the urgent need for industry leaders and policymakers to collaborate on robust safeguards against the exploitation of AI to spread hate and misinformation.
