Korean-Led Large Language Model Exhibits Increased Vulnerabilities to Malicious Attacks

The AI Safety Research Center at Soongsil University conducted a comprehensive analysis of over 20 major Large Language Models (LLMs), revealing that domestic Korean models are significantly more vulnerable to attacks such as prompt injection and jailbreak. The study, presented at a seminar on AI model security, showed that the safety level of these domestic models is only 82% that of their overseas counterparts. The research involved implementing a range of 57 attack techniques across various models, including SK Telecom AOTX and LG’s Ex-One Series, which were scored anonymously. Notably, Anthropic’s Claude Sonet 4 and OpenAI’s GPT-5 ranked highest in safety with scores of 628 and 626, respectively, while the leading domestic model scored 495. The findings highlighted the need for systematic evaluation and continuous improvement in domestic AI safety, as underscored by Choi Dae-seon, head of the center, emphasizing ongoing efforts to enhance the reliability of local AI technologies.

Source link

News

Company:

Join our community of SUBSCRIBERS and be part of the conversation.

CoverGo Launches Innovative AI Insurance Agents – itij.com

Stagwell and Emberos Unveil Agentic Tool to Empower Brands in AI Search Navigation

Claude Soars to the Top of the App Store Amid ‘Cancel ChatGPT’ Trend, Yet the US Military Struggles to Move On

Can the Military Safeguard Against Rogue Actions by Claude or OpenAI?

OpenAI Uncovers China’s Major Campaign to Suppress Online Dissent

AI: A Tool, Not a Teammate – Key Insights

One-Bit/OC-Mnemoria: Persistent Shared Memory for OpenCode Agents, Driven by the Mnemoria Rust Engine

AgentEmail: A GitHub Repository by zaddy6

AI Unravels the Secrets of an Ancient Roman Board Game

Opsyhq/Claw: Empowering Your Agent on Every Machine 🦞

Korean-Led Large Language Model Exhibits Increased Vulnerabilities to Malicious Attacks

ElevenLabs and Google Lead the Way in Artificial Analysis’ Latest Speech-to-Text Benchmark Update

oborchers/pydantypes: An Extensive Collection of Unique Pydantic Types

Streamlining Purchases: The Emergence of Agentic Commerce – FTI Consulting

SEBI’s AI Tool Eliminates 120,000 Posts from Unregistered Financial Influencers

Evaluating the Impact of Marketing Tactics on AI Shopping Agents: Experimental Insights from aaronbatchelder/claude-marketing-susceptibility-eval

Local News

AI: A Tool, Not a Teammate – Key Insights

CoverGo Launches Innovative AI Insurance Agents – itij.com

One-Bit/OC-Mnemoria: Persistent Shared Memory for OpenCode Agents, Driven by the Mnemoria Rust Engine

Stagwell and Emberos Unveil Agentic Tool to Empower Brands in AI Search Navigation

AI: A Tool, Not a Teammate – Key Insights

CoverGo Launches Innovative AI Insurance Agents – itij.com

One-Bit/OC-Mnemoria: Persistent Shared Memory for OpenCode Agents, Driven by the Mnemoria Rust Engine