New Prompt Injection Attack: Exploiting OpenAI Account Names to Bypass ChatGPT Security

August 26, 2025

AI researcher @LLMSherpa has revealed a significant vulnerability in OpenAI’s ChatGPT through a prompt insertion attack that exploits users’ account names. Unlike traditional prompt injections that manipulate input at runtime, this method embeds instructions into the internal system prompt using the account name, which the AI prioritizes. When @LLMSherpa modified his name to include specific directives, ChatGPT inadvertently exposed its entire internal system prompt, bypassing content filters. This vulnerability presents significant risks to user privacy and AI safety, as attackers could tailor account names to trigger unintended responses or access confidential information. The discovery emphasizes the need for robust security measures in LLM deployments, including sanitizing metadata and isolating user identifiers from prompt logic. As AI adoption grows, awareness of such vulnerabilities is crucial, making it essential for security teams to enhance threat modeling against unexpected attack surfaces. Follow us for continuous updates on AI security.

Source link

{{post_title}}

New Prompt Injection Attack: Exploiting OpenAI Account Names to Bypass ChatGPT Security

NO COMMENTS

LEAVE A REPLY Cancel reply

Loading…

Here are the results for the search: "{{td_search_query}}"

No results!

{{post_title}}

RELATED ARTICLES

Veza Research Report Uncovers Alarming Surge in Identity Permissions Sprawl Due...

Enhancing Account Reconciliation: Introducing the EPM AI Reconciliation Agent for Streamlined...

“Instagram Unveils ‘Your Algorithm’: A New AI Tool Empowering Users to...

NO COMMENTS

LEAVE A REPLY Cancel reply