Tuesday, September 16, 2025

Leading AI Companies Collaborate with US and UK Governments on Model Safety Initiatives

OpenAI and Anthropic are collaborating with the U.S. and U.K. governments to harden their large language models (LLMs) against misuse. The partnership, described in recent blog posts from both companies, gives researchers at the National Institute of Standards and Technology (NIST) and the U.K. AI Security Institute access to the models for independent evaluations. The goal is to identify vulnerabilities, including potential attack vectors that could compromise security. OpenAI reported two significant vulnerabilities that could enable sophisticated attacks and has since reinforced safeguards in products such as GPT-5 and ChatGPT. Anthropic likewise opened its Claude models to testing; critical vulnerabilities uncovered there prompted a complete restructuring of its safeguard architecture. Despite concerns that companies may prioritize competitiveness over safety, experts say commercial models are becoming more secure. AI safety remains a debated topic, but the ongoing collaboration signals a continued commitment to addressing vulnerabilities in these systems.

