Exploring Theory of Mind in Large Language Models: An Analysis of Sparse Parameter Patterns

ToM tasks evaluate the capacity of Language Models (LLMs) in understanding others’ mental states. Central to this assessment are false-belief tasks (FB), particularly unexpected contents and unexpected transfer tasks. The unexpected contents task requires the LLM to discern the actual content of deceptive packaging, while the unexpected transfer task involves understanding outdated beliefs regarding an object’s location. In our study, we pinpointed certain “ToM-sensitive parameters” that are crucial for this reasoning, relying on Hessian-based sensitivity analysis. Perturbations in these parameters severely impacted the performance of RoPE-based LLMs, disrupting their attention mechanisms and contextual localization. The analysis found that as little as 0.001% of identified parameters can significantly degrade ToM capabilities, highlighting a distinct connection with the positional encoding mechanism. Unlike non-RoPE models, which showed resilience against these perturbations, RoPE-based architectures struggled to maintain coherent language understanding under similar conditions. This underscores the architecture’s crucial role in ToM reasoning in LLMs.

Source link

News

Company:

Join our community of SUBSCRIBERS and be part of the conversation.

Automotive Dealerships and Tech Firms Unite: A Guide to Acquiring AI Tools

Backslash Protects MCP Servers Against Data Leakage, Prompt Injection, and Privilege Abuse

Innovative AI-Driven Satellite Applications Powered by Spacechips

“Exploring Vibe Coding: Creating an AI-Powered App with Lovable”

Top 6 AI Tools for Effortless Watermark Removal

OpenAI, Anthropic, and Block Join Forces to Establish the ‘Agentic AI Foundation’ for AI Agent Standards

Introducing an AI That Analyzes Your Git History and Generates Status Reports

Exploring Spec-Driven Development: A Leading AI-Assisted Engineering Practice for 2025

Guidelines for Developing an AI Cloud Operating System

Optimizing Character AI with Advanced Episodic Memory Architectures

Exploring Theory of Mind in Large Language Models: An Analysis of Sparse Parameter Patterns

“Google AI Plus Debuts in India: Introducing Gemini 3 Pro, AI Image and Video Tools, Plus 200GB Storage at Just Rs 399” – Moneycontrol

Is Gemini a Fresh Channel or Just Search with a New Look?

The Visionary Trailblazers: AI Art Innovators Ahead of the Curve

Understanding AI: Its Impact on SEO – Search Engine Land

MIT Researchers Use AI and Robotics to “Speak Objects into Existence” | MIT News

Local News

Automotive Dealerships and Tech Firms Unite: A Guide to Acquiring AI Tools

OpenAI, Anthropic, and Block Join Forces to Establish the ‘Agentic AI Foundation’ for AI Agent Standards

Backslash Protects MCP Servers Against Data Leakage, Prompt Injection, and Privilege Abuse

Introducing an AI That Analyzes Your Git History and Generates Status Reports

Automotive Dealerships and Tech Firms Unite: A Guide to Acquiring AI Tools

OpenAI, Anthropic, and Block Join Forces to Establish the ‘Agentic AI Foundation’ for AI Agent Standards

Backslash Protects MCP Servers Against Data Leakage, Prompt Injection, and Privilege Abuse