Precise Weight-Matrix Fingerprints for Identifying Both From-Scratch and Base-Derived Large Language Models

October 12, 2025

Verifying the origin of a large language model (LLM) is a significant challenge in protecting it, and it is crucial for safeguarding intellectual property. Researchers from Shanghai Jiao Tong University have introduced a training-free fingerprinting technique that analyzes weight matrices to determine whether an LLM was trained from scratch or derived from an existing model. The method is designed to stay robust to common post-training processes such as fine-tuning, and the authors report near-zero false positives and perfect scores across all classification metrics. It is also fast, completing within 30 seconds on standard hardware. The research additionally highlights vulnerabilities in LLMs, such as the Attacking with Weight Manipulation (AWM) threat, which can exploit subtle input alterations, and the authors suggest defenses such as input sanitization and adversarial training. The approach provides a foundation for reliable model provenance verification as the AI landscape evolves.
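To make the idea of a training-free weight comparison concrete, here is a minimal sketch. It is not the authors' method, which the summary does not describe in detail. It swaps in plain cosine similarity between flattened weight matrices as a naive stand-in, and every name in it (`cosine_similarity`, `random_matrix`, `perturb`) is hypothetical. Light fine-tuning is simulated, as an assumption, by adding small Gaussian noise to a base matrix.

```python
import math
import random


def cosine_similarity(a, b):
    """Cosine similarity between two equally shaped matrices (lists of rows)."""
    flat_a = [x for row in a for x in row]
    flat_b = [x for row in b for x in row]
    dot = sum(x * y for x, y in zip(flat_a, flat_b))
    norm_a = math.sqrt(sum(x * x for x in flat_a))
    norm_b = math.sqrt(sum(y * y for y in flat_b))
    return dot / (norm_a * norm_b)


def random_matrix(rng, rows, cols):
    """Stand-in for a weight matrix of an independently initialized model."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def perturb(matrix, rng, scale):
    """Assumed model of light fine-tuning: small Gaussian noise on every weight."""
    return [[x + rng.gauss(0.0, scale) for x in row] for row in matrix]


rng = random.Random(0)
base = random_matrix(rng, 32, 32)          # hypothetical base-model weights
derived = perturb(base, rng, scale=0.05)   # hypothetical fine-tuned copy
scratch = random_matrix(rng, 32, 32)       # hypothetical unrelated model

sim_derived = cosine_similarity(base, derived)
sim_scratch = cosine_similarity(base, scratch)
print(f"base vs derived: {sim_derived:.3f}")
print(f"base vs scratch: {sim_scratch:.3f}")
```

In this toy setting the derived matrix stays close to the base while the independently drawn matrix scores near zero. A plain comparison like this would fail if the rows or columns of a derived matrix were permuted, which is one reason a practical fingerprint needs to be robust to such manipulations.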
