Friday, January 16, 2026

Researchers Develop Advanced Memory Probe for AI Models • The Register

Researchers from Carnegie Mellon University and others have introduced “RECAP,” a groundbreaking tool to extract memorized content from large language models (LLMs), addressing concerns about copyright infringement and AI model transparency. In their preprint paper, they highlight how traditional extraction techniques, like Prefix-Probing, have become less reliable due to model alignment that often prevents disclosure of memorized content. RECAP enhances this process with an iterative feedback loop, enabling LLMs to refine their responses after each attempt without including verbatim text, thus maintaining compliance with copyright regulations. The research emphasizes the need for clarity regarding AI training data, especially since many companies do not disclose their datasets. With their innovative approach, RECAP has demonstrated a 78% improvement over previous methods, successfully extracting thousands of passages, including from popular works like “Harry Potter.” This advancement is crucial for compliance discussions in the rapidly evolving AI landscape.

Source link

Share

Read more

Local News