Four approximate unlearning methods all pass output-level tests but RULER's M2 detects residuals in 10 of 12 conditions (p<0.05)?

Four approximate unlearning methods all pass output-level tests but RULER's M2 detects residuals in 10 of 12 conditions (p<0.05).

RULER uses both an oracle-comparative metric (M2) and an oracle-free metric (M4) to measure representation-level memorization?

RULER uses both an oracle-comparative metric (M2) and an oracle-free metric (M4) to measure representation-level memorization.

Research & Papers

RULER: New method reveals AI 'unlearning' still leaves hidden data traces

Q: RULER uses both an oracle-comparative metric (M2) and an oracle-free metric (M4) to measure representation-level memorization?

RULER uses both an oracle-comparative metric (M2) and an oracle-free metric (M4) to measure representation-level memorization.

Q: M4 acts as a pre-unlearning diagnostic, revealing identity-level memorization in face recognition models that current methods cannot erase?

M4 acts as a pre-unlearning diagnostic, revealing identity-level memorization in face recognition models that current methods cannot erase.

arXiv cs.AI May 28, 2026

⚡Current unlearning checks miss residual data in model internals—RULER catches them.

Deep Dive

Machine unlearning aims to remove the influence of specific training records from a deployed model without full retraining. Current verification relies on output-level protocols—membership inference, retain accuracy, and forget-set accuracy. But a model can pass all three while still encoding forgotten records in its intermediate representations. Researchers Georgina Cosma and Axel Finke have introduced RULER, a set of representation-level verification metrics that dig into the model's internal activations to detect residual traces.

RULER includes two key metrics: M2, an oracle-comparative metric that measures whether forget-set records occupy the same representational position as in a retrained model, and M4, an oracle-free metric that detects residuals purely from the unlearned model's internal similarity structure. Tests on four approximate unlearning methods showed all passed output-level evaluation, yet M2 detected significant residuals in 10 of 12 conditions (p<0.05), with effect sizes increasing as the forget fraction grew. A fifth method, Bad Teacher, showed the same residuals despite a different forgetting mechanism. M4 also functioned as a pre-unlearning diagnostic across tabular, image, clinical text, and face-identity settings, detecting identity-level memorization in face recognition models where no tested method fully erased the signal. The findings have major implications for privacy compliance and trust in AI systems.

Key Points

Four approximate unlearning methods all pass output-level tests but RULER's M2 detects residuals in 10 of 12 conditions (p<0.05).
RULER uses both an oracle-comparative metric (M2) and an oracle-free metric (M4) to measure representation-level memorization.
M4 acts as a pre-unlearning diagnostic, revealing identity-level memorization in face recognition models that current methods cannot erase.

Why It Matters

Exposes that current AI unlearning is far from reliable, risking privacy violations in regulated industries.

Read Original Article

RULER: New method reveals AI 'unlearning' still leaves hidden data traces

Why It Matters

Related Articles

🚀 Stay Ahead in AI