Research

This page should redirect you to the arXiv paper.

Research

Understanding Memorization via Loss Curvature

November 6, 2025

Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering

November 1, 2025

Deploying Interpretability to Production with Rakuten: SAE Probes for PII Detection

October 28, 2025
Ekdeep Singh Lubana
,
Can Rager
,
Sai Sumedh R. Hindupur
,
Fundamental Research
Link post