AnalysisAI Models
6 days ago
Paper presents geometric account of activation steering via angle-norm decomposition
The paper proposes a spherical steering paradigm and analyzes activation steering through angle-norm decomposition, addressing limitations of additive interventions. It offers a geometric framework for understanding and improving steering effectiveness.
·
6 days ago