Analysis | AI Models

Paper presents geometric account of activation steering via angle-norm decomposition

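The paper analyzes activation steering geometrically, decomposing an intervention's effect on an activation into a change in angle (direction) and a change in norm (magnitude). It also proposes a spherical steering paradigm aimed at the limitations of additive interventions, which add a steering vector directly to the activation. Together these give a geometric framework for understanding and improving steering effectiveness.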
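To make the angle-norm view concrete, here is a minimal NumPy sketch. It is not the paper's method: the function names are hypothetical, and `spherical_steer` is only one plausible reading of "spherical steering" (a norm-preserving rotation of the activation toward the steering direction), assumed here for illustration. The sketch shows that an additive intervention generally changes both the angle and the norm of an activation, while a rotation changes only the angle.

```python
import numpy as np

def angle_between(a, b):
    """Angle in radians between vectors a and b."""
    cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))

def additive_steer(h, v, alpha):
    """Standard additive intervention: h' = h + alpha * v."""
    return h + alpha * v

def angle_norm_change(h, h_new):
    """Split an intervention's effect into (rotation angle, norm ratio)."""
    angle = angle_between(h, h_new)
    ratio = float(np.linalg.norm(h_new) / np.linalg.norm(h))
    return angle, ratio

def spherical_steer(h, v, theta):
    """ASSUMPTION (illustrative, not from the paper): rotate h by theta
    radians toward v in the plane spanned by h and v, keeping ||h|| fixed."""
    r = np.linalg.norm(h)
    h_hat = h / r
    # Component of v orthogonal to h defines the direction of rotation.
    u = v - np.dot(v, h_hat) * h_hat
    u_norm = np.linalg.norm(u)
    if u_norm < 1e-12:
        return h.copy()  # v is parallel to h, so no rotation plane exists
    u_hat = u / u_norm
    return r * (np.cos(theta) * h_hat + np.sin(theta) * u_hat)

# Toy 2-D activation and steering vector (made-up values).
h = np.array([3.0, 0.0])
v = np.array([0.0, 1.0])

added = additive_steer(h, v, alpha=4.0)       # [3, 4]
add_angle, add_ratio = angle_norm_change(h, added)

rotated = spherical_steer(h, v, theta=add_angle)
rot_angle, rot_ratio = angle_norm_change(h, rotated)
```

In this toy case the additive step rotates the activation by about 0.93 radians and also scales its norm by 5/3, whereas the rotation reaches the same angle with the norm unchanged. Whether the paper's spherical steering is defined this way is not stated in the summary above.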

6 days ago