Analysis · AI Models
8 days ago
Study finds cross-model activation transfer fails in Pythia multi-hop setting
A negative result: directly transferring hidden-layer activations from one Pythia language model to another does not work in a multi-hop reasoning setting. The study tests whether one model can pass intermediate signals to another this way and finds no evidence of effective transfer.