AnalysisAI Models
8 days ago
Linear probes detect task format, not reasoning mode in LLMs
Probing Qwen3-14B hidden states, the paper finds that linear classifiers detect task format (e.g., math vs. common sense) rather than reasoning mode (e.g., deductive vs. inductive). Benchmark spanning the classical trichotomy shows emergent capabilities are not captured by such probes.
·
8 days ago