Linear probes detect task format, not reasoning mode in LLMs

AnalysisAI Models

8 days ago

Linear probes detect task format, not reasoning mode in LLMs

Probing Qwen3-14B hidden states, the paper finds that linear classifiers detect task format (e.g., math vs. common sense) rather than reasoning mode (e.g., deductive vs. inductive). Benchmark spanning the classical trichotomy shows emergent capabilities are not captured by such probes.

8 days ago