AnalysisAI Models
Jun 16, 10:48 AM
Reddit post warns Qwen/Claude distillations often worse than base model
Post argues that many community distillations of Qwen and Claude models underperform the original base models. Author notes that distilling from a larger teacher can introduce artifacts and reduce quality.
·
Jun 16, 10:48 AM