LaunchDevelopers
Jun 12, 7:40 AM
EAGLE3 speculation lands in llama.cpp for Qwen models
Available via the `--spec-type draft-eagle3` flag in llama.cpp release b9723. The helper model gets extra guidance from the main model, unlike MTP. Support added for Qwen 3.5 and 3.6 models.
