Jun 13, 9:06 PM
llama.cpp PR adds EAGLE3 support for Qwen models
GitHub pull request #24593 introduces EAGLE3 speculative decoding for Qwen models in llama.cpp. It is a small work-in-progress change by community contributor jacek2023.
