Launch · Developers
Jun 14, 10:45 PM
EAGLE3 speculative decoding merged into llama.cpp
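After half a year of development, EAGLE3 speculative decoding has been merged into llama.cpp. In speculative decoding, a small helper (draft) model proposes several tokens ahead and the main model verifies them. With EAGLE3, the helper is fed the main model's internal hidden states, so it gets extra guidance instead of guessing completely on its own.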
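
To make the draft-and-verify idea concrete, here is a minimal toy sketch in Python. It is not llama.cpp code and does not reproduce the EAGLE3 architecture: every name in it (`target_next`, `target_feature`, `draft_next`, `speculative_generate`) is hypothetical, the "models" are small arithmetic functions, and the "guidance" is a single number standing in for the main model's internal state. It only illustrates two points: verification keeps the output identical to what the main model would produce on its own, and a helper that sees the main model's state can get more of its guesses accepted per round.

```python
from typing import List, Optional, Tuple

VOCAB = 50  # toy vocabulary size (assumption for this sketch)


def target_next(ctx: List[int]) -> int:
    """Toy stand-in for the main model: deterministic greedy next token."""
    return (sum(ctx) * 31 + len(ctx)) % VOCAB


def target_feature(ctx: List[int]) -> int:
    """Toy stand-in for an internal state the main model exposes to the helper."""
    return sum(ctx)


def draft_next(ctx: List[int], feature: Optional[int] = None) -> int:
    """Toy helper model. Unguided, it sees only the last token; guided, it
    reuses the feature handed over by the main model."""
    basis = ctx[-1] if feature is None else feature
    return (basis * 31 + len(ctx)) % VOCAB


def greedy_generate(prompt: List[int], n_new: int) -> List[int]:
    """Baseline: the main model generates one token per step."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(target_next(out))
    return out


def speculative_generate(
    prompt: List[int], n_new: int, k: int = 4, guided: bool = True
) -> Tuple[List[int], int]:
    """Draft k tokens, verify them against the main model, repeat.
    Returns the generated sequence and the number of draft/verify rounds."""
    out = list(prompt)
    rounds = 0
    while len(out) - len(prompt) < n_new:
        rounds += 1
        # Draft phase: the helper proposes k tokens ahead.
        feature = target_feature(out) if guided else None
        draft_ctx = list(out)
        proposal = []
        for _ in range(k):
            tok = draft_next(draft_ctx, feature)
            proposal.append(tok)
            draft_ctx.append(tok)
            if feature is not None:
                feature += tok  # the helper rolls the feature forward itself
        # Verify phase: a real system scores all positions in one batched pass;
        # here the main model is simply called position by position.
        for tok in proposal:
            expected = target_next(out)
            out.append(expected)  # always keep the main model's token
            if tok != expected:
                break  # first mismatch ends the accepted prefix
        else:
            out.append(target_next(out))  # all accepted: one bonus token
    return out[: len(prompt) + n_new], rounds


prompt = [3, 7, 11]
baseline = greedy_generate(prompt, 20)
guided_out, guided_rounds = speculative_generate(prompt, 20, guided=True)
unguided_out, unguided_rounds = speculative_generate(prompt, 20, guided=False)
print(guided_out == baseline, unguided_out == baseline)
print("rounds:", guided_rounds, "guided vs", unguided_rounds, "unguided")
```

In this toy the guided helper happens to be exact, which a real draft model is not; the point is only that better-informed drafts mean fewer verification rounds for the same output.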
