Analysis · Developers
15 days ago
Strix Halo users: rejected PR boosts MoE prompt processing by up to 30%
A llama.cpp pull request by pedapudi, rejected upstream, can speed up prompt processing for Mixture-of-Experts (MoE) models on Strix Halo by up to 30%. The small code change is not in mainline, but it can be applied manually.
