LaunchDevelopers
23 days ago
llama.cpp with MTP support speeds up local models

Hugging Face
@huggingfaceThe AI community building the future. https://t.co/TpiXQMQ9rZ
NYC and Paris and ๐huggingface.co

Hugging Face
@huggingface
RT @ClementDelangue: llama.cpp with MTP support makes local models fast enough to use as daily drivers ๐ Qwen3.6-27B dense generation belโฆ
ยท
23 days ago