AnalysisAI Models
1 day ago
KV quantization quality praised in local LLM use
A Reddit user on r/LocalLLaMA reports that KV cache quantization at Q4_0 retains surprising quality, even for the drafter. Screenshots illustrate the retained coherence.
·
1 day ago
