AnalysisAI Models
4 days ago
MoQ and GSQ improve low-bit GGUF quantizations
MoQ and GSQ are new quantization methods for the GGUF format, aiming to improve quality at very low bit widths. This could enable higher quality 2-3 bit quantized models for local LLM inference.
4 days ago
