AnalysisAI ModelsDevelopers
5 days ago
Community implements NanoQuant binary quantization method
A Reddit user implemented NanoQuant, a flexible binary quantization method supporting 2-bit, 1-bit, and 0.5-bit per weight quantizations for dense transformers. The implementation is available on GitHub.
·
5 days ago
