AnalysisAI ModelsDevelopers
16 days ago
OSCAR RotationZoo provides precomputed rotation matrices for INT2 KV-cache quantization
OSCAR RotationZoo offers precomputed K/V rotation matrices for OSCAR INT2 KV-cache quantization, designed to reduce memory footprint during inference. The matrices are available on HuggingFace for use with various LLMs.
