AnalysisAI Models
7 days ago
Recover-LoRA reclaims accuracy in 2-bit LLMs via LoRA and knowledge distillation
Paper proposes Recover-LoRA, a method that uses low-rank adaptation and knowledge distillation on synthetic data to recover accuracy in 2-bit quantized language models. It targets severe degradation from aggressive quantization for edge deployment.
·
7 days ago