Recover-LoRA reclaims accuracy in 2-bit LLMs via LoRA and knowledge distillation

AnalysisAI Models

7 days ago

Recover-LoRA reclaims accuracy in 2-bit LLMs via LoRA and knowledge distillation

Paper proposes Recover-LoRA, a method that uses low-rank adaptation and knowledge distillation on synthetic data to recover accuracy in 2-bit quantized language models. It targets severe degradation from aggressive quantization for edge deployment.

7 days ago