AnalysisAI Models
7 days ago
MorphoQuant: Modality-aware 4-bit quantization for omni-modal LLMs
Proposes MorphoQuant, a quantization method addressing extreme distribution heterogeneity across modalities in 4-bit OLLMs. It outperforms conventional PTQ by handling outlier patterns specific to each modality.
·
7 days ago