Launch · AI Models
26 days ago
ModelBest releases BitCPM-CANN, 1.58-bit ternary LLM trained on Ascend NPUs
BitCPM-CANN uses 1.58-bit ternary quantization, which needs roughly one-sixth the VRAM of BF16 while retaining most of the model's capability. The model family ranges from 2B to 8B parameters and is fully open-source on Hugging Face.
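For readers unfamiliar with the term, "1.58-bit" refers to weights restricted to the three values -1, 0, and +1, since log2(3) is about 1.585 bits per weight. The sketch below illustrates one common way to produce such weights, absmean scaling as described for BitNet b1.58. This is an assumption for illustration only: the announcement does not say which scheme BitCPM-CANN uses, and the function names `ternary_quantize` and `dequantize` are hypothetical.

```python
import math

# Information content of one weight drawn from {-1, 0, +1}: log2(3) bits.
BITS_PER_TERNARY_WEIGHT = math.log2(3)


def ternary_quantize(weights):
    """Map floats to {-1, 0, +1} plus one shared scale.

    Assumed scheme (absmean scaling, as in BitNet b1.58); not confirmed
    to be the method used by BitCPM-CANN.
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0.0:
        return [0] * len(weights), 0.0
    # Divide by the mean absolute value, round, then clip to [-1, 1].
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Approximate reconstruction of the original weights."""
    return [q * scale for q in quantized]


if __name__ == "__main__":
    example = [0.9, -0.05, -1.2, 0.4, 0.0, -0.6]
    q, s = ternary_quantize(example)
    print("bits per weight:", round(BITS_PER_TERNARY_WEIGHT, 3))
    print("ternary weights:", q)
    print("scale:", s)
    print("reconstructed:", dequantize(q, s))
```

Real memory savings depend on how the ternary values are packed and on which tensors stay in higher precision, so the per-weight figure here is not expected to match the reported VRAM reduction exactly.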