AnalysisAI Models
Jun 25, 12:13 AM
Gemma 4 QAT uncensored fine-tunes get MTP speed boost
Community fine-tunes of Gemma 4 (26B-A4B and 31B-QAT) add Multi-Turn Prediction (MTP), boosting inference speed by 35% and 53% respectively. Models are available on Hugging Face.
·
Jun 25, 12:13 AM
