How-To · AI Models
Jun 20, 8:23 AM
7900XTX fits Qwen 3.6 27B at 131k context
A user fits the Qwen 3.6 27B model, quantized as Q6K+MTP, at 131k context on a 7900XTX with 24GB of VRAM, using an iGPU trick to free VRAM on the card. The setup reaches up to ~20 tokens/s with a kernel split.
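A rough back-of-the-envelope sketch of why freeing VRAM matters here. It assumes "Q6K" refers to llama.cpp's Q6_K format at about 6.5625 bits per weight, takes 27B as the nominal parameter count, and treats the card's 24GB as 24 GiB; it does not count the MTP weights, the KV cache for 131k context, or compute buffers, so the real headroom is smaller than what it prints. The names below are illustrative, not from the original post.

```python
# Estimate the quantized weight footprint and what is left on a 24 GB card.
# All constants are assumptions for illustration, not measured values.

def weight_footprint_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB."""
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30

N_PARAMS = 27e9      # nominal parameter count (assumption)
Q6_K_BPW = 6.5625    # bits per weight for llama.cpp Q6_K (assumption: Q6K == Q6_K)
VRAM_GIB = 24.0      # card capacity, treated as GiB (assumption)

weights_gib = weight_footprint_gib(N_PARAMS, Q6_K_BPW)
headroom_gib = VRAM_GIB - weights_gib  # left for KV cache, MTP weights, buffers

print(f"weights  ~ {weights_gib:.1f} GiB")
print(f"headroom ~ {headroom_gib:.1f} GiB before KV cache and buffers")
```

Under these assumptions the weights alone take most of the card, which is consistent with the post's point that every bit of VRAM reclaimed from other uses counts at 131k context.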