Back to AIBriefs
How-ToAI Models

7900XTX fits Qwen 3.6 27B at 131k context

User fits Qwen 3.6 27B model with Q6K+MTP quantization at 131k context on 7900XTX 24GB VRAM using iGPU trick to free VRAM. Achieves up to ~20 tokens/s with kernel split.

·
Jun 20, 8:23 AM