AnalysisAI ModelsDevelopers
19 days ago
Custom Qwen-27B quant targets 16GB VRAM GPUs
Reddit user shares IQ4_KS quants for Qwen-27B, optimized for NVIDIA cards with 16GB VRAM. Uses ikawrakow's KS quantization not yet in main llama.cpp.
·
19 days ago
Reddit user shares IQ4_KS quants for Qwen-27B, optimized for NVIDIA cards with 16GB VRAM. Uses ikawrakow's KS quantization not yet in main llama.cpp.