How-ToDevelopers
19 days ago
Dual AMD GPU setup with 48GB VRAM runs llama-cpp server
User configured dual AMD GPUs (R7900 + 7800XT) totaling 48GB VRAM to run a llama-cpp server using Vulkan, bypassing ROCm compatibility issues. The setup demonstrates a practical approach to local LLM inference on mixed RDNA architectures.
·
19 days ago
