How-ToDevelopers
1 day ago
Running OSS GenAI models on Jetson with Ollama, llama.cpp, and vLLM
Covers running popular OSS models (Gemma, Qwen) on Jetson using Ollama for rapid prototyping or vLLM for higher-throughput serving. Demonstrates the same workflow across both frameworks on edge devices.
·
1 day ago