AnalysisDevelopers
7 hours ago
llama.cpp PR makes Vulkan tensor parallel viable
Pull request by pwilkin aims to make Vulkan tensor parallel (TP) viable in llama.cpp. Commenter notes it is a pass at making TP 'somewhat usable' and looks forward to its evolution.
