Test your llama.cpp '--threads' argument for up to 80% performance gain

How-ToDevelopers

2 days ago

Test your llama.cpp '--threads' argument for up to 80% performance gain

A Reddit user reports up to 80% performance improvement on llama.cpp by optimizing the --threads argument for hybrid CPU architectures. Suggests using only P-cores with taskset/affinity for best results.

PSA: Test your "threads" argument in llama.cpp (+80% performance in my case)2 days agoAXYZE8

··Discuss

2 days ago