Hello, I run an AMD card, and there have been significant ROCm support updates in llama.cpp (flash attention, new quants, and major speed improvements) since the version currently bundled with llama-cpp-python.
Could you do us a big one and publish a new llama-cpp-python release with the latest llama.cpp? It would be much appreciated! Thank you!
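In the meantime, a possible workaround is rebuilding llama-cpp-python from source with the ROCm (hipBLAS) backend enabled. The sketch below assumes the `CMAKE_ARGS` / `FORCE_CMAKE` build variables documented in the llama-cpp-python README and the `-DLLAMA_HIPBLAS=on` flag; flag names have changed across llama.cpp versions, so treat this as a starting point rather than a definitive invocation.

```python
# Hedged sketch: rebuild llama-cpp-python from source with the ROCm (hipBLAS)
# backend. Assumes a working ROCm toolchain and that the package honors the
# CMAKE_ARGS / FORCE_CMAKE environment variables from its README.
import os
import subprocess
import sys

env = dict(
    os.environ,
    CMAKE_ARGS="-DLLAMA_HIPBLAS=on",  # enable the hipBLAS (ROCm) backend
    FORCE_CMAKE="1",                  # force a source build, not a prebuilt wheel
)

# --no-cache-dir avoids reusing a previously built CPU-only wheel.
subprocess.run(
    [sys.executable, "-m", "pip", "install", "--upgrade",
     "--no-cache-dir", "llama-cpp-python"],
    env=env,
    check=True,
)
```

This only helps if the PyPI sdist already vendors a recent enough llama.cpp; for the very latest upstream changes, a new release (or installing from the repo's main branch) is still needed.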