You must log in or register to comment.
Not only is the RADV for Llama.cpp past processing faster than the official (former) AMDVLK Vulkan driver but also ROCm.
I assume that they mean “prompt processing”. That was one test for one person, but that seems like what I’d lead with, as it seems pretty significant for some use cases.



