Any frontend/model that runs on a 9070XT

CheeseNoodle@lemmy.world · edit-2 2 months ago

Any frontend/model that runs on a 9070XT

Fisch@discuss.tchncs.de · 2 months ago

I have the same GPU and I use koboldcpp with Vulkan as the backend. Works perfectly fine. I have a 12B model and it’s extremely fast. I could probably even fit a bigger model into the VRAM. Using tabbyAPI for EXL2 models didn’t work for me, it always generated gibberish (I tried 2 different models). For context, I’m on Linux, so maybe that’s not an issue on other operating systems.