All the runtimes except the Intel ones are llama.cpp Q4_K_M quants, so the Ampere ones aren’t anything special.
…The Intel ones kinda are, though. They actually ship runtimes for CPU, GPU, and NPU, and AFAIK the CPU one may be able to use AMX if you’re on a server CPU.
It’s still not great for a lot of reasons, but one could do worse.
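If you want to sanity-check the AMX part on your own machine, the CPU advertises it through feature flags. Here’s a minimal sketch (my addition, not from the runtimes themselves) that just reads the Linux `/proc/cpuinfo` flags, looking for `amx_tile`, `amx_int8`, and `amx_bf16`, which server-class Intel chips (Sapphire Rapids and later) expose:

```python
# Minimal sketch: check whether this CPU advertises AMX feature flags on Linux.
# These flag names come from /proc/cpuinfo; whether a given runtime actually
# uses AMX is a separate question.
with open("/proc/cpuinfo") as f:
    flags = set()
    for line in f:
        if line.startswith("flags"):
            flags.update(line.split(":", 1)[1].split())
            break

amx_flags = {"amx_tile", "amx_int8", "amx_bf16"}
present = amx_flags & flags
print("AMX:", ", ".join(sorted(present)) if present else "not detected")
```

If nothing shows up there, the CPU runtime is going to fall back to AVX-512/AVX2 paths regardless of what the vendor claims.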
Does this mean they optimized for CPU instead of GPU? I doubt they target Intel GPUs, tbh, so it sounds like they really did optimize for CPU… interesting!