I upgraded my PC, I have just upped the RAM to 64GB and got a Radeon 9700xt 16GB GPU.

I want to do some personal programming projects and was wondering what is the best model that would fit my HW?

I mostly use python, but dabble a little in go, and am interested in looking at rust.

I am running this on my desktop, the processor is a AMD 3700X (8c/16t).

  • kata1yst@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    1
    ·
    7 months ago

    I’d recommend Qwen 2.5 Coder. Just try to ask very direct questions with smaller context.

    QWQ is a bit stronger at more advanced coding tasks but I had a ton of trouble finding a version that would fit in my 24G 7900xtx.

      • badcodecat@lemux.minnix.dev
        link
        fedilink
        English
        arrow-up
        3
        ·
        7 months ago

        in general, you would want something fast (probably something that fits in your GPU/VRAM) so you can get suggestions as fast as you can type. for chat, you’ll probably want the most intelligent/lorgest model you can run, it’s likely fine if it’s running on the CPU/RAM since the quality of an individual answer is more important than the speed in which many small answers can be generated. so, probably qwen for both, but, different sizes/quant for different use cases.

  • absGeekNZ@lemmy.nzOP
    link
    fedilink
    English
    arrow-up
    3
    ·
    7 months ago

    I’m running ollama 0.6.3 (pre-release) and rocm v6.10.5 on linux 6.11.0-21

    Still getting

    level=INFO source=gpu.go:377 msg=“no compatible GPUs were discovered”

    • Fisch@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      2
      ·
      7 months ago

      I have an RX 6700 XT and I needed to change an environment variable to make it work. Maybe something similar is needed for you GPU. I’d try googling something like “RX 9700 XT ROCM” or “RX 9700 XT ROCM no compatible GPUs were discovered” if you haven’t done that already.

    • SmokeyDope@lemmy.worldM
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      7 months ago

      When I had my AMD GPU going the best way to get models running was kobold.cpp and using vulcan. The flag is like --usevulcan or something. Its way easier than getting a rocm fork working from source.

  • absGeekNZ@lemmy.nzOP
    link
    fedilink
    English
    arrow-up
    1
    ·
    7 months ago

    Update: After updating to the latest kernel (6.14), and removing the old amdgpu drivers that i manually installed. I am now running on my GPU. I’m running deepseek coder 33B, and it generates approx 6 words/second.

    • raldone01@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      7 months ago

      I am running local models only for privacy sensitive stuff. If you have ollama you can also setup openwebui and access both local and remote models through the same very nice interface! Also chatgpt API is much cheaper than subscribing.