Tencent recently released a new MoE model with ~80B total parameters, ~13B of which are active per token at inference. Seems very promising for people with access to 64 GB of VRAM.
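To see why 64 GB is the ballpark, here's a back-of-envelope memory estimate. The ~4.5 bits-per-weight figure is an assumption (roughly a Q4-class GGUF quant), not anything from the release:

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight memory in GiB at a given quantization level."""
    return params_billion * 1e9 * bits_per_weight / 8 / 2**30

# ~80B total parameters, ~13B active per token; 4.5 bpw is an assumed quant.
total = weight_gib(80, 4.5)    # all expert weights must be resident somewhere
active = weight_gib(13, 4.5)   # weights actually read per token

print(f"total weights ~ {total:.0f} GiB, active per token ~ {active:.0f} GiB")
```

So the full quantized model is roughly 42 GiB of weights, which fits in 64 GB with room for KV cache, while the per-token compute only touches ~7 GiB of those weights. That asymmetry is what makes MoE offloading strategies attractive.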
Been trying to play with this in ik_llama.cpp, and it's a temperamental model. It feels deep-fried: like it wants to be smart, if only it would stop looping or mangling its own think template.
It works great in 24GB VRAM though.
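One way a ~80B MoE can run in 24 GB is to keep the routed-expert FFN tensors in system RAM and put only attention and shared weights on the GPU. A sketch using the `--override-tensor` (`-ot`) tensor-placement flag from ik_llama.cpp; the model path, quant, tensor-name regex, and context size here are placeholders, not a verified recipe:

```shell
# Sketch: offload all layers to GPU, then override the routed-expert
# FFN tensors (the bulk of an MoE's weights) back onto CPU/system RAM.
./llama-server -m ./model-Q4_K_M.gguf \
  -ngl 99 \
  -ot "\.ffn_.*_exps\.=CPU" \
  -c 16384
```

Since only ~13B parameters are active per token, the CPU-side expert reads stay manageable and the GPU handles the dense path, which is roughly how a 24 GB card copes with a model this size.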