The change is a result of MTP support landing in llama.cpp. The Qwen3.6 Unsloth GGUFs are now out of experimental mode: llama.cpp has merged many PRs, and MTP is now properly supported in Unsloth.
Agreed! It really is neat to be present and participating during this time. I know the future will hold great things, but to your point, it’s crazy how quickly things move.
I know there will be some demand for turnkey AI solutions, since people unlike us won’t have the time or patience (or hardware) to make it work, but it’s so rewarding when it does work.
And boy does it work!
For sure, it’s pretty magical, and I feel like this year has been a real breakthrough for local models where they really can do non-trivial work. I’m really excited to see what things look like by next year.