Also worth checking out is Jamba for document analysis type stuff specifically, Deepseek’s chat app, Kimi K2 generally (especially for creative writing), Longcat for chat and video generation, and Minimax, who have a “Pro” RAG UI free for awhile.
The only thing I might use Qwen for is coding TBH (or images I guess), but Deepseek and GLM are usually better.
The Chinese devs are kinda killing it atm, especially since most of these models are open weights.
First of all, try Z.ai over Qwen. It’s quite good. GLM 4.6 is way better than anything Qwen IMO.
They both use Open Web UI which you can actually self host (and hit the LLM through API).
All the Chinese chat apps are kinda censored.
…But the underlying models, if you run them locally or access them over API, aren’t as bad. The censoring is through some kind of prefilter.
nice. I need to check it out
Also worth checking out is Jamba for document analysis type stuff specifically, Deepseek’s chat app, Kimi K2 generally (especially for creative writing), Longcat for chat and video generation, and Minimax, who have a “Pro” RAG UI free for awhile.
The only thing I might use Qwen for is coding TBH (or images I guess), but Deepseek and GLM are usually better.
The Chinese devs are kinda killing it atm, especially since most of these models are open weights.