My models don't have reasoning ability in llama-b9543 server but have in llama-cli

Schilling2304@thelemmy.club · 14 hours ago

Yesterday I needed this. Will install this. Thanks.

May I ask: have you noticed if the prompt processing speeds shown in llama-bench are vastly different from llama-server ? I have hundreds of tokens of difference.

Schilling2304@thelemmy.club · 7 days ago

You mean Gemma 4 ? You read in his discord ?

Schilling2304@thelemmy.club · 7 days ago

I confirm the same and it works now. I set it to maximum because fewer reasoning effort tokens cuts it directly.

Thanks

Schilling2304@thelemmy.club · 8 days ago

My models don't have reasoning ability in llama-b9543 server but have in llama-cli

Schilling2304@thelemmy.club · 8 days ago

I know it, seeing it in models titles.

Schilling2304@thelemmy.club · 8 days ago

Is uncensoring oneself a LLM difficult ?

Schilling2304@thelemmy.club · 9 days ago

Did you try any ? Because, I tried iglors and mradermacher, I got refusal to make a pipe bomb. Their answer are funny because they say to study academic engineering instead, lol. Still a refusal. I will try this one.

Schilling2304@thelemmy.club · 10 days ago

The Arch Linux forum make people do that.

Schilling2304@thelemmy.club · 11 days ago

Someone may have the same question in the future and there will be answers. You not responding is not that bad but it is even better that you do and provide an update to your situation, if you wish.

Schilling2304@thelemmy.club · 17 days ago

No. In fact, that is nice. I should try.