

1·
7 days agoYou mean Gemma 4 ? You read in his discord ?


You mean Gemma 4 ? You read in his discord ?


I confirm the same and it works now. I set it to maximum because fewer reasoning effort tokens cuts it directly.
Thanks


I know it, seeing it in models titles.


Is uncensoring oneself a LLM difficult ?


Did you try any ? Because, I tried iglors and mradermacher, I got refusal to make a pipe bomb. Their answer are funny because they say to study academic engineering instead, lol. Still a refusal. I will try this one.


The Arch Linux forum make people do that.


Someone may have the same question in the future and there will be answers. You not responding is not that bad but it is even better that you do and provide an update to your situation, if you wish.


No. In fact, that is nice. I should try.
Yesterday I needed this. Will install this. Thanks.
May I ask: have you noticed if the prompt processing speeds shown in llama-bench are vastly different from llama-server ? I have hundreds of tokens of difference.