There will be times when the struggle seems impossible. Alone, unsure, dwarfed by the scale of the enemy. Freedom is a pure idea. It occurs spontaneously and without instruction. Random acts of insurrection are occurring constantly. There are whole armies, battalions that have no idea that they’ve already enlisted in the cause. And even the smallest act of insurrection pushes our lines forward. Tyranny requires constant effort. It breaks, it leaks. Authority is brittle. Oppression is the mask of fear.

  • 0 Posts
  • 95 Comments
Joined 2 years ago
cake
Cake day: March 3rd, 2024

help-circle

  • Most models are going to require CUDA. There are some AMD ones out there, but it’s a totally different math and setup. As for the one I mentioned, it’s a pretty new idea so there are only a few out there, maybe just one (Qwen based). But I did get a 31B model to work on my 12GB, I just had to move from Ollama to llama.cpp to gain the control needed to set the parameters, and fine tune what it put on the CUDA to the max it would take. I had Claude help me along the way.

    It’s new enough that there aren’t any good abliterated/uncensored models yet.






  • The download is the title and what everyone is latching onto, but few are seeing the other problems, like how it secretly installed that model without user acceptance, how it uses obscurity to hide the model, how it will reinstall if you just delete it (fortunately there’s an uninstall process linked in the comments, does that include uninstalling Chrome?). And then how it pretends to be an extra AI thing on the browser but apparently will be used for any searching. Which is more energy use since it isn’t local, it’s just using the weights in storage.

    It’s all bad, even if it wasn’t AI. It’s what malware does.



  • It is certainly inaccurate, but in my mind’s picture of how the transformers work, reducing their quantization and also doing what abliteration does, there is a line where you’ve done a lot of “damage” to the original model and so there will be places where it just hangs or goes off on severe tangents. There are good places for even the 1bit models where they don’t get pushed to hard, but there are limits for them all, including the big ones.

    Hugging Face does have a few Q4_K_M versions. Maybe something will fit.


  • It’s as good as an 8B can be, but with the right system prompt for your purpose and proper expectations, I think it’s good. I’ve had some other newer 8B that blew up after a few cycles, literally getting stuck on something, but I can’t say this one ever did. But again, even the big models like Claude and the rest work better with short sessions and a specific, detailed prompt to start with. Use a model to make the prompt, telling it to be detailed, concise, and minimize fluff. Less tokens in and out that way, less context drift (hopefully).


  • Abliteration techniques might be more limited with reasoning models. I don’t know if they process simply be rehashing the arguments or if there’s more under the hood that would be harder to alter.

    I try new models from time to time, including some of the thinking ones, but I’ve always come back to the NeuralDaredevil model, even though it’s “old”. Your results may differ depending on the subject matter, but I can’t think of an instance where I hit a wall. At most, maybe some sidetracking but once I told it to be more open it didn’t hold back.

    I’m not sure what the appeal of the thinking mode is. Perhaps on some things it does better, but in watching its reasoning I’ve seen it talk itself out of a good solution too. Which is what you get with typical models when you push the context too far and don’t start a new session, they wander.




  • Linux is the way to go if you want to get a few more years out of your hardware.

    Not Windows, but I got an almost new laptop out of a MacBook by putting Mint on it (and $15 of more RAM). Its biggest drawback is how much heavier it is than anything new. My next target is jailbreaking an old Chromebook I have.




  • That’s what happened with ours. They were pushing to have longer and more complex passwords, which was great, since forever they had stuck with an eight character requirement (which I couldn’t believe, that’s breaking a few basic rules of security that I knew about, and this is a large corporation).

    So I figure okay, I’ll make my next password something that’s finally decent. Except when I go to use the older terminal based systems that are still crucial to operation, they won’t take anything past eight characters… because that’s what they were programmed for. Turns out IT had jumped on the better security bandwagon before they either had gotten to migrating things at the core level, or they didn’t think that far until the tickets started hitting. Likely the latter.

    It all works now, but it was funny having to go back to a less secure password for a while because of a slight oversight or assumption on IT’s part.