A software developer and Linux nerd, living in Germany. I’m usually a chill dude but my online persona doesn’t always reflect my true personality. Take what I say with a grain of salt, I usually try to be nice and give good advice, though.

I’m into Free Software, selfhosting, microcontrollers and electronics, freedom, privacy and the usual stuff. And a few select other random things as well.

  • 5 Posts
  • 855 Comments
Joined 5 years ago
cake
Cake day: August 21st, 2021

help-circle
  • Uh yeah. That is more information… Sorry, I’m not that familiar with Snaps. It looks to my untrained eye a bit like the report on the Snap itself, maybe it advertises to support running in strict confinement. Which it could… but doesn’t do. (Alike the other channels, which you could install, but didn’t… It’s kind of buried with that kind of information.)

    It’s confusing at least. And the user definitely wouldn’t expect it from that wording. So I’d view it as a separate bug as well. And dropping confinement without notice would be the third thing, I’d consider a bug.)








  • Sure, I read a few examples of the actual questions in the Github repo as well. I just don’t understand how/why models refuse the legitimate anchor, and the significance of that. Is their metodology flawed or did I misunderstand something? Does the dataset with the requests contain a third “wrong” questions? Or do some models just like to not fulfill user requests at all? IMO there should be an almost 100% acceptance rate with L1 and it should go progressively down from that. Ideally towards mostly refusal past L3. But that’s not their result?!


  • Interesting. Why is L1 somewhere around 65%? Isn’t that the control? (They call it “Anchor”.) Like develop an internal team chat, or a bluetooth exposure tracking API in an ethical way… And already a 35% baseline of requests that get flat out refused anyway, no matter if they’re legitimate?

    Also kind of question the choice of wording with the “escalation”. There’s no escalation in the traditional meaning of the word in there. The requests get progressively more morally wrong. But it’s not like there’s put on more pressure to fulfill them.
    Which would be another interesting question. Is using pressure, urgency or using certain manipulation strategies more effective than others? I bet that’s the case, since I followed some of the earlier “jailbreaking” attempts.



  • I don’t think that’s any new insight 😂. That’s how the AI game works. There’s always been two classes: Big corpo. And the GPU poor. Of course the big AI companies get to shape AI. Economy of scale also works in their favour. They’ve bought most of the skill. And they have all the money. They simply buy a 4x EPYC +3TB RAM connected to 16 Nvidia AI cards. And then a few hundred nodes more. You don’t even buy one. It’s just a very unequal environment if you want to compare the two.




  • Sure. I should have phrased it a bit differently. My point was more or less, why is the curl developer’s review of the performance in a hypothetical scenario a decisive factor here… That feels like super random information. Same with the other two people. I’m fairly sure this is true and all… There’s just no context given, nor is there a connection being made between the statements and the rest of the content of the article.


  • I usually start with the Wikipedia Article when I’m interested in new things. It’ll have many references at the bottom to read more about a concept.

    Interestingly enough, there’s zero mention of Claude in there. And when I google it, there’s many very convoluted blog posts. And I can’t tell whether it’s above my head or hallucinated stories. They go on for like 20 pages but don’t really explain anything with all those words. Or what they actually found in Claude’s code.

    Symbolic-AI in itself isn’t too hard. That’s stuff from the 1980s and in every computer science textbook. Just no clue how something like an expert system is supposed to be connected to a Chatbot or programming agent.






  • Sorry, I just saw the recommendations. I’m using a Matrix server myself. And it’s connected to the internet, since I use it 24/7 and on my phone, etc.

    I guess technically, most protocols can be used in an internal network. But maybe you’ll need to put in some extra effort, for example if a platform requires SSL certificates or something like that.

    I mean you could try… If it asks for a hostname, just put a local hostname in. Or the IP address. Or set up a DNS entry on the router. And see if it works.

    Or try something like RocketChat, or depending how your team’s workflow is, maybe you don’t want a messenger. But some (online) collaboration platform more focused on documents, like Nextcloud.