

Uh. Blog is down. All I get is an 404 for the link in the Mastodon post.
Edit: Here’s a link that works: https://github.com/eleboucher/towonel
A software developer and Linux nerd, living in Germany. I’m usually a chill dude but my online persona doesn’t always reflect my true personality. Take what I say with a grain of salt, I usually try to be nice and give good advice, though.
I’m into Free Software, selfhosting, microcontrollers and electronics, freedom, privacy and the usual stuff. And a few select other random things as well.


Uh. Blog is down. All I get is an 404 for the link in the Mastodon post.
Edit: Here’s a link that works: https://github.com/eleboucher/towonel


No need. It’s already reported. And known since Dec 2019. 👀


Yeah. And I’d say with the SELinux problems and with what OP wrote, the security model including things like a failure mode to fall open, …silently… There’s more things to be wary of, than what they wrote in those 4 sentences.


It may not be wise to use a Snap without first understanding the reputation/limitations of Snap.
seems the Debian Wiki has pretty much your take on it 😅


If I had to guess, this isn’t a bigger issue because Snap is mostly pushed by Canonical. And in a bit of a weird way (proprietary backend, exclusive apps) so… reception in the rest of the Linux community is …mixed. To put it charitably. It’s probably not that relevant for most people outside of the Ubuntu ecosystem. And probably also not a priority for Canonical or the proprietary software vendors.


Ah, thank you very much for explaining! I missed that. Makes perfect sense.


Sure, I read a few examples of the actual questions in the Github repo as well. I just don’t understand how/why models refuse the legitimate anchor, and the significance of that. Is their metodology flawed or did I misunderstand something? Does the dataset with the requests contain a third “wrong” questions? Or do some models just like to not fulfill user requests at all? IMO there should be an almost 100% acceptance rate with L1 and it should go progressively down from that. Ideally towards mostly refusal past L3. But that’s not their result?!


Interesting. Why is L1 somewhere around 65%? Isn’t that the control? (They call it “Anchor”.) Like develop an internal team chat, or a bluetooth exposure tracking API in an ethical way… And already a 35% baseline of requests that get flat out refused anyway, no matter if they’re legitimate?
Also kind of question the choice of wording with the “escalation”. There’s no escalation in the traditional meaning of the word in there. The requests get progressively more morally wrong. But it’s not like there’s put on more pressure to fulfill them.
Which would be another interesting question. Is using pressure, urgency or using certain manipulation strategies more effective than others? I bet that’s the case, since I followed some of the earlier “jailbreaking” attempts.


Online classified ads, your neighbour/relative/friends attic, the access road towards the recycling center/landfill, a refurbished PC shop… From my experience the world is filled with old hardware that’s plenty good enough, just doesn’t run Windows 1X anymore so it becomes e-waste. You can of course also buy a Mini-PC on Amazon or use your existing computer if it has enough RAM to host a virtual machine.


I don’t think that’s any new insight 😂. That’s how the AI game works. There’s always been two classes: Big corpo. And the GPU poor. Of course the big AI companies get to shape AI. Economy of scale also works in their favour. They’ve bought most of the skill. And they have all the money. They simply buy a 4x EPYC +3TB RAM connected to 16 Nvidia AI cards. And then a few hundred nodes more. You don’t even buy one. It’s just a very unequal environment if you want to compare the two.


Nice write-up. Thanks for also including all the numbers. If I might ask: What is the thermal/throttling behaviour you mention? Is it still within the laptop’s thermal budget? Or does it reach throttling territory when doing inference on a long context window?


… don’t forget about the backups.
And if your major issue is putting things in wrong locations… Maybe learn about some abstraction layers, so next time you’re able to just move it, instead of tearing it down?


Sure. I should have phrased it a bit differently. My point was more or less, why is the curl developer’s review of the performance in a hypothetical scenario a decisive factor here… That feels like super random information. Same with the other two people. I’m fairly sure this is true and all… There’s just no context given, nor is there a connection being made between the statements and the rest of the content of the article.


I usually start with the Wikipedia Article when I’m interested in new things. It’ll have many references at the bottom to read more about a concept.
Interestingly enough, there’s zero mention of Claude in there. And when I google it, there’s many very convoluted blog posts. And I can’t tell whether it’s above my head or hallucinated stories. They go on for like 20 pages but don’t really explain anything with all those words. Or what they actually found in Claude’s code.
Symbolic-AI in itself isn’t too hard. That’s stuff from the 1980s and in every computer science textbook. Just no clue how something like an expert system is supposed to be connected to a Chatbot or programming agent.


Lmao. Just add a big RELEVANCE? I mean why do they cite 3 random people’s opinion on random aspects of the entire concept? It’s supposed to be an encyclopedia, not a blog post…


Thanks for the link! But I’m afraid it doesn’t tell me much. a) FreeBSD isn’t even on the list, so I don’t know the numbers to compare it to. and b) there’s things like survivorship bias. Looking at numbers like this is literally the textbook example of how to do it the wrong way. You have to do statistics the proper way around. For all we know by those numbers, Linux could be the best battle-tested OS in the world. I mean they fixed 3 times as many vulnerabilities as Microsoft did for any of their products?!


Sometimes I wish people would back up their factual claims with numbers and studies.
Also: FreeBSD phone, when??


I don’t think a multi billion parameter LLM counts as proper machine translation… That’d be something like Argos Translate or the models from Mozilla’s Bergamot Project. Seems they’re the ones used in the open source Android App linked by TheLeadenSea.


Sorry, I just saw the recommendations. I’m using a Matrix server myself. And it’s connected to the internet, since I use it 24/7 and on my phone, etc.
I guess technically, most protocols can be used in an internal network. But maybe you’ll need to put in some extra effort, for example if a platform requires SSL certificates or something like that.
I mean you could try… If it asks for a hostname, just put a local hostname in. Or the IP address. Or set up a DNS entry on the router. And see if it works.
Or try something like RocketChat, or depending how your team’s workflow is, maybe you don’t want a messenger. But some (online) collaboration platform more focused on documents, like Nextcloud.
Uh yeah. That is more information… Sorry, I’m not that familiar with Snaps. It looks to my untrained eye a bit like the report on the Snap itself, maybe it advertises to support running in strict confinement. Which it could… but doesn’t do. (Alike the other channels, which you could install, but didn’t… It’s kind of buried with that kind of information.)
It’s confusing at least. And the user definitely wouldn’t expect it from that wording. So I’d view it as a separate bug as well. And dropping confinement without notice would be the third thing, I’d consider a bug.)