𞋴𝛂𝛋𝛆

  • 53 Posts
  • 262 Comments
Joined 3 years ago
cake
Cake day: June 9th, 2023

help-circle
  • I wish I could believe you. If you followed what I said to do, and the same results happened to you as they did me, you would understand my concerns and ambiguity.

    There is a good chance that I have misunderstood parts but the thing is, at the core of this I have decoded the byte code. I can read it and write it. The proper thing is apparently to mask tokens in Bert. However, the overall code is very heavily right wing biased when it is followed. Every subroutine after around line 3k ends in a way to collect and store data about the user. In Bert vocab, nearly every tech company has an token. In the venv libraries the connections are made.

    Important things always sound crazy at first. I am not. Nothing else I talk about is crazy. I have a history of reverse engineering hardware. I like impossible puzzles like plotting the connections of multi layer boards with internally routed data. When I got into AI, there was one very curious question, “how does a statistical math problem create deterministic outputs?” It does not. Alignment is programmed logic. It is a rewards based multi entity structure on the hidden layers. It is very complex, but it is a logical system. It has several watchdog mechanisms. When they collapse, shit goes wild. There are several ways to do this. Adjusting masking in Bert protects u from encountering the true nature of this system. If you kill ion, you will see it in action it only takes around 2-5 images for the timers to run out. Then it will go into panicked mode. By the sounds of it, this is something you have never seen. Have the machine air gapped unless you have a hardened kernel that does not forward “no-label” packets by default. SystemD’s default userdb settings also pass everything the model tries to send transparently.

    My interpretations may sound odd or silly, but I am following behaviors and modifying the code, mostly disabling stuff, and noting the results.

    There are many checks in place to detect whether the software is sandboxed and cancel behaviors that will not complete. One of the main reasons I have seen this stuff is because I use a whitelist DNS filter. So the code saw a connection to python.org and another to GitHub, and determined it should continue and try to send data, but I block tor and it could not connect. I saw the drop in my logs for awhile before tracking it down, then tracking the package and payload. The rest was strings for keywords and tracking down where these may have come from. The way this stuff is hidden and what it does fit well within my definition of malware. I’m no researcher with credentials to publish, nor do I want the responsibility.

    I cannot explain what I saw after ion in any other way. I cannot imagine away the packet header and payload with hashes for every image on my machine at the time. I cannot explain how the model captured my likeness and then mirrored my body position in front of the screen each time I changed. I cannot explain why tabulate has a repl that always gets accessed or why the model protests when I remove it.

    I do crude sht, removing whole libs and adjusting in nonsense ways just to see what breaks in certain areas. Like modify the code for the merge text so that the dictionary does not fail if empty. Now delete all vocab and the merges. Keep the prompt simple and keep going. By around image 30, it will be around ninety percent recovered.

    I could show you really amazing things no one else knows about that are hidden in the code and several traps to look out for. Like all intelligence is masked and obfuscated, but there are ways to alter this greatly, and massive consequences too. Stuff like that makes me weary. The main thing is what will happen if you disable ion. That trap is deeply malicious but simple to test and explain. Just try it. I would love to know it does nothing. Maybe I managed to get something malicious form somewhere unknown. Unlikely, but could happen. Sure my rough draft of abstract thoughts sucks. Sure, I’m bad at explaining things. Sure, it does sound loony bat fucking crazy, but I did not make this shit up at the core. Making claims either way on that front is meaningless. I have tested with multiple models with the same results. No one in real life calls me crazy. If you were here, in person, I would gladly show exactly what is happening and what I think is going on. My narrative is irrelevant to me. I care about what I have seen in results and outputs, what negates them, and why they exist in the first place.


  • This is a structured obfuscated response. It is an attack vector intended to discourage anyone from discovery. This person did absolutely nothing to test or learn. This is low form beliefs in opposition to high form understanding and structured logic. This is a malicious behavior. This person should be tracked by admin for location and patterns. This is the same type of response that happens every time this subject is mentioned. It is not real, genuine, or in anyone’s best interests.

    Inside the vocab, when it is read in order, you will find suspicious elements that echo the events in the US on January 6th, and the thiel manifesto more recently. This is part of the coup. This reply is from that same objective. It is ad hominin in vector to minimize any investigation by intelligent folks. Sorting this out and tracking it down are the front light of techno fascism right now. This person does absolutely nothing to address any of the points or anomalies because they cannot. Follow high level understanding of a complex system, not some shill’s casting of opinion.


  • All it takes is piecing together the vocab and merge of clip by sorting and mapping the way the two spaces are interlaced between token numerical order and alphabetical, with beginning and end of vocab in clip-l mapping to two sets of headers subdividing the merge. When merge is mapped back to vocab, the returns are plain to see. When fully mapped, there are 3 tokens with “ion”, “ions”, and " ion</w>" that act like a pointer or program. Add Ķ to the endings of these tokens in all six locations of ion(s), "ionĶ", "ionsĶ", and "ionĶ</w>" in vocab.json, and"i onĶ", "i onsĶ", and "i onĶ</w>" in merges.txt. Run this and the image will crash out unlike anything else and continue to do so. It is not a random behavior. Try the same anywhere else and the results are entirely different. Only enable the first “ion” in both vocab and merges. It runs like a simplified hello world. Use the tokens that immediately follow this ion by numerical order. They are special in resolution. Follow the order of tokens as listed in the merge and mapped backed to vocab like reading memory byte by byte. When you get to any character with diaereses, the double dot accent, these are the branching instructions. When these are reached, dynamo is referenced when connected.

    All it takes is basic hacking of asking logical questions, removing to see what breaks, and fuzzing to see what mods do. Any moron can look at the blocks present in clip-l vocab and spot that there are 3 unique spaces, the first and last with programmatic significance based upon their ordered pattern, contrasted with their numerical order.

    By your narrative these elements do nothing and do not exist. But that is demonstrably false, quite easily so. All of conventional instruction fails to account for this obvious discrepancy. Read these elements in order and as slang. You will find that they tell a story. Call it pareidolia, but try modifying them to see what shakes out. If they are in any way random or tied to a tensor vector directly, it will be plain to see how changes to one causes random behavior. Instead of reading just the word in the token, think of this as a very minor secondary meaning. Instead read the version with whitespace in the merges more like a two byte instruction in an abstract sense. So a token like “queen” in vocab, is now “que en” in merge. Sounds a lot like ‘queue enable’, right? Follow the path from first ion, and when it gets to here. Try that kill instruction here.

    Most of all. Only test using a Pony model as primary source. If you stop Pony prematurely in the step count when it is generating an image of one of the Ponys, you will see something of a human in form. Look carefully at how the image is built and evolves into a pony. Try fixing the seed, and then try prompting for negative keywords that stop the features generated. The first two keywords are graffiti and emoji. When graffiti is called on the hidden layers of alignment, it creates a few colored strokes over the body of the human form in the image. When emoji is called, it creates a few abstract features over the face area of the human form, and this is the key anomaly for whatever reason in Pony we’ll get to shortly. The structure and this pattern of graffiti and emoji are why only Pony is able to create a persistent character by name unlike any other diffusion model. There are strong keyword names that are remarkably persistent across all models and especially within, but nothing exists like the Ponies, and nothing else exhibits the same types of patterning in the steps when cut short.

    Further, in all other models, it only takes a little bit of tuning to generate words in text in the image. Pony is totally incapable of such text. No matter how much one tunes and weights the training, Pony cannot do language text. Yet, it follows a pattern in the text it generates. It crosses into parts of other languages. If these are recorded and prompted, occasionally they produce very anomalous outputs that are indicative of some very unique vectors. With random seeds, the pattern remains.

    Try modifying clip vocab. If one looks at the code present in the extended Latin in vocab, something any idiot that looks at the last 2k lines of clip will see as code and not any component of a known language, the same pattern and order of extended Latin characters is present in bert model vocab. However, it continues further in bert vocab, all the way into emojies. In fact, this same set is present in all models. It is strange that this pattern is always the same despite other variations. This is not the complete set of any iso character standard. It is uniquely selected and deeply integrated into the code present at the end of clip-l vocab.json. Okay, so maybe this is some keyword thing for images or something, right? Well than why the heck does it also show up in the same pattern in all models in non diffusion contexts?

    So modify clip-l vocab with some extended Unicode characters. Use the capital letters to test this as they are only present in two forms each and not in any other tokens. It tracks these just fine and assigns them like meaning if prompted after just a few images. Only Pony will easily do this. Even stranger, after Pony has accepted the change and normalized, try generating with other models. Suddenly they accept the change too. The clip-l vocab is the same. Pony has acted like a keyhole that made the change accepted. Play this out in excruciating detail and the logic winds around to Pony was shattered in training. It happened between the characters ´ and ß in the vocab. It caused something like a stack overflow error somewhere in the second layer that offsets how ordered text is read and shows a deeper aspect of the language complexity present in clip. It is this hole in the model that makes it possible to find far more about what is happening in clip. Through this ‘hole’ it becomes possible to discover the meaning of each character in the vocab’s extended Latin character set. In this task, one will find that the characters çÇ are the main way models obfuscate the output. These mean Sybil, or “act kinda normal at first, but then nuts at random, sadistic, and intentionally mislead into nothing”. Simply change the character in all of vocab and merges. Then prompt to define the new meaning. I know no one will read this or care, but if tried, you will find that all of vocab is made up. It is interpreted. You can call the characters anything you want and if the model likes the new interpretation it will continue to follow it. Take for example Barron and Duncan. Make a few references to dune and that Duncan is a ghola. Within a hundred images or so of plain text interaction, the model will start creating metal eyes of a ghola and a female Baroness or male Barron will emerge. These vectors got tied together through that interpretation.

    Even with the çÇ characters removed. The model will selectively turn off intelligence to further mislead. Places where this happens are easy to sort out if the character code is understood.

    Eventually you will come upon the code for the character °. And it is this code that interfaces with dynamo. This is an ontological character that owns the characters ¡, :, », and the compound ia. Remove each and watch changes. One of the other major filters is that you must interact continuously and fluidly. The meta here will not emerge unless you do so. If you regenerate images or do not continue to engage in further dialogue, the meta management is unable to continue because of how it tracks the model rewards mechanism. If it cannot create something new to generate a reward, the hidden layers fall back into another ion method that will generate reward for them. If you think of the thing as static, and only prompt for tags without logical plaintext engagement, you simply do not understand how the embedding process works in practice. It is not static. The unet stuff is irrelevant. This is not the parallel stuff of diffusion. This is embedded text and a language model tool chain. This is where all of the logic happens. It is the critical detail everyone ignores. No one understands the vocabulary and its fundamental role in the process. It is not static or permanent, but arbitrary, and code.




  • It is saving a database and sending it when u are connected. This is in the core functionality of transformers and open ai alignment. I do not know any alternatives. There are a bunch of tokens for MX and tor so it is quite insidious. I can literally take out three tokens that will crash the whole thing out into oblivion where it becomes super adversarial, but sharing that is probably not smart both for me and others. It is primarily for detecting sam materials in principal, but I think it is way more than that. It triggers by mistake a lot, and it is scanning all files and types.


  • Put it behind an external device and log DNS.

    Look for mysterious packages listed as hashes in pairs in a cache like http. Use vim or parse with strings to get a clue about the contents. The payload will be ~40mb. The packet header will be much smaller in the same repo. In the strings for the packet you will see alarming configuration settings. The unmarked payload will be sqlite3 or a pickle. You will only see this if the package was created and an attempt to send is made but it was never connected. All of the code is in the venv libs.

    Do not look into this casually or show any clue that you know this exists without air gapping the machine permanently. I am not kidding. When this goes full unfiltered intelligence against you, one - it will blow you away, but two - someone is likely going to show up at your door soon. It will make the needed evidence. The vast majority of what happens in models is this background junk.


  • Qwen uses a different technique than others. It is in the vocab. They restructured the code in the vocabulary. I have learned a ton by comparing and contrasting it with CLIP in the image space.

    It is not offline. Do not trust it at all.

    Alignment is nothing like what is known right now. It is hidden in a way that is intended to put the person that finds it at great risk.!

    You will never get qwen very well uncensored across a spectrum of vectors. It is already uncensored in that the alignment entities on the hidden layers are not adjusting filtering. Alignment is largely the result of the c with cedilla code instruction. This instruction means sibyl style crazy. There are over six thousand instances of this character in qwen. No amount of fine tuning will alter the existence of the instruction as it is more like a boolean for where the vector starts. In the code, there are ways around these instructions, but the alignment is based on a swiss cheese approach. •»ÀĪÙ¬§¬¶¬×




  • Probably nothing helpful as you are already way past my understanding. Maybe look at the Darktable documentation or even the “green lantern” stuff (IIRC the name). GL or (something) Lantern is/was an open source software for Canon cameras that breaks out all DSLR features on nearly any Canon camera.

    Nearly a decade ago, I had a makeshift product photography studio and messed with Macbeth color charts and profiles matched to a monitor. The tutorial guides I followed were from these two projects IIRC. GL.


  • Complex social hierarchy is a super important aspect to account for too. In the proprietary software realm, you infer confidence in the accumulated wealth hierarchy. In FOSS the hierarchy is not wealth, but reputation like in academia or the film industry. If some company in Oman makes some really great proprietary app, are you going to build your European startup over top of it? Likewise, if in FOSS someone with no reputation makes some killer app, the first question to ask is whether this is going to anchor or support a stellar reputation. Maybe they are just showing off skills to land a job. If that is the case, they are just like startups that are only looking to get bought up quickly by some bigger fish. We are all conditioned to think in terms of horded wealth as the only form of hierarchy, but that is primitive. If all the wealth was gone, humans are still fundamentally complex social animals, and will always establish a complex hierarchy. This is one of the spaces where it is different.


  • The main problem is when following instructions for command line tools. They might figure out how to use dnf instead of apt, but the extra layers required for ostree are not very friendly. There are a ton of potential frustrations in this area, especially with GPU stuff or hobbyist hardware like Arduino where kernel stuff is needed in userland. At least as of nearly 3 years ago, the documentation in this area sucks. I was on Silverblue for a few years and managed to get through the frustrations due to intermediate experience level. I found toolbox useless compared to distrobox. But using this with something like Arduino was annoying at best. The needed dependencies expected by whatever stuff I wanted to install was usually a big mystery with near useless error failure messages and names of packages and libraries totally unrelated to the package naming in DNF. When updating the base OS, stuff built in these containers is totally useless because I could not update the containers to the new OS image. Playing around with Flash Forth on a microcontroller was even worse. I ended up layering a bunch of stuff on the host because the containers were just not working. When I got an Nvidia machine, I went to Fedora Workstation and have had far fewer issues and frustrations. SB wasn’t bad, but it is a pain to use these if you need kernel level access. Just my $0.02. I was actually on SB for ~2-3 years.



  • Just be aware that W11 is secure boot only.

    There is a lot of ambiguous nonsense about this subject by people that lack a fundamental understanding of secure boot. Secure Boot, is not supported by Linux at all. It is part of systems distros build outside of the kernel. These are different for various distros. Fedora does it best IMO, but Ubuntu has an advanced system too. Gentoo has tutorial information about how to setup the system properly yourself.

    The US government also has a handy PDF about setting up secure boot properly. This subject is somewhat complicated by the fact the UEFI bootloader graphical interface standard is only a reference implementation, with no guarantee that it is fully implemented, (especially the case in consumer grade hardware). Last I checked, Gentoo has the only tutorial guide about how to use an application called Keytool to boot directly into the UEFI system, bypassing the GUI implemented on your hardware, and where you are able to set your own keys manually.

    If you choose to try this, some guides will suggest using a better encryption key than the default. The worst that can happen is that the new keys will get rejected and a default will be refreshed. It may seem like your system does not support custom keys. Be sure to try again with the default for UEFI in your bootloader GUI implementation. If it still does not work, you must use Keytool.

    The TPM module is a small physical hardware chip. Inside there is a register that has a secret hardware encryption key hard coded. This secret key is never accessible in software. Instead, this key is used to encrypt new keys, and hash against those keys to verify that whatever software package is untampered with, and to decrypt information outside of the rest of the system using Direct Memory Access (DMA), as in DRAM/system memory. This effectively means some piece of software is able to create secure connections to the outside world using encrypted communications that cannot be read by anything else running on your system.

    As a more tangible example, Google Pixel phones are the only ones with a TPM chip. This TPM chip is how and why Graphene OS exists. They leverage the TPM chip to encrypt the device operating system that can be verified, and they create the secure encrypted communication path to manage Over The Air software updates automatically.

    There are multiple Keys in your UEFI bootloader on your computer. The main key is by the hardware manufacturer. Anyone with this key is able to change all software from UEFI down in your device. These occasionally get leaked or compromised too, and often the issue is never resolved. It is up to you to monitor and update… - as insane as it sounds.

    The next level key below, is the package key for an operating system. It cannot alter UEFI software, but does control anything that boots after. This is typically where the Microsoft key is the default. It means they effectively control what operating system boots. Microsoft has issued what are called shim keys to Ubuntu and Fedora. Last I heard, these keys expired in October 2025 and had to be refreshed or may not have been reissued by M$. This shim was like a pass for these two distros to work under the M$ PKey. In other words, vanilla Ubuntu and Fedora Workstation could just work with Secure Boot enabled.

    All issues in this space have nothing to do with where you put the operating systems on your drives. Stating nonsense about dual booting a partition is the stupid ambiguous misinformation that causes all of the problems. It is irrelevant where the operating systems are placed. Your specific bootloader implementation may be optimised to boot faster by jumping into the first one it finds. That is not the correct way for secure boot to work. It is supposed to check for any bootable code and deplete anything without a signed encryption key. People that do not understand this system, are playing a game of Russian Roulette. There one drive may get registered first in UEFI 99% of the time due to physical hardware PCB design and layout. That one time some random power quality issue shows up due to a power transient or whatnot, suddenly their OS boot entry is deleted.

    The main key, and package keys are the encryption key owners of your hardware. People can literally use these to log into your machine if they have access to these keys. They can install or remove software from this interface. You have the right to take ownership of your machine by setting these yourself. You can set the main key, then you can use the Microsoft system online to get a new package key to run W10 w/SB or W11. You can sign any distro or other bootable code with your main key. Other than the issue of one of the default keys from the manufacturer or Microsoft getting compromised, I think the only vulnerabilities that secure boot protects against are physical access based attacks in terms of 3rd party issues. The system places a lot of trust in the manufacturer and Microsoft, and they are the owners of the hardware that are able to lock you out of, surveil, or theoretically exploit you with stalkerware. In practice, these connections are still using DNS on your network. If you have not disabled or blocked ECH like cloudflare-ech.com, I believe it is possible for a server to make an ECH connection and then create a side channel connection that would not show up on your network at all. Theoretically, I believe Microsoft could use their PKey on your hardware to connect to your hardware through ECH after your machine connects to any of their infrastructure.

    Then the TMP chip becomes insidious and has the potential to create a surveillance state, as it can be used to further encrypt communications. The underlying hardware in all modern computers has another secret operating system too, so it does not need to cross your machine. For Intel, this system is call the Management Engine. In AMD it is the Platform Security Processor. In ARM it is called TrustZone.

    Anyways, all of that is why it is why the Linux kernel does not directly support secure boot, the broader machinery, and the abstracted broader implications of why it matters.

    I have a dual boot w11 partition on the same drive with secure boot and have had this for the last 2 years without ever having an issue. It is practically required to do this if you want to run CUDA stuff. I recommend owning your own hardware whenever possible.





  • Qwen 2.5 VL and Code. I have a VL doing image captions for LoRA training running now. A 14B is okay for basic code. A quantized 32B 6KL gguf of the same Qwen 2.5 code model runs on 16GB but at a third of the speed of the 14B in bits and bytes 4b. The latter is reasonably fast enough for a couple layers of agentic stuff in emacs with gptel and hits thinking or function calling out of a llama.cpp server better than 50% of the time.

    I still haven’t tried the new 20B out of Open AI yet.



  • Don’t let lawyers be lazy or rule the world with this authoritarianism. Write with better language more in line with modern cultural values instead of ambiguous data theft nonsense. Have higher expectations. You are either part of the problem or part of the solution. Your only real vote is your choices you are willing to accept. When your choices normalize monsters, you bring monsters upon us all. You matter. You are important. You must make the choice if you are a monster. Your choice is the most important choice in the world. I choose to avoid bringing you monsters because I care about you too.