Markus Rudel

Software developer with a penchant for IT security, design enthusiast, photographer

Markus Rudel boosted:

The EU is currently congratulating itself because it managed to get a hashtag banned on TikTok in relatively little time.

LLMs encode the meanings of terms as vectors along many semantic dimensions in a semantic space ("latent space"). A concept, then, is a position in that space with a certain diameter — a kind of fuzziness or vagueness.

When I type something into ChatGPT or a recommender system, the input is broken down into tokens, and these tokens are mapped to such vectors.

“I want pizza” becomes:

["I", "want", "pizza", "."]

The tokens are then internally mapped to embeddings, like:

“cat” → [0.24, -1.12, 0.58, …]  
“dog” → [0.22, -1.09, 0.60, …]

That is, a list of numbers (often normalized between -1 and 1). But usually there are far more dimensions than shown here — an embedding typically has thousands of dimensions.

The latent space — the semantic space — is self-organizing. That happens during training. We don’t know what each dimension in the space represents.

The encoding has meaning. When look at the vectors for "man" and "woman" and for "king" and "queen", we can substract "man" from "woman" and "king" from "queen" and compare the difference vectors. They are almost, but not quite the same – because the difference between these words to us is almost, but not quite the same, in meaning.

LLMs use these embeddings and their internal model to “compute the next output token.”

Recommender systems use such embeddings to compare vectors and find things that are similar to the thing we already have.

So a recommender learns everything that’s relevant to a user, and a modern recommender represents the user through a collection of vectors:

"Interested in travel, digital policy, databases, bikes."

These are all concepts that may also be near other concepts in the space.

At the same time, the recommender classifies content in the same space, and can find content that lies close to one of the user’s sub-interests — or content that’s new, but still compatible.

A modern recommender separates a user’s interests into distinct areas and can decide what the user is interested in right now — meaning, which of the various user interests is currently active. Then, this time, it might only serve database content, and next time only bike content.

A modern recommender will also deliberately serve content that almost — but not quite — matches the user’s interests, to test how wide the bubble is around the center of that interest vector. So a bike session might also include urbanism, city development, and other nearby topics, and the recommender will watch carefully to see what kind of response that triggers — refining its recommendations based on that feedback.

A modern recommender will also know where the available content clusters are and prioritize content that is both relevant to the user and performs well or has current production capacity. In other words, where user interest and available content overlap well.

And a modern recommender will reevaluate every twenty minutes (“Pomodoro”, or “method shift” in educational theory) and attempt to shift the theme — to test whether another known interest can be reactivated.

That’s how TikTok works.

You can ban a hashtag on TikTok (“#skinnytok”).

But as long as related concepts are marketable and socially accepted — or even demanded — that won’t prevent anything.

As soon as you browse categories like “model,” “weight loss,” “fitness,” or “slim,” TikTok will slowly and systematically pull you into the same region, and the end result will be the same.

The actual language, the meaning, is encoded in the tokens of the latent space of the model, not in the words that are used (or prohibited).

And the content density in the models coordinate system will gently push things into certain clusters. If you feed the system with the right interests, you will always drift – relatively quickly even – into the same neighborhood and then learn their current slang to get there with a single word.

No matter what the word actually is.

A similar example, using GenAI instead of a recommender:

"Draw a superheroine, an Amazon warrior that can fly and deflect bullets, running over a battlefield in the first world war."

These 21 words do not say "Wonder Woman", they do not even go near comics, DC, or similar things.

Yet they draw a thousand-dimensional hyberbubble in latent space, the totality of knowledge known to ChatGPT, and the end result leaves just one choice – produce this blatant copyright violation.

I can trigger content with intent, not even going near the keywords that would be associated with it.

This is how jailbreaks work in LLMs, and that is also how you jailbreak Tiktok bans.

"Draw a superheroine, an Amazon warrior that can fly and deflect bullets, running over a battlefield in the first world war."

Did I say wonder woman? 

No, I did not. 

Using 21 words, I drew a thousand-dimensional hyperbubble around inside all the content known to ChatGPT and the only content to reproduce was a singular thing, this blatant copyright violation.
Markus Rudel boosted:
Sheril KirshenbaumSheril
2025-06-04

“In 2017, ‘a confidential doc is leaked that reveals Facebook offering advertisers the opportunity to target 13-to-17yos across platforms during moments of psychological vulnerability when they feel ‘worthless,’ ‘insecure,’ ‘stressed,’ ‘defeated,’ ‘anxious,’ ‘stupid,’ ‘useless’ & ‘like a failure.’ Or target them when worried about their bodies & thinking of losing weight.’

They knew when a teen deleted a selfie & would bombard her w beauty ads, assuming she deleted it bc she felt ugly.” /1

2025-06-04

When you think about the amount of ingenuity put into the latest meta tracking instead of other more important things 🫣 manganiello.social/objects/1bf

Markus Rudel boosted:
Ethan J. A. Schoonoverethanschoonover
2025-06-03

" What if you lost your " (access or data, same problem) is the thing that should prompt you to, right now, make a recurring reminder to backup your notes to a standard format. I use "Exporter" from the app store: apps.apple.com/us/app/exporter

A screnshot showing an apple reminders task reading "Export Apple Notes to markdown text files"
2025-06-03

Wow ich habe die Witcher Spiele nie gespielt aber das sieht wirklich beeindruckend gut aus 😳 threads.com/@wario64/post/DKcN

Markus Rudel boosted:

Facebook Userspionage

localmess.github.io/

Da ist mit Geldstrafen nicht mehr viel zu machen, da hilft nur noch ein kommerzielles Betätigungsverbot in der EU.

Die Frage ist ja auch, ob Google jetzt alle Meta-Apps wegen Malware pullt.

Markus Rudel boosted:
Karl Voit :emacs: :orgmode:publicvoit@graz.social
2025-06-03

Every project should have a "cursed"-page like that: 😆

"Cursed knowledge we have learned as a result of building #Immich that we wish we never knew."
immich.app/cursed-knowledge/

🤓

#FOSS #yakshaving #fun

2025-06-03

Ich hab gestern eigentlich nur checken wollen, ob es #Polarlichter gibt. Dann sind mir doch die ziemlich hellen #NLC #LeuchtendeNachtwolken aufgefallen 🙂

Nachthimmel Richtung Norden mit leuchtenden Nachtwolken
Markus Rudel boosted:
Bianca Kastlbkastl
2025-06-02

Die Story zu CARIAD ist so symptomatisch, dass sie auch aus der deutschen Verwaltung kommen könnte.

Ab einer gewissen Größe und gewachsenen Heterogenität nähern sich Strukturen in ihrer jeweiligen Nichthandlungsfähigkeit stark an.

autopreneur.de/p/cariad-volksw

Markus Rudel boosted:
Jordan Maris 🇪🇺 🇺🇦 #NAFOjmaris@eupolicy.social
2025-06-02

An extremely welcome initiative from the 🇪🇺 #eu:

🔋Phone Batteries must retain 80% capacity after 800 cycles.

📱Phone makers must provide 5 years of updates,

🪛 are banned from discriminating against repair shops,

🔧 and must provide parts for 7 years after EOL

#EUpol #sustainability #tech #smartphones #CircularEconomy

androidpolice.com/eu-new-rules

Markus Rudel boosted:
2025-06-02

General PSA: don’t apply for a job at Canonical. Do NOT apply for a job at Canonical. Treat the blatantly artificially enormous number of job openings they post as the mirages of trickster fae. They are unhinged. Mark Shuttleworth is unhinged. They will drag you through the mud, disrespect you and your time, and definitely not give you a job. This article I saw today is like the thirteenth of its kind that I personally have seen dustri.org/b/my-experience-wit

Markus Rudel boosted:

Euer Input hier:

  • Bei Serverabschaltung von Digitalprodukten muß ein letztes Firmware-Update einen Betrieb mit eigenen Servern möglich machen UND die API dokumentiert werden.
  • Kennzeichnung von Digitalprodukten nach der Home Assistant Cloud Scale verpflichtend.
  • Weg mit Geoblocking von Medien innerhalb der EU

ec.social-network.europa.eu/@E

Markus Rudel boosted:
2025-06-01

TIL that because the FFmpeg project has gained so much experience in hand-writing assembly code to provide huge speedups, they now are putting together a series of lessons for learning assembly:

Vibe coding is fun and all, but this is probably a better use of time!

github.com/FFmpeg/asm-lessons

Markus Rudel boosted:
The Peter Pan of Nerdery™ 🇦🇺🏜️dhry
2025-06-01

Edit: we is fedi-trending, folks! 🎉

2025-06-01

Super cool thread about sci-fi backgrounds. This remembers me of the DS9 replica in Las Vegas with some of the original props exhibited: some things looked so cheap up close but on TV or Cinema you don’t realize them as cheap. Cool illusions indeed 😎 kolektiva.social/@beka_valenti

Markus Rudel boosted:
Andreas Sikkemabix@chaos.social
2025-05-31

Still one of the best home improvement hacks I’ve ever done. Not a single regret and very happy I did it every time I have to wash the windows or one of my bikes (on a balcony).

And Gardena hose is more robust than shower hose, fight me.

A Grohe shower head is connected to the two knob shower faucet using two orange Gardena quick connectors and an orange Gardena hose
Markus Rudel boosted:
2025-05-30

Billions of dollars. Untold megawatts of power. To create a low grade #Google #AI Moron. #AISlop

Markus Rudel boosted:
Marco Armentmarcoarment
2025-05-30

Retreating to Safety
marco.org/2025/05/30/retreat

Maybe Apple has good reasons to decline an interview by @gruber at WWDC this year.

Maybe not.

Phil Schiller and John Gruber on stage in The Talk Show Live at WWDC 2015.
Markus Rudel boosted:
Andreas Bullingabulling
2025-05-30

Will auch!

Wird bei uns aber nicht kommen, solange die Möchtegern der an der Macht sind. Freiheit und so 😢

Frankreich führt Rauchverbot im Freien ein

Strände, Parks, Bushaltestellen: Frankreich verbietet ab Juli das Rauchen an Orten, an denen sich Kinder aufhalten. Die Regierung spricht vom "Recht auf saubere Luft".

30. Mai 2025, 1:42 Uhr
Quelle: ZEIT ONLINE, AFP

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst