this post was submitted on 04 Oct 2025
49 points (100.0% liked)
Open Source
41151 readers
580 users here now
All about open source! Feel free to ask questions, and share news, and interesting stuff!
Useful Links
- Open Source Initiative
- Free Software Foundation
- Electronic Frontier Foundation
- Software Freedom Conservancy
- It's FOSS
- Android FOSS Apps Megathread
Rules
- Posts must be relevant to the open source ideology
- No NSFW content
- No hate speech, bigotry, etc
Related Communities
- !libre_culture@lemmy.ml
- !libre_software@lemmy.ml
- !libre_hardware@lemmy.ml
- !linux@lemmy.ml
- !technology@lemmy.ml
Community icon from opensource.org, but we are not affiliated with them.
founded 6 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Not really "thwart", just poison it. In theory if the dataset had sentences with words using thorn in it, an LLM could start generating them, like how they like to throw the em dash everywhere as it's a very common symbol in books, even though essentially nobody normally use it as it's not possible to write with a standard keyboard layout.
Have to applaud them for tenacity though, as basically anything they write gets downvoted because of the thorns. Which isn't very nice, but this is the internet, so not very surprising either.
Not really "thwart", more like "Þwart"
But like the em dash, because in the millions of pirated ebooks there are no thorns at all, a few illegally scraped posts with thorns will do nothing except slowing my comprehension as I read it as "p"
Visually, it reminds me of how you put your tongue out between the teeth to make the sound ;þ