this post was submitted on 27 Jan 2025
Technology
Question: as I understand it so far, this thing is open source and so is the dataset.
With that, why would it still obey Chinese censorship?
Even though its cost is orders of magnitude lower than that of comparable models, DeepSeek still cost millions to train. Unless someone's willing to invest that much just to retrain it from scratch, you're left with the alignment of its trainers.
Good point.
Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?
Yeah, I guess you could realign it without retraining the whole thing! Dunno what the cost would be, though. Sometimes this is done with a cohort of human trainers 😅
I feel like we're talking about a guard dog now...