this post was submitted on 16 Feb 2025
120 points (94.8% liked)

Not The Onion

cross-posted from: https://feddit.org/post/7978920

all 9 comments
[–] phoenixz@lemmy.ca 21 points 6 days ago

Taking away someone's voice because they said naughty words

We truly do live in all Black Mirror episodes combined

[–] AnUnusualRelic@lemmy.world 4 points 4 days ago

Noted. Don't rely on crazy and prudish US companies for anything vital. You never know what they'll be offended by next.

[–] thisbenzingring@lemmy.sdf.org 12 points 6 days ago

fucking hell, that is so bullshit

suck a dirty dick ElevenLabs

[–] Contramuffin@lemmy.world 6 points 6 days ago

Genuine Black Mirror moment

[–] dandelion@lemmy.blahaj.zone 5 points 6 days ago (1 children)
[–] ogeist@lemmy.world 4 points 6 days ago
[–] chiisana@lemmy.chiisana.net 3 points 5 days ago (1 children)

There are really compelling open-source models like Zonos coming out; ElevenLabs will need to figure out how to thread the needle to keep everyone happy while other solutions eat into the pie.

[–] Blass_Rose@pawb.social 1 points 4 days ago

Oh I'm glad this tech went somewhere useful! I remember reading the paper and toying with the models they released as a proof of concept like... 8 years ago? It was really powerful back then. The ability to do TTS of someone's voice given literally 3 seconds of training data?! (In fact I found that it worked better with short, nonsense audio clips than with clips of someone actually saying anything. Saying "test test test" worked way better than reading an actual sentence.) But now it looks like it can actually handle tone well. It's also probably way better now, and less... asthmatic sounding.