this post was submitted on 11 Feb 2025
42 points (97.7% liked)

Selfhosted

42716 readers
256 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

Title question mostly. I've played with XTTS-v2 and it worked pretty well, but I'm wondering if folks are using anything else special. I'd like to train my own voice finetune which is what I did with XTTS-v2, and then use it with home assistant's voice feature. Welcome all opinions on it!

top 16 comments
sorted by: hot top controversial new old
[–] ALERT@sh.itjust.works 10 points 1 week ago (1 children)

if you need English - right now it's kokoro-fastapi https://github.com/remsky/Kokoro-FastAPI set this container up and use it as an openai TTS endpoint using this hacs integration https://github.com/sfortis/openai_tts

Very nice! I'll check this out!

[–] johntash@eviltoast.org 4 points 1 week ago (1 children)

Piper works pretty well. I'm only using it because it was easier to find a custom glados voice.

Kokoro has good default voices. I also started trying out Speaches recently. It provides an open ai api wrapper around several options

[–] VocalCritic@lemmy.ml 1 points 1 week ago

Any tips on getting speaches to work with Home assistant? Got speaches working but haven't gone the next step yet.

[–] just_another_person@lemmy.world 2 points 1 week ago (1 children)

Pretty much just personal preference at this point. XTTS is certainly not the most efficient though.

[–] scrubbles@poptalk.scrubbles.tech 1 points 1 week ago (1 children)

any personal preferences you recommend?

[–] just_another_person@lemmy.world 3 points 1 week ago* (last edited 1 week ago) (1 children)

Pico, Piper, Mary, and Google all run locally and off of CPU only.

I think all the rest require cloud accounts or acceleration hardware to work quickly.

I'm personally fine with Mary or Piper, but I know some people like the fancier ones.

[–] sxan@midwest.social 1 points 1 week ago (1 children)

Google? Have you verified that?

[–] just_another_person@lemmy.world 2 points 1 week ago (1 children)
[–] sxan@midwest.social 1 points 1 week ago (1 children)

The docs don't say it's completely offline. Can you turn off your LAN connection and it still works? Have you tried this? Or just firewall off out bound access to Google services?

This comment:

Contrary to what the name suggests, the integration only does text-to-speech and does not translate messages sent to it.

doesn't say it doesn't call out to Google services; it says only that it doesn't use translation services. I didn't see anything else that implies it doesn't send data to Google.

[–] just_another_person@lemmy.world 1 points 1 week ago* (last edited 1 week ago) (2 children)

If it doesn't require an API key in the config, it's offline. My HA works totally offline unless I need to do updates, and it's always worked for me.

You can also view the code, cuz open source.

[–] sxan@midwest.social 1 points 1 week ago

Yeah, so I dug into it, and it's definitely not offline. It uses gtts, which ultimately makes calls to google.com for the tts. You can track it down yourself, but you'll eventually end up here, which talks about how to change the google host name in case it's blocked.

I'm not sure why you believe not needing an API key means it isn't calling a Google API, especially in this case where it clearly states it's using an unofficial channel - which is the same trick third party YouTube clients use to access YouTube videos without using API keys.

[–] sxan@midwest.social 1 points 1 week ago (1 children)

👍 Thanks. I'm surprised, and still skeptical, but thanks.

[–] un_ax@lemmy.sdf.org 1 points 1 week ago

I was curious and it uses gTTS.

It calls what's probably the "speak" button on translate.google.com

This project is leveraging the undocumented Google Translate speech functionality

[–] Vector@lemmy.world 2 points 1 week ago

Don’t know much about the training side of things, but I have Piper set up with home assistant using the Wyoming protocol and it just goes. Some of the out-of-the-box voices are pretty decent too.

[–] HybridSarcasm@lemmy.world 2 points 1 week ago

You might consider asking this in the !homeassistant@lemmy.world community, too.