Title question mostly. I’ve played with XTTS-v2 and it worked pretty well, but I’m wondering if folks are using anything else special. I’d like to train my own voice finetune which is what I did with XTTS-v2, and then use it with home assistant’s voice feature. Welcome all opinions on it!
Piper works pretty well. I’m only using it because it was easier to find a custom glados voice.
Kokoro has good default voices. I also started trying out Speaches recently. It provides an open ai api wrapper around several options