With just 15 seconds of any voice, Fish Speech can reliably synthesize natural and fluent speech while maintaining the given timbre, style, and accent. Our open-source team, creators of So-VITS-SVC and Bert-VITS2, proudly introduces Fish Speech.
I have been using TTS solutions for over 15 years. Currently using the well-known ones such as 11labs, wellsais, etc. Fish Speech is my new go-to. It is fast and cost-effective. I have also loaded it locally, and it works very well locally. However, at the current cost, using the website is more than worth it.
This is the 2nd launch from Fish Speech. View more
Fish Speech 1.4
Open-Source Multilingual Text-to-Speech with Voice Cloning
Your Voice, Your Way: Open-Source TTS
Powerful, fast, and natural speech in any language. Clone voices instantly. Self-host or use our service. Lightning-fast, affordable pricing.
Excited to introduce Fish Audio 1.4 - now open-source and more powerful than ever! 🎉
What's new:
- Trained on 700k hours of multilingual data (up from 200k)
- Now supports 8 languages: English, Chinese, German, Japanese, French, Spanish, Korean, and Arabic
- Fully open-source, empowering developers and researchers worldwide
Our mission: Make cutting-edge voice tech accessible to everyone.
Key features:
- Lightning-fast TTS with ultra-low latency
- Instant voice cloning
- Self-host or use our cloud service
- Simple, flat-rate pricing
Try it out:
- Playground: https://fish.audio
- GitHub: https://github.com/fishaudio/fis...
- HuggingFace Model: https://huggingface.co/fishaudio...
- Demo: https://huggingface.co/spaces/fi...
We can't wait to see what you'll create with Fish Audio. Happy voice building! 🎧🐠
@lengyue Congratulations on the launch of Fish Audio 1.4! 🎉 It's incredible to see the platform grow with 700k hours of multilingual data and support for 8 languages—this is a huge step forward! Making it open-source will truly empower developers and researchers across the globe. Excited to see the innovations that come from this. Keep up the amazing work!
Congratulations on launching such an innovative product! I'm really intrigued by the idea of having powerful, fast, and natural speech synthesis available in any language—it's a game changer for accessibility and creativity.
The feature that stands out to me is the ability to clone voices instantly. This opens up so many possibilities for content creators and developers alike. Additionally, the option to self-host or use your service provides flexibility that many users will appreciate.
I’m curious about how you handle voice cloning from an ethical standpoint. Also, are there plans to integrate more languages or dialects in the future? Excited to see where this goes—keep up the great work!
Stella AI
Todocap