
Inworld develops AI products for builders of consumer applications, enabling scaled applications that grow into user needs and organically evolve through experience.
Inworld develops AI products for builders of consumer applications, enabling scaled applications that grow into user needs and organically evolve through experience.
Inworld
Hi Product Hunt! I'm Kylan, co-founder of @Inworld, and I'm stoked to share Inworld TTS with you.
We've spent the past four years working alongside thousands of builders, and this launch represents a lot of what we've learned since our first Product Hunt debut in 2022. At Inworld, we build AI products that help consumer applications grow and evolve with their users. Inworld TTS is our first step towards removing a critical barrier that is keeping builders from their next million users.
New users get 2M free characters. Accessible via API or in our Playground. Try it now.
Inworld TTS delivers state-of-the-art quality and latency at the most affordable price on the market.
$5 per million characters, with comparable models around $100. Here’s a quick example:
What do you get?
✅ Industry-leading quality (Word Error Rate & Speaker Similarity)
✅ Real-time latency (median latency of 200ms)
✅ Free zero-shot voice cloning
✅ Multilingual and crosslingual support
✅ Audio markups for emotion, style and nonverbals
✅ SOC2 Type II + on-premise deployments
✅ Open-sourced training and modeling code
How is this possible?
We're focused on removing the most pressing infrastructure barriers that keep great AI applications from scaling. Voice is one of the biggest cost and complexity hurdles facing today's builders, so we decided to tackle it head-on.
We repurposed large language models for speech synthesis rather than using traditional TTS architectures. This innovative approach, combined with streamlined serving infrastructure, enabled us to deliver state-of-the-art quality and real-time performance at a fraction of the cost. You can read (or listen) to the specifics here.
Where can you try it?
Inworld TTS is available today via API and can be experienced in our TTS Playground, where you can test pre-built voices or clone your own voice. Find more technical details in our blog post or try it now.
Inworld
@kylan_gibbs Exciting to be launching on Product Hunt again – biggest product launch to-date, with much more to come. Scale is getting solved. Evolution is next.
Inworld
@kylan_gibbs looking forward to the next steps :)
Inworld
@kylan_gibbs It's truly remarkable to see what the team has accomplished with this product. The effort and brilliance that's gone into this labor of love cannot be overstated and is clearly evident in the results. So pumped to see what's to come!
@kylan_gibbs This is a game changer for developers - Excited to see users interact with AI via voice seamlessly!
@kylan_gibbs Can't wait to see the great things that developers are going to build using this!
This looks impressive! How does Inworld TTS handle different languages and accents?
Inworld
@evgenii_zaitsev1 Thanks! We currently support 11 languages, not including accents. If a voice cloning prompt represents all the particularities of an accent, the model will reproduce most of them. If you have more data, we can perform professional cloning (self-service will be available later). We have customers to whom we delivered multiple voices with a New York accent, a Kiwi accent, etc.
The Max model is better for multilingual support and accent preservation. API will be available soon, currently in UI only.
The next model iteration will have further improvements in quality and support for more languages. If you are interested in specific languages, please let us know. For now, if you have an app, you can set up routing that sends traffic to Inworld for the languages we support.
No Cap
@No Cap will be trying this out!
Inworld
Inworld
@ednevsky let us know what you think! Always looking forward to feedback :)