Inspiration
The inspiration came from seeing how much short-form content creators and streamers rely on sound to stand out. While video filters are everywhere, voice filters are rare and usually low-quality. I wanted to make something that lets anyone sound hilarious or professional instantly.
I was also motivated by the viral nature of memes and soundbites. A single funny voice clip can spread faster than text or video, and I wanted to build a tool that makes this easy.
What it does
Transforms your voice into lots of unique characters and professional styles in one tap. Users can sound like a witch, pirate, or villain. Finished clips can be shared directly with friends.
How we built it
I built Voice Changer: AI Generator as a SwiftUI iOS app with a simple, one-tap recording interface. For the core transformations, I integrated the ElevenLabs audio transformation API through a Node.js backend running on Firebase Cloud Functions, which handles requests securely and efficiently. I use a Hetzner-hosted SQL database for structured data and Hetzner CDN storage to deliver processed audio quickly and reliably, ensuring users get low-latency, high-quality results.
Accomplishments that we're proud of
I’m especially proud of achieving a fully synchronized system between the Hetzner SQL database, the Firebase backend, and the iOS app. Any change I make to the characters database is instantly reflected on user devices, with only the deltas being downloaded. Combined with local caching of audio and images, this setup keeps the app both lightweight and responsive. Building this end-to-end synchronization was technically challenging but also very satisfying.
What we learned
I learned how to integrate a complete end-to-end system, from the SwiftUI mobile app to the Firebase backend and Hetzner CDN infrastructure into a seamless flow. This experience gave me a deeper understanding of how mobile clients, cloud functions, databases, and content delivery networks can work together to provide a fast, reliable, and scalable product.
What's next for Voice Changer: AI Generator
Next, I plan to enable users to pair their transformed voices with 9×16 videos for easy sharing on social media. Users will be able to generate videos from the voice script, edit them, and create inspiring, funny, or epic content. This will allow users to build unique social media accounts and influence their audience with engaging, personalized audiovisual creations.
Built With
- elevenlabs
- firebase
- hetzner
- mysql
- node.js
- revenuecat
- swiftui
- typescript
Log in or sign up for Devpost to join the conversation.