Inspiration

Our motivation for AudioNova stemmed from the need for a local-first, high-performance speech solution that guarantees privacy and low latency. Many existing text-to-speech (TTS) and voice-changing tools rely heavily on cloud services, raising concerns about data ownership, speed, and cost. With Whisper models optimized through Qualcomm AI Hub for Snapdragon X processors, we saw an opportunity to build a versatile, on-device platform that can help individuals with speech impairments, content creators, and businesses alike.

What it does

AudioNova is a Windows-based application that provides:

Voice Generation:

Generate natural-sounding voices from text. Users can pick from five built-in voices or clone a custom voice using Qualcomm Whisper for transcription and Vall-E-X for generation.

Voice Changing:

Transform any uploaded or recorded audio into a new voice. Qualcomm Whisper transcribes the file locally, then Vall-E-X regenerates the speech in a different voice.
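The voice-changing flow described above (transcribe locally, then regenerate the speech in another voice) can be sketched roughly as follows. This is only an illustration of the two-stage shape, not AudioNova's actual code: the `VoiceChanger` class, the `Transcriber` and `Synthesizer` callables, and the voice name `voice_2` are all hypothetical, and the stand-in functions merely simulate what Whisper and Vall-E-X would do.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical interfaces, assumed for illustration only.
Transcriber = Callable[[bytes], str]       # audio bytes -> transcript text
Synthesizer = Callable[[str, str], bytes]  # (text, voice name) -> audio bytes


@dataclass
class VoiceChanger:
    """Two-stage pipeline: speech -> text -> speech in a different voice."""
    transcribe: Transcriber
    synthesize: Synthesizer

    def change_voice(self, audio: bytes, target_voice: str) -> bytes:
        # Stage 1: transcribe the input audio (Whisper's role in the text above).
        text = self.transcribe(audio)
        if not text.strip():
            raise ValueError("no speech detected in input audio")
        # Stage 2: regenerate the transcript in the target voice
        # (Vall-E-X's role in the text above).
        return self.synthesize(text, target_voice)


# Stand-ins so the sketch runs without any model installed.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_synthesize(text: str, voice: str) -> bytes:
    return f"[{voice}] {text}".encode("utf-8")


changer = VoiceChanger(fake_transcribe, fake_synthesize)
result = changer.change_voice(b"hello world", "voice_2")
```

Passing the two stages in as callables keeps the pipeline independent of any particular model runtime, so an on-device transcriber and synthesizer could be swapped in without changing the orchestration.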

Built With

  • seo