Inspiration
Our motivation https://urlz.fr/tfEX for AudioNova stemmed from the need for a local-first, high-performance speech solution that guarantees privacy and low latency. Many existing text-to-speech (TTS) and voice-changing tools rely heavily on cloud services, raising concerns about data ownership, speed, and costs. By optimizing Whisper models through Qualcomm AI Hub for Snapdragon X processors, we saw an opportunity to build a versatile, on-device platform https://urlz.fr/tfEX that can help individuals with speech impairments, content creators, and businesses alike
What it does
AudioNova is a Windows-based application that provides:
Voice Generation:
Generate natural-sounding voices from text. Users can pick from five built-in voices or clone a custom voice using Qualcomm Whisper for transcription and Vall-E-X for generation.
Voice Changing:
Transform any uploaded or recorded audio into a new voice. Qualcomm Whisper transcribes the file locally, then Vall-E-X regenerates the speech in a different voice.
Log in or sign up for Devpost to join the conversation.