VOIXA: Voice & AI Video Learning for Spanish and English

Inspiration

We were inspired by the need to support diverse student communities, especially Spanish-speaking and bilingual learners who face language barriers in traditional education tools. Most platforms are designed for English-first learning, making it harder for many students to fully understand and engage with their coursework. We wanted to create a system where students can learn and interact in the language they are most comfortable with.

At the same time, learning today is still largely passive. Students scroll through PDFs, slides, and LMS platforms without real interaction, often struggling with complex topics. We realized the issue is not the lack of content, but the lack of accessible and engaging ways to interact with it.

VOIXA brings these ideas together by turning course materials into something students can talk to, watch, and actively learn from, while ensuring language is never a barrier.

What it does

VOIXA transforms static course materials into a fully interactive AI-powered learning experience:

Students can chat with their syllabus and lecture slides using voice or text
Difficult topics can be converted into AI-generated explainer videos
The platform generates quizzes and flashcards for active learning
All features are available seamlessly in both English and Spanish

Students can complete the entire learning process in either language, including asking questions, receiving explanations, generating videos, and practicing with quizzes and flashcards.

How we built it

We built VOIXA using a combination of AI models and a full stack architecture:

Core AI Systems
- Gemini API for AI powered video generation in English and Spanish
- ElevenLabs for high quality voice generation in both languages
- OpenAI APIs for text understanding, responses, quizzes, and flashcards
Frontend
- React, Vite, Tailwind CSS
- React Router, Axios, react-markdown
- i18next for English and Spanish support
Backend
- Python with FastAPI

Challenges we ran into

Converting unstructured course materials into accurate AI responses
Synchronizing generated scripts, visuals, and audio into coherent videos
Maintaining correctness while simplifying complex topics
Ensuring consistency across voice, text, video, and both languages

Accomplishments that we're proud of

Built a fully working system combining voice, text, and AI video generation
Enabled complete learning workflows in both English and Spanish
Successfully integrated multiple AI systems into one seamless experience
Created an intuitive and interactive way to learn from existing course materials