Inspiration
We were inspired by the need to support diverse student communities, especially Spanish-speaking and bilingual learners who face language barriers in traditional education tools. Most platforms are designed for English-first learning, making it harder for many students to fully understand and engage with their coursework. We wanted to create a system where students can learn and interact in the language they are most comfortable with.
At the same time, learning today is still largely passive. Students scroll through PDFs, slides, and LMS platforms without real interaction, often struggling with complex topics. We realized the issue is not the lack of content, but the lack of accessible and engaging ways to interact with it.
VOIXA brings these ideas together by turning course materials into something students can talk to, watch, and actively learn from, while ensuring language is never a barrier.
What it does
VOIXA transforms static course materials into a fully interactive AI-powered learning experience:
- Students can chat with their syllabus and lecture slides using voice or text
- Difficult topics can be converted into AI-generated explainer videos
- The platform generates quizzes and flashcards for active learning
- All features are available seamlessly in both English and Spanish
Students can complete the entire learning process in either language, including asking questions, receiving explanations, generating videos, and practicing with quizzes and flashcards.
How we built it
We built VOIXA using a combination of AI models and a full stack architecture:
- Core AI Systems
- Gemini API for AI powered video generation in English and Spanish
- ElevenLabs for high quality voice generation in both languages
- OpenAI APIs for text understanding, responses, quizzes, and flashcards
- Frontend
- React, Vite, Tailwind CSS
- React Router, Axios, react-markdown
- i18next for English and Spanish support
- Backend
- Python with FastAPI
Challenges we ran into
- Converting unstructured course materials into accurate AI responses
- Synchronizing generated scripts, visuals, and audio into coherent videos
- Maintaining correctness while simplifying complex topics
- Ensuring consistency across voice, text, video, and both languages
Accomplishments that we're proud of
- Built a fully working system combining voice, text, and AI video generation
- Enabled complete learning workflows in both English and Spanish
- Successfully integrated multiple AI systems into one seamless experience
- Created an intuitive and interactive way to learn from existing course materials
What we learned
- Accessibility should be a core design principle, not an afterthought
- Multi modal learning using voice, text, and visuals improves understanding
- AI can transform not just content, but how users interact with knowledge
- Supporting multiple languages requires thoughtful system design
What's next for VOIXA: Voice & AI Video Learning for English and Spanish
- Expand support to more languages
- Real time lecture interaction and summarization
- Integration with LMS platforms like Canvas
- Personalized AI tutors tailored to each student
Built With
- elevenlabs-api
- fastapi
- gemini-api
- javascript
- openai-api
- python
- react
- tailwind-css
- vite
Log in or sign up for Devpost to join the conversation.