Inspiration
Reading long PDFs is often tiring and lacks personalization. We wanted to bridge the gap between humans and documents by making reading more interactive and emotionally aware. VoiceMate was inspired by the idea of transforming static PDFs into engaging experiences.g.
What it does
VoiceMate is an emotion-aware PDF reader that adapts content delivery based on user emotions. It converts documents into natural speech, plays suitable background music, highlights key information, and supports note-taking. This creates a more immersive and accessible reading experience.
How we built it
We built VoiceMate using Python for PDF extraction, NLP, and emotion detection. Text-to-speech enables audio narration, while Streamlit provides an interactive user interface. Additional modules handle language detection, entity recognition, and user personalization.
Challenges we ran into
Accurate real-time emotion detection was challenging due to varied user expressions. Synchronizing audio, emotions, and document flow required careful optimization. Ensuring performance while handling large PDFs was also demanding.
Accomplishments that we're proud of
We successfully developed an end-to-end emotion-enhanced document reader. Integrating adaptive audio, multilingual support, and interactive notes into a single platform was a major achievement. The system delivers a smooth and engaging user experience.
What we learned
We learned how to design emotion-aware AI systems and integrate NLP with user interaction. The project improved our understanding of accessibility-focused design and real-time AI adaptation. We also strengthened our teamwork and problem-solving skills.
What's next for VoiceMate
We plan to add voice-based Q&A using RAG for deeper interaction. Future updates include improved personalization, offline support, and broader accessibility features. We also aim to enhance emotion detection accuracy and scalability.
Log in or sign up for Devpost to join the conversation.