Inspiration
As students, we often found research papers to be dense, jargon-heavy, and inaccessible to the broader student community. While they contain valuable knowledge, their format creates a barrier that discourages exploration and understanding. At the same time, we were inspired by how creators like 3Blue1Brown communicate complex ideas with clarity and visual beauty. This sparked the idea for ResearchTube — a platform that transforms research papers into intuitive, animated videos to help students truly engage with academic work in an approachable and enjoyable format.
What it does
ResearchTube takes academic research papers and converts them into visual explainers that feel like 3Blue1Brown-style videos. Users can upload a paper or provide an arXiv link, and the system extracts key concepts, definitions, and logical progressions. These are turned into animated sequences that break down the material into digestible chapters, complete with voice narration and intuitive visuals. The final product is a short, high-quality video that helps students understand the core ideas behind even the most complex papers — without having to wade through dense text.
How we built it
We built a pipeline that begins with NLP-based parsing of uploaded PDFs or arXiv links to identify the structure and flow of a paper. Using tools like spaCy and custom-trained summarizers, we extract key components such as the abstract, methodology, equations, and conclusions. We then use manim, a Python animation engine, to create dynamic visuals that explain these concepts step-by-step. For narration, we integrate a script generation module that feeds into a text-to-speech engine. The frontend, built using React, presents the final video with interactive elements like chapter markers and optional pausing for deeper exploration. The overall system is modular, allowing for both automation and manual refinement where needed.
Challenges we ran into
One major challenge was striking the right balance between simplicity and accuracy. Oversimplifying a paper risks losing the depth of the research, while too much detail overwhelms the viewer. We also faced difficulties with automating script generation since research papers vary widely in structure and language. Rendering animations at scale was another technical bottleneck, especially under limited computing resources. Ensuring that the system could generalize across disciplines — from machine learning to biology — required constant iteration and feedback.
Accomplishments that we're proud of
We’re proud that we successfully created a working prototype that turns academic research into digestible visual content. We were able to integrate multiple technologies — NLP, animation, voice synthesis, and a responsive UI — into a cohesive platform. Most importantly, early testing showed that users found our videos significantly easier to understand compared to reading the papers directly. Seeing complex ideas become accessible to a wider audience felt like a huge win.
What we learned
We learned a lot about the art of communication — especially how to translate technical depth into visual clarity without diluting meaning. On the technical side, we gained hands-on experience in natural language processing, animation scripting, voice synthesis, and frontend development. We also developed an appreciation for cross-disciplinary thinking, realizing that making research accessible isn’t just a technical challenge but also a creative one.
What's next for ResearchTube
Going forward, we aim to scale the platform by allowing more automation while maintaining quality. We plan to build a contributor model where students or educators can edit or refine the auto-generated videos. We also want to integrate user feedback loops to improve the NLP summarization and visualization logic. Eventually, we envision ResearchTube as the go-to platform where any student, regardless of background, can visually explore cutting-edge research across any field.
Log in or sign up for Devpost to join the conversation.