Inspiration

I believe that education is the most effective way to uplift humans and I have been working as a teacher and app builder ( focussing on edtech). I have built several apps over the years trying to help people learn digital skills. I have been creating video tutorials for a while and realized that there is a lot of friction when it comes to creating videos and sharing them. There's scripting, recording, editing, post-production, distributing, monetising, and most importantly being judged. I wanted a better tool for recording & publishing. I also realized that everyone has knowledge & skills that is hidden behind their screens in the form of workflows, presentations, research papers, etc. This knowledge if shared can help learners learn and creators make money from their knowledge.

What it does

Instalearn College is a screen recording tool + a micro tutorials marketplace. Teachers/creators can record their screens, transform their recording into professional tutorials with AI voiceovers and custom branding, and share their tutorials on our marketplace for the benefit of learners. Learners can pay a monthly subscription and get access to all the tutorials.

How we built it

Built the app with Google AI Studio.

  1. Core Frontend React 19 (ESM): Utilizing the latest React version for component-based UI, managed via native browser ES modules (no-build architecture). TypeScript: Deeply typed state management for tutorial configurations, user profiles, and branding settings. Tailwind CSS: Utility-first styling with a custom theme featuring glassmorphism, high-contrast typography (Inter), and smooth "fade-in" animations.

  2. Artificial Intelligence (Google Gemini API) Multimodal Reasoning (gemini-3-flash-preview): Used to analyze video frames extracted from screen recordings to perform OCR and synthesize instructional scripts. Native Text-to-Speech (gemini-2.5-flash-preview-tts): Generates high-quality, professional audio voiceovers directly from the generated scripts using the Kore prebuilt voice.

  3. Backend & Infrastructure (Firebase Suite) Firebase Authentication: Handles secure user identity via Google OAuth and Email/Password providers. Cloud Firestore: A real-time NoSQL database managing the community feed, likes, follows, and threaded discussions (comments). Firebase Storage: Scalable cloud hosting for the .webm tutorial video files.

  4. Media & Audio Engineering MediaRecorder API: Captures high-definition system video and microphone audio streams. Web Audio API: Processes raw PCM audio data, handles decoders, and manages real-time audio/video synchronization. Canvas API: Used for complex frame extraction (multimodal input) and serverless video composition (merging intros, recordings, and outros into a final master file).

  5. Utilities & Design Lucide React: A comprehensive icon library for the dashboard and editor UI. esm.sh: CDN-based module resolution, enabling a modern development workflow without a local bundler. Native Web APIs: Utilizing navigator.mediaDevices for screen capture and URL.createObjectURL for real-time video previewing.

Challenges we ran into

  1. Getting AI to generate scripts that actually make sense from raw screen recordings was tricky. We iterated on prompts and added context detection to improve output. We are still improving the app to get better scripts for the screen recordings.

  2. Synching the audio and the video.

Accomplishments that we're proud of

The biggest accomplishment is that I now have a tool where I can record my skills and share them on my platform. All the features are working as of now:

  1. Screen recording
  2. AI editing (voiceover & branding)
  3. Publishing/downloading tutorials
  4. Analytics
  5. Tutorial playback with social features e.g like and comments
  6. AI detects inappropriate content before allowing tutorials to get published
  7. An admin can also delete a tutorial if required
  8. A product tour to guide users

What we learned

  1. Everyone really does have knowledge to share. The barrier isn't expertise—it's the effort of packaging that expertise. There's also the fear of being judged. Remove the frictions, and people will create.

  2. Marketplaces need liquidity on both sides. We learned that charging creators before there's buyer demand doesn't work. The tool needs to be valuable independent of the marketplace.

  3. AI can democratize content creation. Auto-generating scripts and voiceovers isn't just a nice feature—it's what makes the entire model possible. It turns anyone into a professional tutorial creator.

What's next for Instalearn College (3 months)

  1. Create 25 videos to understand roadblocks and how improving the recording tool.
  2. Setup analytics - GA, GSC, Hotjar
  3. Onboard 100 creators and 1,000 learners
  4. Focus on tutorial verticals - prompting, vibe coding, marketing, productivity, startups.
  5. Measure user engagement
  6. Add subscription for recording tool and courses
  7. Strategies for converting free to paid users

Built With

Share this project:

Updates