🎯 Inspiration

Many people with autism struggle to read facial expressions or emotional cues, which can lead to stress and isolation.

🤓 What it does

MoodSee helps users identify the emotions of the person they’re speaking to through simple visual cues inside the headset.

The app uses the headset’s passthrough-video data and sends images to ChatGPT to analyze facial expressions in (almost) real time and translate them to simple visuals above the speaker.

🔧 How we built it

Building Blocks BlazeFace for Face Detection ChatGPT for Emotion Detection Vercel for the OpenAI Interface Unity-PassthroughCameraApiSamples

🔥 Challenges we ran into

Face tracking in the headset The original AI models were trained on cropped faced, which is not what the headset is seeing. This forced us to switch to ChatGPT, which came with its own challenges Getting access to camera access was an issue at first Local processing to remove reliance on third-party apps Real-time processing Designing visuals that communicate emotion effectively without being distracting or stressful.

🏆 Accomplishments that we're proud of

Getting the emotion recognition pipeline to work

📚 What we learned

  • How to use Meta-PCA
  • Set up inference and ran RoboFlow AI
  • Used face-tracking locally on the Quest
  • Sent images to ChatGPT for recognition, including emotions
  • Worked with voice recognition and built actions from it

🚀 Target Group

MoodSee is designed for neurodiverse users, especially those with autism, who struggle to interpret facial expressions during conversation. It acts as a real-time emotional assistant to reduce social friction and support smoother face-to-face communication.

⏭️ What's next for [C23] MoodSee by Empaths XR

  • User testing!
  • Local live tracking to remove reliance on ChatGPT or third-party libraries
  • Consent feature that allows users to ask conversation partners for permission
  • Voice detection to add another layer of emotional context
  • Pet emotion tracking

Built With

Share this project:

Updates