π·οΈ Spidey Sense
Navigate the world, fearlessly.
AI-powered navigation assistant that helps visually impaired users explore their surroundings through real-time detection, spatial awareness, and conversational guidance.
π Social Impact
More than 285 million people worldwide live with visual impairments. Traditional canes and GPS apps offer limited awareness β they canβt describe nearby obstacles or open paths.
Spidey Sense transforms independence by turning vision into voice, guiding users with real-time spatial awareness, conversational AI, and natural speech β empowering safer, more confident mobility.
π§ Inspiration
We asked: βWhat if someone who canβt see could still sense the world like Spider-Man?β
Most navigation tools tell you where you are, not whatβs around you. Visually impaired users often wonder:
βIs there something in front of me?β
βCan I move forward safely?β
So we built Spidey Sense β a friendly AI companion that sees, thinks, and speaks, helping users explore with awareness and trust.
π‘ What It Does
π§ Real-Time Object Detection β Detects people, chairs, doors, and obstacles using COCO-SSD.
π¦― Spatial Awareness Engine β Classifies objects into left, center, right zones for precise guidance.
π Voice Synthesis (ElevenLabs) β Converts COCO SSDβs findings into lifelike speech.
π₯ Smart Timer β Periodically checks surroundings every second and guides user accordingly.
πͺ Multi-Mode Awareness β Switch between Explore, Focus, and Follow for different contexts.
π Key Benefits
ποΈ Vision β Voice β Narrates your environment in real time.
π¦― Safe Movement β Guides you away from dead ends and toward clear paths.
π§ Conversational Insight β Natural dialogue, not robotic alerts.
π€± Touch Expansion β Future-ready for haptic feedback integration.
π Accessibility First β Voice-first, minimalistic design built for independence.
π Use Cases
- Pedestrian navigation for the visually impaired
- Campus or indoor mobility for students
- Elderly users navigating homes and care facilities
- Assistive tech developers integrating multimodal AI
π οΈ How We Built It
π₯ Frontend
- HTML, CSS, JavaScript for a voice-first UI
- Web Speech API for push-to-talk and voice capture
- Mock interface simulating object detection + Gemini dialogue
- Auth0 integration for security
- MATLAB plots visualizing the walking distances of friends in friendly competitions
βοΈ Backend
- Node.js + Express for API routing
- COCO-SSD model for live object detection
- ElevenLabs API for lifelike voice output
π€ AI & APIs
- ElevenLabs TTS β natural speech output
- COCO-SSD β real-time detection for 80+ object classes
π Challenges We Overcame
π§© Integrating three AI systems (vision + language + voice)
π€ Managing latency in voice-triggered queries
π¦― Translating object positions into spatial guidance
π§ Designing a calm and empathetic voice UX
πΊ Accomplishments
β
Auth0 login capabilities to ensure user data remains secure
β
End-to-end multimodal pipeline: Detection β Scene Summary β Speech Output
β
Scene-aware conversational responses
β
Periodic voice prompts (every second)
β
Inclusive voice-first interface tested with real users
π What We Learned
- Context > Detection: Users need actionable guidance, not raw data
- Voice-first Design improves trust and usability
- Multimodal AI bridges the gap between accessibility and autonomy
- Accessibility = intuitive speech, minimal friction, and reliability
π Next Steps
𦑠Integrate haptic belt feedback for spatial direction
πΊοΈ Add indoor navigation using AR markers
π± Launch a mobile app (Flutter) with offline support
π§ Integrate Gemini context memory for multi-turn conversations
πͺ§ Incorporate Optical Character Recognition (OCR) to read street signs, menus, labels, bus numbers, or product packaging.
πΆοΈ Integrate app with wearable interfaces like Meta Glasses for ease of use.
β€οΈ Why Spidey Sense
Spidey Sense empowers visually impaired users to move confidently and independently β combining sight, speech, and spatial intelligence into one assistive companion. Itβs more than an app β itβs AI that helps you feel your surroundings.



Log in or sign up for Devpost to join the conversation.