kanari

Overview Page
Architecture Overview
Gemini Model Architecture
Biomarker Pipeline
Landing Page
Overview Page - Trends
Achievements Page
Check-ins Page - Responsive UI/UX Design

Inspiration

You know the feeling: 8 hours of studying, skipped meals, and still falling behind.

As community college STEM students living alone, we saw how burnout builds quietly through scrolling loops, context switching, and procrastination cycles. Those energy leaks are hard to spot in real time. We built kanari to make those patterns visible early, so recovery can happen before a crash.

What it does

kanari is a browser-based burnout risk companion that uses short daily voice check-ins.

Voice check-in with Gemini Live: a natural 30-60 second conversation.
On-device acoustic biomarkers: local feature extraction for pauses, pitch variability, speech dynamics, and spectral signals.
Gemini 3 analysis layer: semantic interpretation, personalized synthesis, targeted recovery suggestions, and achievement generation.
3-7 day burnout forecast: heuristic short-horizon trend guidance with confidence and contributing factors.
Action loop: suggestions become scheduled recovery blocks in kanari’s built-in local calendar, including recurring schedules.
Progress layer: check-in history, journal support, streaks, and achievements to reinforce daily consistency.

How we built it

We built kanari with Next.js App Router, React, TypeScript, Tailwind CSS, and Dexie (IndexedDB).

Gemini architecture

Gemini Live (WebSocket) powers real-time voice conversation during check-ins.
Gemini 3 Flash (REST) powers deeper reasoning and generation across multiple endpoints:
- check-in context summary generation from prior sessions and trend signals
- post-check-in synthesis (narrative, evidence-linked insights, suggestions)
- semantic audio interpretation
- suggestion generation and diff-aware refresh
- daily achievements generation
- optional coach-avatar recipe selection (style + seed)

Voice and ML pipeline

Browser audio capture with Web Audio + VAD + Meyda feature extraction.
Session-level acoustic scoring and quality checks.
Semantic + acoustic biomarker fusion.
Trend-based forecasting logic for 3-7 day risk guidance.

Product and UX system

Local-first persistence in IndexedDB.
In-app scheduling and recovery block management.
Guided onboarding with voice selection, coach avatar, and accountability preferences.
Appearance customization (accent color, font selection, graphics quality).
Feature tour and demo workspace seeding for judge-friendly exploration.

Challenges we ran into

Realtime synchronization between live conversation events and local acoustic timelines.
Browser performance tuning so signal processing stayed smooth on consumer laptops.
Structured output reliability across multiple Gemini 3 JSON workflows.
Conversation robustness under interruption, reconnect, and session-resume scenarios.
UX clarity for presenting complex wellness signals in a calm, actionable interface.

Accomplishments that we're proud of

We shipped an end-to-end loop: voice check-in -> synthesis -> forecast -> scheduled recovery action.
We integrated Gemini 3 across core product workflows, not a single isolated feature.
We implemented local-first acoustic analysis and storage for strong user data control.
We built resilient realtime voice behavior with barge-in handling, reconnect, and session preservation.
We delivered polished product depth beyond core ML, including achievements, feature tour, demo mode, and personalization.

What we learned

Voice carries strong stress and fatigue cues, and those cues become much more useful when combined with semantic context from Gemini 3.

We also learned that voice AI products need equal investment in model integration and systems reliability. Structured outputs, recovery paths, and clear UX language directly shape user trust.

What's next for kanari

Extend forecasting from short horizon to long-term seasonal pattern tracking.
Improve explainability so users can see why risk and confidence changed.
Add optional multimodal inputs such as wearable biomarkers (for tri-modal fusion: audio + text + biometrics).
Expand recovery planning intelligence with stronger adherence feedback loops.
Build Team Pulse for privacy-preserving aggregate wellness insight at group level.

Built With

gemini-2.5-flash-native
gemini-3.0-flash
gemini-live
indexeddb
next.js
oauth
react

Submitted to

Gemini 3 Hackathon

Created by

I worked on debugging and adding new features in both frontend and backend. This was the first time I built something with Gemini 3.0 API.

Aleksandr Ershov
I built the core Kanari product end-to-end: the Next.js app, landing/onboarding, dashboard, settings, and demo/feature tour. I implemented the AI voice check-in pipeline (mic capture + audio features), real-time Gemini Live conversation, transcript merging, and post-check-in synthesis UI. I added offline-first storage with IndexedDB so check-ins, audio, and generated insights/journals persist locally. I integrated Gemini Flash for analysis and memory/context, plus achievements/accountability features and stronger error/session handling. I built the burnout prediction + biomarker layer (personalized calibration/fusion and stress/fatigue/risk reporting) and the recovery workflow (kanban + calendar scheduling with recurring events). I also did extensive mobile polish, reliability fixes, tests, and updated Devpost/demo assets.

Sean Esla

Updates

Aleksandr Ershov started this project — Feb 09, 2026 12:35 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.