Inspiration

3D modeling has a brutal learning curve. Every tool — Blender, ZBrush, SolidWorks — requires you to learn its way of thinking before you can make anything. But everyone knows how to draw. I wanted to see what happens when you skip the modeling interface entirely and just let people sketch what they want, then let a vision model handle the translation to actual geometry.

What it does

You sketch with the MX Ink in mixed reality. It only has to be a simple sketch, a stick figure, a blobby shape, whatever gets the idea across. That sketch gets sent to a vision model which interprets it and generates a 3D model via Three.js.

The 3D-model is right there in front of you, and now, you just keep drawing on it! Sketch a hat on the head, add an extra limb, scribble some detail on the torso. The vision model receives the existing model plus your new strokes, understands the spatial relationship, and regenerates with your additions properly integrated. You're iteratively building up a 3D model through a back-and-forth conversation in sketches.

Preliminary stack

WebXR + Three.js for in-headset rendering, MX Ink SDK for stylus input and stroke capture, vision model API (Gemini, Llama) for sketch interpretation. The pipeline takes the vision model's structured output and translates it into Three.js geometry programmatically.

What we learned

MX INK's pressure sensitivity and full spatial tracking give the vision model meaningfully more signal than a flat 2D sketch would — you get stroke depth, drawing angle, pressure variation. That's a richer input for the AI to work with when figuring out what you meant. I'm very interested in exploring how much of a difference this makes with asset quality.

What's next for Elsewhere Studio

Build it. The core risk is the vision model interpretation loop, so that's first — get sketch capture → vision API → Three.js generation working end to end, then layer on iterative refinement. After that: export support (GLTF, OBJ) so models aren't trapped in the headset, and collaborative sessions where multiple people sketch on the same model.

Built With

Share this project:

Updates