memos UI
list of memos
memos view and edit
about memos
memos workflow diagram

memos: Intelligent Voice Memos that Generate Personal Software

Inspiration

The main inspiration was thinking about the next era of software. We're living in a transitional period in regards to software, but the applications we build remain largely static.

A interesting idea is the concept of personal software: applications designed on the spot to solve a specific purpose. This is doable today if the user has the willpower to do it. But it would be even better if the software designed itself, by understanding what the user needs and providing the application for it without the user needing to articulate every detail. Similar to the ability of neurons to rewire themselves via Neuroplasticity.

What it does

memos is a webapp that receives audio input while chatting and keeping the user engaged. It then processes' these inputs and intelligently organizes them into different (or the same) memos.

These files can contain AI calls, have ingested context and basically become simple applications that can be run directly on memos.

The agents autonomously interpret your needs and generates mini-applications, essentially turning voice commands into personal software.

How we built it

Frontend: Developed with NextJS and inspired by the design of Teenage Engineering products, our interface is built with customized shad/cn components and Tailwind CSS.
Backend: We integrated ElevenLabs Conversational AI for real-time interaction and the Vercel AI SDK to process inputs into .AIM files. AIM is a publicly available programming language where code blocks naturally merge with markdown, making the underlying logic easy to understand and modify.
Deployment: For hackathon efficiency, all files are stored in the browser during development. We deployed the solution on Vercel using Fluid Compute to handle the heavy lifting of LLM file processing and execution.

Challenges we ran into

Even using structured outputs the results were not deterministic enough to always have the correct syntax. We managed to achieve a nice consistency by optimizing the prompts but for it to be perfect will have to think of additional type safeties.

Accomplishments that we're proud of

We successfully built an end-to-end flow that takes user input from voice to organized, executable memos. This validates our vision of intelligent voice memos that autonomously generate personal software, marking a significant step toward a more intuitive interaction with digital tools.

What we learned

The project deepened our understanding of:

Prompting LLMs to generate code in specific syntaxes.
Coordinating specialized agents for distinct tasks.
Seamlessly integrating conversational AI into a user-centric application.

What's next for memos

memos is an example of a different way to interact with software, and after the hackathon we plan to make it more production ready by having auth and proper file storage.

As for new features and ideas we want to set up a conversational agent with memo context (today the agent is focused on gathering content not on explaining information).

And for the long term we want to optimize the intelligent capabilities of memos to rewrite themselves and provide the user with solutions without him asking for it.

Built With

aim
elevenlabs
nextjs
shad/cn
tailwindcss
vercel

Updates

Marcelo Kunze started this project — Feb 23, 2025 09:57 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.