Inspiration

What if sending an email, scheduling a task, or fetching data didn’t require opening multiple apps and repeating the same steps again and again?

In real life, even a “simple” task like:

  • checking the weather,
  • applying for a job,
  • typing an email,
  • sending a scheduled message etc.

can take 5–10 manual steps.

We felt this could be made more efficient.

We realized - Humans think in intent, but systems work in steps. That gap is where time, effort, and productivity are lost. So we asked a simple question: “What if intent itself could be executable?

That idea became OneTapAI.

What it does

OneTap AI converts natural language into a single secure, executable link.

You simply describe what you want: “Send me today’s weather after 1 minute.”

And the system understands you, breaks the task into structured steps, assigns those steps to intelligent agents and generates a link. Clicking the link executes the entire workflow automatically.

OneTapAI turns “what you want” into “what gets done” - instantly.

We designed the system like a team of intelligent workers (AI agents):

  1. A Controller Agent understands the user’s request.
  2. It converts the request into a task graph (DAG).
  3. Each task is handled by a specialized agent.
  4. The system generates a token-based execution link.

This is technically explained in the next section.

How we built it

We built OneTapAI as a distributed multi-agent system. At the core, we use a Controller–Worker architecture:

  1. A Controller Agent (powered by GLM 5.1) understands user intent and converts it into a structured workflow.
  2. This workflow is represented as a Directed Acyclic Graph (DAG), where each node is a task.
  3. Each task is handled by a specialized Worker Agent, such as:
  • API Agent (for API calls)
  • Communication Agent (emails/messages)
  • Scheduler Agent (timed execution)
  • Data Agent (information processing)

In our model, we have used:

  • FastAPI for backend API handling
  • Redis for fast state tracking
  • MongoDB for persistent storage
  • Playwright for automation
  • Secure tokens (JWT/AES) for execution links
  • Task orchestration engine for DAG execution
  • Logging system to track real-time execution

When a user clicks the generated link, the system validates the token, executes tasks in the correct order, tracks progress in real-time and returns the final result.

We treat user intent as a compilable unit, similar to how code is compiled into executable programs. Each component was built to simulate a real-world distributed system, ensuring modularity, scalability and clear separation of responsibilities.

Challenges we ran into

1. Converting Intent to Structure - Natural language is ambiguous. Mapping it into a clear execution plan required designing a reliable pipeline that converts said task to a precise workflow.

2. Task Dependency Management - Ensuring tasks run in the correct order without conflicts was tricky but solved using DAG-based execution logic. Designing and managing DAG execution was a major challenge.

3. Making It Actually Work (Not Just a Concept) - Many systems simulate automation, but we focused on real execution, with real APIs, real scheduling, and real outputs.

4. Multi-agent Coordination - Managing communication between multiple agents while maintaining consistency and state was a complex system design challenge.

Accomplishments that we're proud of

  • Built a fully working multi-agent execution system, not just a prototype.
  • Enabled one-click execution of multi-step workflows.
  • Designed a secure token-based execution system.
  • Created a system that mimics real-world distributed computing architectures.

What we learned

  • How to design multi-agent systems.
  • How to convert abstract ideas into working systems.
  • Real-world challenges in automation pipelines.
  • Structuring workflows using graph-based models.

What's next for OneTapAI

We plan to take OneTapAI even further:

  • Integrate more real-world APIs (payments, bookings, managements).
  • Build a dashboard that shows live execution tracking.
  • Introduce a learning layer to adapt to user preferences.
  • Enable reusable execution links.

Our long-term vision is to move from interactive AI system to fully autonomous systems.

OneTapAI — From idea to execution, one tap is all it takes!

Code for OneTapAI - https://github.com/dhruv-developer/GLM

Built With

+ 19 more
Share this project:

Updates