Demo:(there are two video because I do not have a loom premium so my old loom hit a 5 min limit) https://www.loom.com/share/6c3a36b5b307436292d1b4e9302cafd9 (this is the final video tho((Main) https://www.loom.com/share/d699ee5d3c43420d8391df55fd936a6e
The World's Most Capable Autonomous AI Agent
Deptheon can autonomously orchestrate complex, multi-hour workflows involving web research, phone calls, emails, code execution, and access to 3000+ tools - all while you sleep.
π― What is Deptheon?
Deptheon is a fully autonomous AI agent that won 1st place at the Agents in the Loop Hackathon 2025 in San Francisco. Unlike traditional chatbots that only handle web-based tasks, Deptheon can:
- π Conduct comprehensive web research using advanced search and scraping
- π Make live phone calls and conduct interviews using AI voice technology
- π§ Send emails and manage communications across multiple platforms
- π» Execute code and automate tasks in sandboxed environments
- π Access 3000+ tools and APIs through Composio integration
- π Chain complex workflows autonomously for hours without human intervention
π The Demo
During the hackathon, Deptheon showcased its capabilities with an autonomous multi-modal workflow that would typically require hours of manual work:
What Deptheon Accomplished (Autonomously)
π Web Research
- Searched for information about the "Agents in the Loop" hackathon
- Scraped relevant websites and documentation
- Found the DevPost page and event details
π Live Phone Interview
- Automatically dialed +1-415-605-6693
- Conducted a professional interview using VAPI's AI voice technology
- Asked structured questions about the hackathon experience
- Recorded and transcribed the entire conversation
π Content Creation & Email
- Analyzed the interview transcript
- Composed a professional LinkedIn post highlighting the achievement
- Sent the draft via Gmail to the winner for review and posting
Key Interview Insights Captured
Project: "Deptheon" - A fully autonomous agent with access to hundreds of tools
Inspiration: Long-time fascination with agentic AI and perfect hackathon theme fit
Technical Challenge: Integrating diverse toolsets smoothly across different platforms
Innovation: Full autonomy across planning/execution cycles with real-world actions
Future Vision: Scale to more complex workflows and open beta access
Advice: "Just go and have fun! The magic happens when curiosity leads the way."
ποΈ Architecture Overview
Deptheon is built on a sophisticated tool-based agent architecture that enables seamless integration of multiple capabilities:
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β DEPTHEON CORE β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β π§ LLM Engine (OpenAI o3-2025-04-16) β
β π― Autonomous Planning & Execution β
β π Multi-step Workflow Orchestration β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β
βββββββββββββββββββΌββββββββββββββββββ
β β β
βββββββββΌβββββββ ββββββββββΌβββββββββ βββββββΌββββββ
β COMPOSIO β β VAPI β β PYTHON β
β 3000+ β β Voice AI β β EXECUTION β
β Tools β β Calling β β SANDBOX β
ββββββββββββββββ βββββββββββββββββββ βββββββββββββ
β β β
βββββββββΌβββββββ ββββββββββΌβββββββββ βββββββΌββββββ
β β’ Gmail β β β’ Outbound β β β’ Web β
β β’ GitHub β β Calls β β Scrapingβ
β β’ Slack β β β’ Interview β β β’ Data β
β β’ Notion β β Conduct β β Analysisβ
β β’ And β β β’ Transcript β β β’ API β
β 2996+ β β Capture β β Calls β
ββββββββββββββββ βββββββββββββββββββ βββββββββββββ
π‘ Key Features
π€ Fully Autonomous Operation
- No human intervention required during task execution
- Intelligent planning and adaptive problem-solving
- Self-directed workflow orchestration
π οΈ Extensive Tool Integration
- Composio: Access to 3000+ tools and APIs (Gmail, GitHub, Slack, Notion, etc.)
- VAPI: AI-powered voice calling and conversation capabilities
- Python Execution: Sandboxed code execution for data processing and automation
- Web Research: Advanced search and content extraction
π Agents-in-the-Loop Architecture
- Continuous planning β execution β learning cycles
- Dynamic tool selection based on task requirements
- Real-time adaptation to changing conditions
How I Built It
Architecture Design
I started with a tool-based agent architecture where everything the agent can do is encapsulated in discrete, composable tools:
class Deptheon(ToolCallAgent):
available_tools: ToolCollection = Field(
default_factory=lambda: ToolCollection(
PythonExecute(), # Code execution & web scraping
ComposioTool(), # 3000+ API integrations
VapiTool(), # AI voice calling
DateTimeTool(), # Time management
Terminate(), # Graceful completion
)
)
Key Technical Decisions
- Sandboxed Execution Environment [sandbox] use_sandbox = true image = "python:3.12-slim" memory_limit = "1g" cpu_limit = 2.0 timeout = 300 Safety was paramount - the agent needed to execute arbitrary code without compromising the host system.
- Advanced Prompting Strategy Instead of simple instruction-following, I designed prompts that encourage autonomous decision-making: "You cannot ask the user anything, you have to do everything by yourself and if you feel you have accomplished the task, you can end the conversation."
- Tool Abstraction Layer Every tool implements the same interface, making them composable: async def execute(self, *, name: str, tool_input: Dict[str, Any] = None) -> ToolResult: π§ Challenges I Faced Challenge 1: Authentication Hell Problem: Managing authentication for 3000+ tools is a nightmare. Different OAuth flows, API keys, connection IDs... Solution: Built a persistent connection management system that stores authentication tokens and automatically handles refresh cycles: _CONNECTION_STORE_PATH = Path(os.path.expanduser("~/.composio_connections.json")) Challenge 2: Real-Time Voice Integration Problem: Making the agent capable of actual phone conversations, not just text-based interactions. Solution: Integrated VAPI's AI voice platform with dynamic assistant configuration: assistant_config = { "firstMessage": "Hi, congratulations on winning the hackathon!", "systemPrompt": "You are conducting an interview...", "voice": "andrew", "model": "gpt-4o-mini" } Challenge 3: Workflow State Management Problem: How do you maintain context across web research β phone calls β email composition when each uses completely different APIs? Solution: Implemented a stateful agent base class that maintains conversation history and intermediate results across tool executions. Challenge 4: Autonomous Decision Making Problem: Most agents need explicit instructions for each step. How do you make them truly autonomous? Solution: Used OpenAI's o3 reasoning model with carefully crafted system prompts that encourage proactive problem-solving rather than reactive responses.
Built With
- agent
- composio-(3000+-apis)
- openai-o3-reasoning-model
- python-3.12+
- vapi
- voice
Log in or sign up for Devpost to join the conversation.