Inspiration

Most AI assistants today can answer questions but cannot remember long-term context or perform real digital tasks. Users still juggle multiple apps for email, scheduling, browsing, and productivity. This fragmentation is even harder for visually impaired users, who depend on accessible, voice-based systems.

We wanted to build an AI that goes beyond chat — an assistant that can understand, remember, and act. This led to NEURA, a cognitive AI web agent designed to automate digital workflows through voice interaction.

What We Built

NEURA (Neural Enhancement & Reasoning Assistant) is a multimodal AI web agent that can see, speak, remember long-term context, make decisions, and automate digital tasks.

It combines:

  • Voice interaction for hands-free control
  • LLM-based reasoning for intelligent responses
  • Persistent memory using embeddings
  • Workflow automation via APIs and browser automation
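The persistent-memory component above can be sketched as a small vector store: each remembered utterance is embedded, and recall ranks stored entries by cosine similarity to the query. This is a minimal illustration only; the toy hashed bag-of-words `embed` function stands in for a real embedding model, and all names here are hypothetical.

```python
import hashlib
import math

def embed(text, dim=64):
    """Toy embedding: hash each word into a fixed-size vector.
    A stand-in for a real embedding model, used only to keep this sketch runnable."""
    vec = [0.0] * dim
    for word in text.lower().split():
        idx = int(hashlib.md5(word.encode()).hexdigest(), 16) % dim
        vec[idx] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

class MemoryStore:
    """In-memory long-term memory: store text with its embedding,
    retrieve the most similar entries by cosine similarity."""
    def __init__(self):
        self.entries = []  # list of (text, embedding) pairs

    def remember(self, text):
        self.entries.append((text, embed(text)))

    def recall(self, query, k=1):
        q = embed(query)
        # Higher dot product of unit vectors = higher cosine similarity.
        scored = sorted(
            self.entries,
            key=lambda e: -sum(a * b for a, b in zip(q, e[1])),
        )
        return [text for text, _ in scored[:k]]

memory = MemoryStore()
memory.remember("User's meeting with Alex is on Friday at 3pm")
memory.remember("User prefers dark mode in all apps")
print(memory.recall("when is the meeting with Alex"))
```

A production version would persist the vectors and use a learned embedding model, but the store/rank/retrieve loop is the same shape.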

NEURA can handle tasks like email generation, scheduling, information retrieval, coding assistance, and document understanding.

Additionally, NEURA supports adaptive automation based on user behavior. For example, if a user frequently uses platforms like YouTube, Instagram, or Twitter, NEURA can automate repetitive actions and workflows for those platforms.
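One simple way to implement this kind of adaptive automation is to count how often the user performs each (platform, action) pair and propose automating a pair once it crosses a frequency threshold. The class, threshold, and action names below are illustrative assumptions, not NEURA's actual implementation.

```python
from collections import Counter

class AdaptiveAutomator:
    """Track how often the user repeats each (platform, action) pair;
    once a pair crosses the threshold, flag it for automation."""
    def __init__(self, threshold=3):
        self.threshold = threshold
        self.counts = Counter()
        self.automated = set()

    def record(self, platform, action):
        key = (platform, action)
        self.counts[key] += 1
        if self.counts[key] >= self.threshold and key not in self.automated:
            self.automated.add(key)
            return f"Automating '{action}' on {platform}"
        return None  # not frequent enough yet, or already automated

agent = AdaptiveAutomator()
msg = None
for _ in range(3):
    msg = agent.record("YouTube", "open subscriptions feed")
print(msg)  # the third repetition triggers the automation suggestion
```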

How We Built It

We designed NEURA using a modular architecture:

  • Input Layer – Voice, text, and visual inputs
  • AI Layer – Speech-to-text, NLP, and LLM reasoning
  • Memory Layer – Embedding-based long-term memory
  • Decision Engine – Chooses between responding or executing tasks
  • Automation Layer – Executes workflows using APIs and browser automation
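The Decision Engine's respond-or-execute split can be sketched as a router over the user's utterance. A real agent would use LLM function-calling for this; the keyword matching below is a deliberate simplification, and the keyword list is an assumption.

```python
# Verbs that suggest the user wants an action performed, not an answer.
ACTION_KEYWORDS = {"send", "schedule", "open", "book", "play", "create"}

def decide(user_input):
    """Toy decision engine: route an utterance to the automation layer
    ('execute') or to the LLM for an answer ('respond')."""
    words = user_input.lower().split()
    if any(w in ACTION_KEYWORDS for w in words):
        return "execute"
    return "respond"

print(decide("send an email to my professor"))   # execute
print(decide("what is the capital of France"))   # respond
```

The value of keeping this as a separate layer is that the routing policy can be swapped out (rules, a classifier, or LLM tool selection) without touching the input or automation layers.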

The system was built using Python, with automation handled via n8n and browser automation tools. Memory was managed using vector embeddings, and cloud services were used for scalability.
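An n8n workflow can be triggered from Python by POSTing JSON to the workflow's webhook. The sketch below shows that hand-off under assumptions: the webhook URL and the payload shape (`task`/`params`/`source`) are hypothetical placeholders for whatever the actual workflow expects.

```python
import json
from urllib import request

# Hypothetical webhook URL; an n8n Webhook node exposes one per workflow.
N8N_WEBHOOK_URL = "https://example.com/webhook/neura-tasks"

def build_task_payload(task, params):
    """Wrap a task request in the JSON shape our (assumed) workflow expects."""
    return {"task": task, "params": params, "source": "neura"}

def trigger_workflow(task, params):
    """Fire the n8n workflow by POSTing the payload to its webhook."""
    payload = build_task_payload(task, params)
    req = request.Request(
        N8N_WEBHOOK_URL,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    request.urlopen(req)

payload = build_task_payload("send_email", {"to": "alex@example.com", "subject": "Hi"})
print(payload["task"])
```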

Challenges We Faced

Integrating multiple components like voice processing, memory retrieval, and automation into a seamless system was a major challenge. Ensuring fast responses while maintaining contextual memory required careful optimization.

Another key challenge was designing a voice-first, accessible system that effectively supports visually impaired users.

What We Learned

We learned how to build a full-stack AI agent that combines reasoning, memory, and automation. This project also helped us understand how multimodal AI can improve accessibility and real-world productivity.

NEURA shows how AI can evolve from simple chatbots into intelligent agents that understand context, adapt to users, and perform real-world tasks.

Built With

  • Python
  • n8n
  • Browser automation tools
  • Vector embeddings
  • LLM-based reasoning
  • Cloud services
