YES Ai: Smart Assistant with Real-Time Tools & Deep Research

Our Main Logo
Deep Research Mode
User Input Page
Create account page
Secure OTP Via Email
Login Page
architecture diagram

💡 Inspiration: A Unified AI Assistant

Our goal was to move beyond single-purpose apps and create a unified, secure, multi-functional AI agent that truly acts as a personal assistant for the modern web. YES Ai was inspired by the need to consolidate:

Real-time information access (Search & Weather)
Complex problem-solving (Math & Research)
Secure user management (OTP Login)

All seamlessly orchestrated by Google gemini-2.0-flash to demonstrate its full capabilities for the AI Accelerate: Unlocking New Frontiers. Our architecture is immediately being retrofitted for Elasticsearch integration to serve as the primary knowledge retrieval system, aligning directly with the Elastic Partner Challenge requirements.

✨ Core Features: What YES Ai Delivers

Feature	Technical Insight
🔍 Deep Research Mode	Fetches and synthesizes multi-source information using external search APIs.
🌐 Multilingual Power	Supports fluid conversation in English, Bengali, and Hindi, showcasing the model's language resilience.
☁️ Real-Time Tooling	Simultaneously calls external APIs for Weather updates and internal tools for Math from a single user query.
🔒 Secure Authentication	Implements a custom, production-ready login system with Email OTP verification and secure bcrypt password hashing.

🚧 Challenges, Learnings, & Future Vision

🧠 Key Learnings & Technical Execution

Tool Orchestration Mastery: Learned to structure system prompts to ensure Gemini 2.0 Flash reliably and accurately selects multiple tools (Math, Weather, Search) based on a single, complex user input.
Security Implementation: Hands-on experience implementing robust authentication using bcrypt and OTP—a critical learning for deploying scalable web applications.
Deployment Resilience: Learned secure deployment best practices on Streamlit Cloud, utilizing secrets.toml to hide API keys from the public repository.

🚀 Future Scope: The Hybrid V2 Vision

Our biggest challenge was building the current architecture modularly enough for a future upgrade. This vision showcases our understanding of sustainable, production-grade AI:

Area	Planned Upgrade	Technical Impact
Model Architecture	Hybrid Model Integration (Gemini + Llama 3)	Optimized cost-effectiveness and specialized custom response tones.
Accessibility	🔊 Voice Commands & TTS	Implementing Speech-to-Text for input and Text-to-Speech for output.
Generative Tools	🖼️ Photo Generation Integration	Adding a feature to generate photorealistic images directly within the chat.

Current efforts are focused on completing the transition from third-party APIs to Elastic as the central search engine, a critical step for enhancing the agent's scalability and contextual understanding. This commitment to a Hybrid V2 vision demonstrates an understanding of production-grade, sustainable AI development, which is critical for unlocking new capabilities on the web.

🔗 Built With

AI Core: Google gemini-2.0-flash
Frontend/Backend: Python / Streamlit
Authentication: Custom Login System (bcrypt, Email OTP)
Deployment: Streamlit Cloud

Built With

bcrypt
google-gemini-api
google-search-api
openweathermap
openweathermap-api
python
sqlite
streamlit
streamlit-cloud
tool-calling

Updates

Ranajit Dhar started this project — Oct 13, 2025 03:06 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.