💡 Inspiration: A Unified AI Assistant

Our goal was to move beyond single-purpose apps and create a unified, secure, multi-functional AI agent that truly acts as a personal assistant for the modern web. YES Ai was inspired by the need to consolidate:

  • Real-time information access (Search & Weather)
  • Complex problem-solving (Math & Research)
  • Secure user management (OTP Login)

All seamlessly orchestrated by Google gemini-2.0-flash to demonstrate its full capabilities for the AI Accelerate: Unlocking New Frontiers. Our architecture is immediately being retrofitted for Elasticsearch integration to serve as the primary knowledge retrieval system, aligning directly with the Elastic Partner Challenge requirements.


✨ Core Features: What YES Ai Delivers

Feature Technical Insight
🔍 Deep Research Mode Fetches and synthesizes multi-source information using external search APIs.
🌐 Multilingual Power Supports fluid conversation in English, Bengali, and Hindi, showcasing the model's language resilience.
☁️ Real-Time Tooling Simultaneously calls external APIs for Weather updates and internal tools for Math from a single user query.
🔒 Secure Authentication Implements a custom, production-ready login system with Email OTP verification and secure bcrypt password hashing.

🚧 Challenges, Learnings, & Future Vision

🧠 Key Learnings & Technical Execution

  1. Tool Orchestration Mastery: Learned to structure system prompts to ensure Gemini 2.0 Flash reliably and accurately selects multiple tools (Math, Weather, Search) based on a single, complex user input.
  2. Security Implementation: Hands-on experience implementing robust authentication using bcrypt and OTP—a critical learning for deploying scalable web applications.
  3. Deployment Resilience: Learned secure deployment best practices on Streamlit Cloud, utilizing secrets.toml to hide API keys from the public repository.

🚀 Future Scope: The Hybrid V2 Vision

Our biggest challenge was building the current architecture modularly enough for a future upgrade. This vision showcases our understanding of sustainable, production-grade AI:

Area Planned Upgrade Technical Impact
Model Architecture Hybrid Model Integration (Gemini + Llama 3) Optimized cost-effectiveness and specialized custom response tones.
Accessibility 🔊 Voice Commands & TTS Implementing Speech-to-Text for input and Text-to-Speech for output.
Generative Tools 🖼️ Photo Generation Integration Adding a feature to generate photorealistic images directly within the chat.

Current efforts are focused on completing the transition from third-party APIs to Elastic as the central search engine, a critical step for enhancing the agent's scalability and contextual understanding. This commitment to a Hybrid V2 vision demonstrates an understanding of production-grade, sustainable AI development, which is critical for unlocking new capabilities on the web.


🔗 Built With

  • AI Core: Google gemini-2.0-flash
  • Frontend/Backend: Python / Streamlit
  • Authentication: Custom Login System (bcrypt, Email OTP)
  • Deployment: Streamlit Cloud

Built With

Share this project:

Updates