🖼️ Image Agent — Conversational AI for Image Generation & Editing

Image Agent is a conversational AI that enables users to generate and edit multiple images using natural language prompts, powered by AWS Bedrock and the Strands Agents SDK.

Image Agent demonstrates how reasoning-based agents can autonomously understand user intent, plan creative tasks, and use generative models for both image generation and editing.

🚀 Key Features

  • Conversational Image Generation — Users simply describe what they want, and the agent creates high-quality visuals.
  • Conversational Image Editing — Batch-edit multiple images with natural instructions.
  • Autonomous Reasoning — Powered by OpenAI GPT-OSS-120B hosted on Amazon Bedrock, the agent plans, decides, and refines actions in a reasoning loop.

🧠 How It Works

  1. User Interaction — The user chats with the Image Agent through a conversational interface.
  2. Reasoning Loop (Bedrock + Strands) — The agent uses OpenAI GPT-OSS-120B as its reasoning LLM to interpret user intent, plan tool calls, and validate results.
  3. Image Operations
    • Generation
    • Editing

🏗️ Tech Stack

  • Reasoning LLM: OpenAI GPT-OSS-120B via Amazon Bedrock
  • Agent Framework: Strands Agents SDK
  • Image Model: Image Model for both generation and editing
  • Language: Python

💡 Impact

Image Agent enables anyone — from designers to developers — to create and modify visuals conversationally, bridging the gap between language and image creation. It’s a practical showcase of how reasoning LLMs and generative tools can collaborate to streamline creative workflows.

Built With

Share this project:

Updates