🖼️ Image Agent — Conversational AI for Image Generation & Editing

Image Agent is a conversational AI that enables users to generate and edit multiple images using natural language prompts, powered by AWS Bedrock and the Strands Agents SDK.

Image Agent demonstrates how reasoning-based agents can autonomously understand user intent, plan creative tasks, and use generative models for both image generation and editing.

🚀 Key Features

Conversational Image Generation — Users simply describe what they want, and the agent creates high-quality visuals.
Conversational Image Editing — Batch-edit multiple images with natural instructions.
Autonomous Reasoning — Powered by OpenAI GPT-OSS-120B hosted on Amazon Bedrock, the agent plans, decides, and refines actions in a reasoning loop.

🧠 How It Works

User Interaction — The user chats with the Image Agent through a conversational interface.
Reasoning Loop (Bedrock + Strands) — The agent uses OpenAI GPT-OSS-120B as its reasoning LLM to interpret user intent, plan tool calls, and validate results.
Image Operations
- Generation
- Editing

🏗️ Tech Stack

Reasoning LLM: OpenAI GPT-OSS-120B via Amazon Bedrock
Agent Framework: Strands Agents SDK
Image Model: Image Model for both generation and editing
Language: Python

💡 Impact

Image Agent enables anyone — from designers to developers — to create and modify visuals conversationally, bridging the gap between language and image creation. It’s a practical showcase of how reasoning LLMs and generative tools can collaborate to streamline creative workflows.

Built With

amazon-web-services
bedrock
strands

Updates

Viraj Anchan started this project — Oct 10, 2025 03:56 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.