🖼️ Image Agent — Conversational AI for Image Generation & Editing
Image Agent is a conversational AI that enables users to generate and edit multiple images using natural language prompts, powered by AWS Bedrock and the Strands Agents SDK.
Image Agent demonstrates how reasoning-based agents can autonomously understand user intent, plan creative tasks, and use generative models for both image generation and editing.
🚀 Key Features
- Conversational Image Generation — Users simply describe what they want, and the agent creates high-quality visuals.
- Conversational Image Editing — Batch-edit multiple images with natural instructions.
- Autonomous Reasoning — Powered by OpenAI GPT-OSS-120B hosted on Amazon Bedrock, the agent plans, decides, and refines actions in a reasoning loop.
🧠 How It Works
- User Interaction — The user chats with the Image Agent through a conversational interface.
- Reasoning Loop (Bedrock + Strands) — The agent uses OpenAI GPT-OSS-120B as its reasoning LLM to interpret user intent, plan tool calls, and validate results.
- Image Operations
  - Generation
  - Editing
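The flow above (interpret intent, plan tool calls, run image operations, validate results) can be sketched in plain Python. This is a minimal illustration and not the project's code: `plan` is a keyword rule standing in for the reasoning LLM, `generate` and `edit` are stubs standing in for the image model, and every name here (`Session`, `ToolCall`, `run_turn`, and so on) is hypothetical. The real project wires this up through the Strands Agents SDK and Amazon Bedrock, neither of which is called in this sketch.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ToolCall:
    """One planned tool invocation (hypothetical structure)."""
    name: str              # "generate" or "edit"
    args: Dict[str, str]   # keyword arguments for the tool


@dataclass
class Session:
    """Conversation state: image id -> text description of the image."""
    images: Dict[str, str] = field(default_factory=dict)
    log: List[str] = field(default_factory=list)


def generate(session: Session, prompt: str) -> str:
    """Stub for image generation: records the prompt under a new id."""
    image_id = f"img-{len(session.images) + 1}"
    session.images[image_id] = prompt
    return image_id


def edit(session: Session, image_id: str, instruction: str) -> str:
    """Stub for image editing: appends the instruction to the record."""
    session.images[image_id] += f" | {instruction}"
    return image_id


TOOLS: Dict[str, Callable[..., str]] = {"generate": generate, "edit": edit}


def plan(message: str, session: Session) -> List[ToolCall]:
    """Stand-in for the reasoning LLM (assumption: a simple keyword rule).

    Messages starting with "edit" become one edit call per existing image
    (batch editing); anything else becomes a single generation call.
    """
    if message.lower().startswith("edit") and session.images:
        return [
            ToolCall("edit", {"image_id": image_id, "instruction": message})
            for image_id in session.images
        ]
    return [ToolCall("generate", {"prompt": message})]


def validate(session: Session, image_id: str) -> bool:
    """Stand-in for result validation: the image must exist in the session."""
    return image_id in session.images


def run_turn(message: str, session: Session) -> List[str]:
    """Handle one user message: plan, call tools, validate, collect ids."""
    results = []
    for call in plan(message, session):
        image_id = TOOLS[call.name](session, **call.args)
        if validate(session, image_id):
            results.append(image_id)
        session.log.append(f"{call.name} -> {image_id}")
    return results


if __name__ == "__main__":
    chat = Session()
    run_turn("a red fox in snow", chat)
    run_turn("a blue owl at night", chat)
    print(run_turn("Edit: add a warm sunset glow", chat))
    print(chat.images)
```

Keeping the planner, the tools, and the validator as separate functions mirrors the split described above: the planner could be swapped for a real LLM call and the stubs for real image-model calls without changing `run_turn`.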
🏗️ Tech Stack
- Reasoning LLM: OpenAI GPT-OSS-120B via Amazon Bedrock
- Agent Framework: Strands Agents SDK
- Image Model: A single image model used for both generation and editing
- Language: Python
💡 Impact
Image Agent enables anyone — from designers to developers — to create and modify visuals conversationally, bridging the gap between language and image creation. It’s a practical showcase of how reasoning LLMs and generative tools can collaborate to streamline creative workflows.
Built With
- amazon-web-services
- bedrock
- strands