Tracks
Art/Multimedia, AI/ML, General.
🔥 Inspiration
We were inspired by how overwhelming it can be to keep up with online content. Long articles, research papers, and blogs are everywhere—but who has the time to read them all? We wanted to create something that could summarize web pages visually, making key ideas easier to absorb and share.
🚀 What it does
Page2Pixel is a Chrome extension that turns any webpage into an AI-generated image summary using GPT-4o and DALL·E 3. It extracts content, summarizes it, and creates a visual that blends artistic design with informational clarity, all within a sidepanel in your browser.
🛠️ How we built it
- Frontend: React + TypeScript using WXT (Web Extension Tools), styled with Tailwind CSS
- Backend: FastAPI with Python and async/await support
- AI Stack: GPT-4o for summaries + image prompts, DALL·E 3 for visual generation
- Browser Integration: Chrome Extension API for tab/sidepanel control
🧱 Challenges we ran into
- Tuning GPT-4o prompts for clarity and creativity
- Ensuring the generated image matches the topic accurately
- Real-time communication between background, content, and UI scripts
- Avoiding over-summarization that loses key context
🏆 Accomplishments that we're proud of
- Seamless pipeline: webpage → summary → image
- Clean, responsive sidepanel UI with intuitive design
- Balanced visuals and info with no manual tweaking
- Built something useful, fast, and fun to use!
📚 What we learned
- Prompt engineering for text-to-image workflows
- Advanced Chrome Extension development with WXT
- Async FastAPI routing for multi-stage AI processing
- UI/UX patterns for embedded tools like sidepanels
🔮 What's next for Page2Pixel
- Enable image download and sharing
- Add text-to-speech summaries
- Let users customize image styles or summary tone
- Launch on the Chrome Web Store
Built With
- chrome
- extention
- fastapi
- imagegen
- openai
- python
- react
- typescript
- wxt

Log in or sign up for Devpost to join the conversation.