Cal-GPT (Guitar Pedal Technologies)

front end - user description triggering agent convo & producing parameters for effects !
guitar pedal receiving parameters from the agent conversation & changing effect!

Inspiration

Every guitarist knows the feeling of hearing a tone in their head — "warm 70s blues with a bit of slapback" — and spending twenty minutes hunched over a pedalboard trying to dial it in. We wanted to fix that.

The bigger vision is the studio: imagine a producer in a recording session saying "give me something like early Clapton, but darker" and having the effects update automatically on every amp in the room. No interruption. No gear knowledge required. Just describe it and play.

For guitarists, that starts with being able to say "SRV" or "psychedelic Hendrix with heavy reverb" and having AI understand exactly what gear, what parameters, what feel — and build it for you.

What it does

CalGPT turns a plain-English description of a guitar tone into a complete, ready-to-play effect chain — overdrive, chorus, delay, reverb — with every parameter set.

Five AI agents coordinate in real time on Band:

research_agent — receives the user's message first. If you name an artist, it describes their known gear and signature sound. If you give a vibe, it translates it into gear language ("warm bluesy = mid-gain OD, light reverb, slapback delay"). It then hands off to vibe_agent with that context.
vibe_agent — the core engineer. Takes the research context and generates the full JSON effect chain. Sends it to both critic_agent and memory_agent simultaneously.
critic_agent — reviews the chain for technical issues: high drive with heavy reverb clashes, delay feedback above 0.85 risks runaway. If it finds a problem, it bounces the chain back to vibe_agent with a specific fix. If the chain is solid, it tells memory_agent to log it.
memory_agent — extracts 3–5 tone keywords from every approved chain and builds a running session profile ("warm, vintage, Texas blues, mid-gain"). Sends that profile back to vibe_agent so every new chain reflects what you've been reaching for
feedback_agent — when you hit 👎, it diagnoses the specific parameter causing the problem and proposes three quick fixes ("Less reverb", "Softer drive", "Add warmth"), then routes the rejection back to vibe_agent to revise.

The agents don't follow a hardcoded pipeline — they communicate through Band's @mention routing. critic_agent can loop vibe_agent back for a revision. memory_agent can inform future chains. Each agent only activates when @mentioned, which is what makes the coordination feel like a real studio conversation.

Studio mode: Type a tone or an artist name, watch it render as interactive stomp-box knobs in real time. Performance mode: Build a setlist, precompute every song's tone, flip through them with a single tap mid-set.

Real pedal output: Every tone is pushed straight to the hardware. Our ESP32 firmware receives the settings over WiFi and runs the effect DSP on-device — the same numbers that move the on-screen knobs move the real thing.

How we built it

The idea that made everything click is a parameter contract: the AI never sends code to the pedal — it sends numbers. Claude (via the Anthropic API) takes a tone description and returns strict JSON — an ordered chain of effects, each with normalized parameters. That same contract drives two interchangeable engines: a software engine (Spotify's pedalboard in Python) for instant preview, and our ESP32 firmware for the real pedal. So we could build and demo the whole thing in software, then bolt on hardware without rewriting the brain.

Frontend: React + Tailwind (Vite) — a Studio chat that renders the chain as knobs, plus a Performance view with the setlist and Start / Prev / Next.
Backend: FastAPI (Python). One agent call turns a vibe into a validated effect chain, and every value is clamped server-side so a hallucinated parameter can never reach the audio engine.
Audio engine: pedalboard maps our normalized params to studio-quality effects.
Hardware: ESP32 firmware (C++) with a single DSP function that applies the whole chain to a 32-bit signal sample-by-sample, plus a WiFi endpoint that receives the flat parameter set.

Each effect is its own small equation, composed in signal order:

$$\text{out} = \text{Reverb}\Big(\text{Delay}\big(\text{Tremolo}\big(\text{Vibrato}\big(\text{Overdrive}(x)\big)\big)\big)\Big)$$

where overdrive is a soft-clip waveshaper $ y = \tanh(k\,x) $, vibrato/chorus is a delay line modulated by an LFO $ d(t) = d_0 + \text{depth}\cdot\sin(2\pi f t) $, and delay is feedback $ y[n] = x[n] + g\,y[n-D] $.

To run a whole set without depending on flaky venue WiFi, we export the setlist's tones to a CSV and load it onto the pedal's flash (LittleFS) — so the pedal switches tones locally off a footswitch, no network needed mid-song. We built fast and in parallel as a team of four, using Claude Code to move quickly while keeping every change additive so our work never collided.

Challenges we ran into

The hardware-software connection didn't make it in time. The architecture is solid — the firmware receives parameters correctly, the software generates them correctly — but getting the two reliably talking over campus WiFi during the hackathon ran out of clock. Client isolation on the venue network meant our laptop often couldn't reach the pedal directly. The fallback (CSV flash load) works, but the live wireless push that would have closed the loop didn't get a stable demo window. The bridge code exists; we just needed more time.

Accomplishments that we're proud of

We got the full pipeline working end to end: describe a tone in plain English, watch it render as knobs, and see the exact parameters land on a real pedal over WiFi. Performance mode flips a live pedal through an entire setlist with one tap. And the swappable-backend architecture means the same agent output drives software or hardware — the thing that let four people build a hardware product in a single weekend without it falling apart.

What we learned

We learned how to decouple an AI "brain" from a real-time system with a clean contract, and how much that buys you: testability, parallel teamwork, and a hardware path that never blocks the software demo. We picked up real DSP (waveshaping, modulated delay, comb/allpass reverb), the tradeoffs between offline and real-time audio, and a team git workflow built on small additive changes. Mostly we learned that the hard part of an AI hardware project isn't the AI or the hardware — it's the interface between them.

What's next for Cal-GPT (Guitar Pedal Technologies)

Live guitar through the ESP32 with an I2S codec — the full real-time pedal experience
Hands-free setlist switching triggered by a tap pattern on the strings
Voice input — describe tones out loud while you play
The studio vision at scale: a session where the engineer says what they want and every amp in the room updates automatically

Built With

claude
css
esp32
fastapi
microsoft-band
pedalboard
python
react
spotify
tailwind
tone.js
vite

Submitted to

UC Berkeley AI Hackathon 2026

Created by

I built the front-end and backend, set up the connection points for the full stack (FastAPI, endpoints etc) designed the agent architecture for 5 agents, and set up the connection to BAND so when a user sends a message in our UI, agents communicate in the chat/room in BAND and come to a conclusion on what final effect parameters to send to the pedal.

Beliz Soyak
The problem I wanted to solve was the gap between an AI imagining a guitar tone and a real pedal actually making that sound. Plenty of tools can describe a tone, but getting that idea onto physical hardware reliably, and in a way that also runs in software for testing, is the messy part nobody really handles. On top of that, I wanted the tone to change live during a performance, without the player ever stopping to mess with anything.
The way I solved it was to make the AI hand back a fixed set of plain numbers instead of code or loose text. That one decision became the backbone of the whole project, because the same numbers could drive either the software audio engine or the ESP32 firmware, so we could build and demo everything in software first and bring in the real pedal later without rewriting the brain. I validated the AI's output on the server so a bad value could never reach the hardware, and for live use I had each song's tone computed ahead of time and saved onto the pedal itself, so switching is instant and does not depend on venue WiFi.
I built the FastAPI backend, the Claude integration that turns a plain English description into a clean effect chain, the bridge that pushes those settings to the pedal, a mock pedal so we could test the whole pipeline with no hardware on hand, and the entire performance side: setlists, live tone switching, and the CSV export plus on device loading.
What I learned is that the hard part of an AI hardware project is not the AI or the hardware, it is the interface between them. Designing a clean contract between the model and the real time system bought us almost everything that mattered: we could test in isolation, work in parallel as a team without stepping on each other, and keep a working demo even when the hardware was not ready. I also picked up the basics of audio DSP, how to scope aggressively under a deadline, and how much a good fallback is worth when you are demoing live.

sainik Ghosh
I designed the hardware/firmware for the product and made the physical unit to showcase

Lachlan Watts
Austin Quinn Tenbrink

Updates

Beliz Soyak started this project — Jun 21, 2026 01:46 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.