posted an update

The Model Died. The Protocol Didn't.

On March 9, Google retired Gemini 3 Pro — the model half this hackathon was built on.

Projects that hardcoded their architecture around a single model endpoint are now facing a choice: migrate under pressure, or hope that their demo video is enough.

RE's demo is still running. Not because we patched anything. Because we didn't have to.

When we submitted RE in February, we made a deliberate architectural choice: build on Gemini 3 Flash and Gemma 3 27B in a hybrid configuration, with a runtime model toggle in the UI. Not because we predicted Google would retire Pro — but because a governance protocol that depends on a single model isn't governance. It's a feature request with a countdown timer.

That choice was not incidental. It was the thesis.


But this update isn't about survival. It's about what comes next — without changing a line of code.

Gemini 3.1 Pro launched this week. It scores more than double the reasoning performance of 3 Pro on ARC-AGI-2. That's not an incremental update. It's a generation-level improvement in the model's ability to handle novel logic patterns.

Here's what that means for each of the five Gemini-specific capabilities RE was built on:

1. 1M Token Context Window + Stronger Reasoning

RE injects the full append-only evidence chain into every inference call. No truncation — because in a compliance context, partial records are inadmissible records. With 3.1 Pro's doubled reasoning capability reading that same chain, the model doesn't just retrieve what happened. It understands the trajectory — why decisions shifted, where patterns emerge, what the sequence means.

This is what we described in Update #6 as the inverse of Monte Carlo tree search: instead of expanding a thousand paths and discarding 999, RE preserves every path. A stronger model reading the same preserved paths produces deeper comprehension. The record didn't change. The reader did.

2. Thinking Levels (thinking_budget)

RE's policy engine routes reasoning depth by risk level. Low-risk actions get minimal thinking. High-risk triggers — transactions exceeding thresholds, authority changes, injection attempts — get deep reasoning.

With 3.1 Pro, every thinking level is more capable. The same budget allocation produces higher-quality risk assessment. The policy engine doesn't need to change. The model behind it just got sharper.

3. Thought Signatures

RE captures the model's internal reasoning process from the API and writes it into the cryptographic audit log. These aren't summaries. They're the model's actual thought trace — verifiable evidence of how a decision was reached.

With 3.1 Pro's enhanced reasoning, those thought traces become richer. More reasoning steps, more explicit logic chains, more auditable evidence per decision. The evidence chain's resolution increases without any change to the capture mechanism.

4. Flash + Gemma Same-Family Switching

RE's hybrid architecture — Flash for cloud, Gemma 27B for edge — runs identical governance logic across both models with one SDK and one system prompt format. 3.1 Pro doesn't break this. It extends it. The governance layer is model-agnostic by construction. Adding 3.1 Pro to the model selector is a configuration change, not an architecture change.

5. Context Caching

RE's evidence chain grows with every session. Without caching, cost and latency scale linearly with chain length. Context caching makes the growing chain economically sustainable. With 3.1 Pro's improved reasoning operating on cached context, RE gets better inference quality at the same cost structure.


None of this requires changing the submitted code. None of this requires changing the architecture. None of this requires asking for permission.

This is what model-agnostic design means in practice: the model upgrades, the governance doesn't need to. The evidence chain is the constant. The model is the variable.

We said in Update #1: RE doesn't make AI smarter. RE makes AI answerable.

Gemini 3.1 Pro makes the AI smarter. RE still makes it answerable. The two are complementary — and they always were.

— Che, Solo developer, Project RE, Taipei Taiwan

Log in or sign up for Devpost to join the conversation.