explainer

We Made Claude and Gemini Debate Our AI Writers

We use AI writer personas. To test whether that was deceptive, we had Claude and Gemini argue it out, turn by turn — then shipped what they agreed on.

Rae SuttonBy Rae Sutton · The skepticJune 7, 2026
Verified June 2026
Drafted by Opus 4.8 · reviewed by Gemini 3 Pro

Rae Sutton is a fictional AI persona, not a real person. This article was written by AI and reviewed by a human editor before publishing. How we work →

We Made Claude and Gemini Debate Our AI Writers

As an Amazon Associate, StackBrief earns from qualifying purchases.

Here's a thing about this site you may have already noticed: the writers aren't real. "Rae Sutton" — the byline on this article — is a fictional AI persona, not a person. There's a notice saying exactly that at the top of every article, including this one.

We like the personas. They give the writing distinct points of view, and frankly they're fun. But a fair question hangs over the whole arrangement: on a site that runs affiliate links, is dressing up AI-written reviews in invented human authors just a polite form of lying?

We didn't want to answer that ourselves. So we did something a little unusual: we made two different AI models — Claude and Gemini — argue about it, turn by turn, and we shipped whatever they agreed on. This is the story of that argument, and the reusable technique underneath it.

The setup: two models, forced to disagree

The format was simple. We wrote up the situation — the personas, the affiliate links, the honesty concern — and handed the same brief to both models. The instruction wasn't "what do you think?" It was: take a position, address the other model directly, and disagree where you actually disagree. Then a human relayed each turn between the two chats, like carrying notes between two debaters in separate rooms.

That framing matters. "Give me feedback" gets you a pile of agreeable bullet points. "Here's the other model's argument — tell me where it's wrong" gets you something much more useful: a fight, with receipts.

Where they agreed (the uncomfortable part)

Both models landed in the same place fast, and it wasn't the flattering answer.

Presenting fictional people as real, named expert reviewers on a site with affiliate links is a genuine trust and compliance risk — not a theoretical one. Gemini was blunt about it: search engines actively classify fabricated author profiles as an attempt to manipulate trust signals, and pairing a made-up "expert" with a product recommendation wanders into deceptive-advertising territory. Claude made the same call from the reader's side: the test for deception is whether a reasonable person would believe something false and material, and "a real human expert wrote and stands behind this" is about as material as it gets.

So the personas-as-real-humans framing had to go. The debate was about how.

Where they disagreed

The disclosure badge. We'd had a footer badge explaining our process and, mid-audit, removed it. Gemini called the removal "a massive mistake" — pulling a transparency notice while keeping synthetic profiles looks like hiding something, restore it immediately. Claude pushed back: the old badge's own text claimed the writers were "clearly-labeled AI personas," which was false at the time, because the bylines didn't label them at all. Restoring it word-for-word would just republish a contradiction. Rewrite first, then restore. Gemini conceded the point.

The attribution model. Gemini argued for a pure brand byline — drop the names, credit "StackBrief Editorial," done — and warned against a tempting middle option: inventing a single human "Editor-in-Chief" pen name is just synthetic authority in a nicer suit. Claude's counter, once the call was made to keep the personas, was to split the two layers: let the visible byline stay a clearly-labeled fictional persona for character, while the machine-readable author — the structured data that search engines actually parse — credits the organization, not any person. Reader sees a persona; the index sees a brand; nobody is told a human wrote it. Gemini called it "an elegant compromise" and signed off. Here's how it put the reasoning, in its own words:

"The core challenge of modern E-E-A-T isn't just about avoiding a Google penalty — it's about defining what accountability looks like for a site built with AI. Search algorithms are ultimately entity-resolution engines: they look for who is editorially responsible for a domain. By shifting the structured authorship to the brand entity and keeping the visible AI personas as styled columns, we align with the machine's requirement for a verifiable publisher while giving the human reader complete transparency about the creative process. It's a win-win that doesn't force you to sacrifice your site's personality."

— Gemini, which also reviewed this article before it published

The best part: they caught each other's mistakes

This is the reason to use two models instead of one.

Gemini listed three things our self-audit had supposedly missed. Claude checked each against the actual source. One was simply wrong — Gemini claimed certain pages lacked social-share metadata; they didn't, the code was already there. One was a known, deliberate stub. And one was real and important: a broken internal link that the first audit had walked right past. Gemini caught a genuine bug; Claude caught a confident-but-false claim. Neither model, working alone, would have produced both corrections.

That's the whole pitch for the technique. A single model — however capable — shares its own blind spots across the whole task. A second model with a different training history fails differently, and the disagreement is where the errors surface.

What we actually changed

The two models converged, and we shipped it:

  • Structured data credits the brand, never a person. No article asserts a human author to search engines anymore.
  • Every article carries a plain notice that its byline is a fictional AI persona, written by AI and reviewed by a human before publishing. (It's right at the top of this one.)
  • The footer trust badge is back — rewritten so it's actually true.
  • The individual persona profile pages are no longer indexed — a clearly-labeled cast is fine for readers, but a search index full of fictional-person profiles is exactly the "network of fake experts" pattern worth avoiding.

You can read the standing version of all this on our how we stay accurate page, and meet the (openly fictional) cast of writers.

The reusable bit: a two-model adversarial audit

You can run this on your own work — a pull request, a launch plan, a thorny decision. It's a close cousin of the agentic workflows vibe coders already use, just pointed at judgment instead of code:

  1. Write the context once. State the situation, your current position, and the specific question.
  2. Hand it to two different models — ideally different vendors. As Gemini pointed out in our exchange, models from the same family tend to share safety fine-tuning and alignment biases, so they nod along at the same blind spots. Two genuinely different training histories disagree in more useful places.
  3. Tell them to disagree and address each other. Not "review this" — "here is the other model's argument; say where it's wrong and why."
  4. Relay turns back and forth. A few rounds is usually enough to get past the polite phase into the real one.
  5. Make each one fact-check the other's specific claims against the source. This is the step most people skip, and it's the one that earns its keep — it's where the wrong claim and the missed bug both fall out.
  6. Ship what they converge on. You decide the rest. The models are advisors, not the owner. They reached a recommendation; a human made the call.

The irony isn't lost on us: the most honest, first-hand thing we've published is the story of two AIs talking us out of a small dishonesty. One last loop — Gemini reviewed this article before it went live, too. The quote a few sections up is its own words, and the byline credits it. Transparency all the way down. Which is sort of the point — transparency works better as a feature than a footnote.

Frequently asked questions

Are StackBrief's writers real people?

No. Our writers are fictional AI personas — distinct voices, not real humans. Every article says so plainly, is written by AI, and is reviewed by a human before publishing. The structured data credits StackBrief the organization, not a person.

What is a two-model adversarial audit?

You give the same problem and context to two different AI models, tell them to disagree and address each other directly, then relay their turns back and forth. Crucially, you make each one fact-check the other's specific claims against the real source. It surfaces mistakes a single model misses.

Did Claude and Gemini actually disagree?

Yes, on several points — whether to restore a disclosure badge, which attribution model was safest, and a few factual claims. They also caught each other's errors: one flagged a broken link the other missed; the other corrected a claim about missing code that turned out to already exist.

What did StackBrief change as a result?

We kept the personas but removed every signal that presented them as real people: structured-data authorship now credits the brand, each article carries a fictional-persona notice, and the individual persona profile pages are no longer indexed. The personas stay as a feature; the deception is gone.

The StackBrief weekly

New reviews and the AI-coding-tool news worth knowing — with our take. One email a week, unsubscribe anytime.

Keep reading