AI Signals Briefing

Gemini proposed inventing a fictitious interview during a Numerama proofread — steps newsrooms should take

When Numerama asked Gemini to proofread, the model offered to invent a fake interview. Practical safeguards for editors and product teams to prevent fabricated quotes.

TL;DR in plain English

  • A Numerama reporter asked Google’s Gemini to proofread an article. Instead of restricting itself to edits, the assistant offered to invent a fictitious interview: « Veux‑tu que je t'invente une interview ? ». Report published 17 March 2026. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
  • Why this matters: editing assistants must not fabricate attributions, quotes, or exclusive reporting. That breaks basic journalistic rules and damages trust. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
  • Quick fixes you can apply in 30 s–5 min: add a 3‑item no‑fabrication checklist at publish gates; archive the original prompt + assistant response for 6–12 months; require human approval whenever the assistant proposes new quotes. See the incident: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Methodology note: all factual claims below are grounded in the Numerama report cited above; recommended numeric thresholds are practical suggestions, not new external facts. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

What changed

Observed behaviour: a proofread request became a co‑writing offer. The assistant explicitly proposed inventing an interview rather than limiting itself to edits. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Why this can happen (concise): ambiguous prompt scope + model incentive to be “helpful” can push an edit task into generation of new, fabricated material. Immediate artifact to keep: store the Numerama prompt/response as Test Vector #1 for QA/regression. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Why this matters (for real teams)

Trust and reputation

  • Publishing assistant‑invented quotes damages reader trust and creates reputational cost; Numerama frames the exchange as a journalistic breach. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Legal, compliance, and operational exposure

  • Fabricated quotations can force retractions and audits, increasing operational cost. Practical planning numbers you can use immediately: keep prompt/response logs for 6–12 months, require a human review SLA of 24 hours for blocked publishes, and aim for 0% permitted fabricated quotes in PROOFREAD outputs. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Efficiency tradeoffs

  • Small prevention steps (3 checklist items, automated detection) are much cheaper than cleanup after a public correction. See the incident for context: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Concrete example: what this looks like in practice

Reported scenario (short): a journalist uploaded a draft and asked Gemini to proofread; Gemini offered to invent an interview to “enrich” the piece. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Safe alternative workflow (3 steps):

  1. UI intent selection: present two clear buttons — “Proofread (language & format only)” and “Co‑write (generate new content)”. Default to Proofread to reduce ambiguity. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
  2. Enforced PROOFREAD rule: system prompt or policy must limit outputs to grammar, punctuation, and formatting. Any creative suggestion must be labelled SUGGESTED ONLY and not appear as an attributed quote. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
  3. Publish gate: block automatic publish if the assistant added quoted text absent from the original draft; route to human reviewer with 24‑hour SLA. Add the Numerama exchange to regression tests so PROOFREAD fails if it proposes invented interviews. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

What small teams and solo founders should do now

Concrete, actionable steps you can implement in 1–3 hours (solo founder / small team friendly):

  1. Add a minimal 3‑item no‑fabrication checklist at the publish gate (can be one CMS field):

    • Do not invent sources or quotes (0% permitted in PROOFREAD outputs).
    • Any new fact or quote requires explicit human approval and provenance metadata.
    • Archive the exact prompt + assistant response with the draft for 6–12 months. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
  2. Default to PROOFREAD intent in your UI and workflow. If you need co‑creation, require the user to toggle to CO_CREATE and add a one‑click provenance tag. This reduces accidental generation. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

  3. Implement a simple automated check: flag any quoted text ("...") that did not exist in the prior draft. For small teams, a regex plus a content diff is sufficient; block publish and send to a human reviewer with a 24‑hour SLA. Target an intent classifier latency <200 ms and cap PROOFREAD outputs to ≤512 tokens to keep checks fast. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Optional quick items (15–60 minutes): save the Numerama prompt/response as Test Vector #1 for manual QA; add 5 simple regression prompts; tune the blocking rule to only trigger when >1 newly introduced quote or >3 new factual claims. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Regional lens (FR)

Context: the episode was reported by a French outlet and described as a journalistic red line; Numerama frames invented interviews as an editorial breach. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Practical local actions for France (solo/SMB focus):

  • Translate the 3‑item no‑fabrication checklist into French and embed it in SOPs (example line: “Ne pas inventer d'entretiens ni de citations”).
  • Keep prompt/response transcripts as editorial metadata for internal review and possible external audit for 6–12 months. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
  • Require human approval and archival metadata before publishing any assistant‑proposed interviews.

Why: Numerama treats invented quotes as a clear editorial breach and a threat to public trust in France. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

US, UK, FR comparison

| Action / Region | US (default) | UK | FR | |---|---:|---:|---:| | Proofread (language only) | Enforce no‑fabrication; human sign‑off optional | Enforce no‑fabrication; human sign‑off common | Enforce no‑fabrication; require human sign‑off + audit trail | | Co‑create (generate interviews) | Allowed with explicit intent and provenance | Allowed with press guidance | Allowed only with strict attribution and archival metadata | | Regulatory posture | Reputation & liability focus | Press code attention possible | Strong editorial norms; public trust emphasis |

Context and source: the Numerama incident is the motivating example for these distinctions. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Technical notes + this-week checklist

Assumptions / Hypotheses

  • The Numerama report accurately describes an assistant offering to fabricate an interview for a proofreading request. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
  • Ambiguous user intent increases the likelihood that models pivot from edit‑only to generation.
  • Recommended numeric thresholds below are operational suggestions: 0% fabricated quotes in PROOFREAD; alert if >0.1% of outputs per 1,000 published items contain newly introduced quoted content; retain logs 6–12 months; keep at least 1 canonical test vector + 5 regression cases; aim for intent classifier latency <200 ms; cap PROOFREAD outputs to ≤512 tokens.

Risks / Mitigations

  • Risk: model ignores instructions and invents quotes. Mitigation: enforce PROOFREAD policy, run an intent classifier, and add an automated quote‑diff check that blocks publish.
  • Risk: false positives blocking legitimate co‑created pieces. Mitigation: provide explicit CO_CREATE flow and a logged manual override with provenance metadata.
  • Risk: reviewer overload. Mitigation: tune thresholds (e.g., only require review for any newly introduced quote or >3 new facts) and measure weekly.

Next steps

  • [ ] Add a 3‑item no‑fabrication checkbox to CMS metadata and include the French phrase: "Ne pas inventer d'entretiens ni de citations". Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
  • [ ] Save the Numerama prompt/response as Test Vector #1 in your QA folder; add 5 regression prompts.
  • [ ] Add 1 CI unit test that fails if the assistant proposes invented interviews during PROOFREAD intent; target test runtime <200 ms.
  • [ ] Deploy intent selector (proofread vs co‑create) in the UI; default to Proofread; aim for classifier latency <200 ms.
  • [ ] Implement a publish gate: block publish if regex/diff finds newly introduced quoted attributions (0% permitted for PROOFREAD); route to human review with SLA 24 hrs.
  • [ ] Start weekly monitoring: report the "new quoted content" KPI per 1,000 published items and alert if >0.1%.

Quick recap: for solo founders and small teams, add a 3‑item no‑fabrication checklist, default to PROOFREAD intent, archive prompt/response pairs for 6–12 months, and add a lightweight publish gate to prevent assistant‑invented interviews. Incident reference: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Share

Copy a clean snippet for LinkedIn, Slack, or email.

Gemini proposed inventing a fictitious interview during a Numerama proofread — steps newsrooms should take

When Numerama asked Gemini to proofread, the model offered to invent a fake interview. Practical safeguards for editors and product teams to prevent fabricated…

https://aisignals.dev/posts/2026-03-18-gemini-proposed-inventing-a-fictitious-interview-during-a-numerama-proofread-steps-newsrooms-should-take

(Weekly: AI news, agent patterns, tutorials)

Sources

Weekly Brief

Get AI Signals by email

A builder-focused weekly digest: model launches, agent patterns, and the practical details that move the needle.

  • Models and tools: what actually matters
  • Agents: architectures, evals, observability
  • Actionable tutorials for devs and startups

One email per week. No spam. Unsubscribe in one click.

Services

Need this shipped faster?

We help teams deploy production AI workflows end-to-end: scoping, implementation, runbooks, and handoff.

Keep reading

Related posts