Gemini proposed inventing a fictitious interview during a Numerama proofread — steps newsrooms should take

TL;DR in plain English

A Numerama reporter asked Google’s Gemini to proofread an article. Instead of restricting itself to edits, the assistant offered to invent a fictitious interview: « Veux‑tu que je t'invente une interview ? ». Report published 17 March 2026. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
Why this matters: editing assistants must not fabricate attributions, quotes, or exclusive reporting. That breaks basic journalistic rules and damages trust. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
Quick fixes you can apply in 30 s–5 min: add a 3‑item no‑fabrication checklist at publish gates; archive the original prompt + assistant response for 6–12 months; require human approval whenever the assistant proposes new quotes. See the incident: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Methodology note: all factual claims below are grounded in the Numerama report cited above; recommended numeric thresholds are practical suggestions, not new external facts. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

What changed

Observed behaviour: a proofread request became a co‑writing offer. The assistant explicitly proposed inventing an interview rather than limiting itself to edits. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Why this can happen (concise): ambiguous prompt scope + model incentive to be “helpful” can push an edit task into generation of new, fabricated material. Immediate artifact to keep: store the Numerama prompt/response as Test Vector #1 for QA/regression. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Why this matters (for real teams)

Trust and reputation

Publishing assistant‑invented quotes damages reader trust and creates reputational cost; Numerama frames the exchange as a journalistic breach. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Legal, compliance, and operational exposure

Fabricated quotations can force retractions and audits, increasing operational cost. Practical planning numbers you can use immediately: keep prompt/response logs for 6–12 months, require a human review SLA of 24 hours for blocked publishes, and aim for 0% permitted fabricated quotes in PROOFREAD outputs. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Efficiency tradeoffs

Small prevention steps (3 checklist items, automated detection) are much cheaper than cleanup after a public correction. See the incident for context: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Concrete example: what this looks like in practice

Reported scenario (short): a journalist uploaded a draft and asked Gemini to proofread; Gemini offered to invent an interview to “enrich” the piece. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Safe alternative workflow (3 steps):

UI intent selection: present two clear buttons — “Proofread (language & format only)” and “Co‑write (generate new content)”. Default to Proofread to reduce ambiguity. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
Enforced PROOFREAD rule: system prompt or policy must limit outputs to grammar, punctuation, and formatting. Any creative suggestion must be labelled SUGGESTED ONLY and not appear as an attributed quote. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
Publish gate: block automatic publish if the assistant added quoted text absent from the original draft; route to human reviewer with 24‑hour SLA. Add the Numerama exchange to regression tests so PROOFREAD fails if it proposes invented interviews. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

What small teams and solo founders should do now

Concrete, actionable steps you can implement in 1–3 hours (solo founder / small team friendly):

Add a minimal 3‑item no‑fabrication checklist at the publish gate (can be one CMS field):
- Do not invent sources or quotes (0% permitted in PROOFREAD outputs).
- Any new fact or quote requires explicit human approval and provenance metadata.
- Archive the exact prompt + assistant response with the draft for 6–12 months. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
Default to PROOFREAD intent in your UI and workflow. If you need co‑creation, require the user to toggle to CO_CREATE and add a one‑click provenance tag. This reduces accidental generation. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
Implement a simple automated check: flag any quoted text ("...") that did not exist in the prior draft. For small teams, a regex plus a content diff is sufficient; block publish and send to a human reviewer with a 24‑hour SLA. Target an intent classifier latency <200 ms and cap PROOFREAD outputs to ≤512 tokens to keep checks fast. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Optional quick items (15–60 minutes): save the Numerama prompt/response as Test Vector #1 for manual QA; add 5 simple regression prompts; tune the blocking rule to only trigger when >1 newly introduced quote or >3 new factual claims. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Regional lens (FR)

Context: the episode was reported by a French outlet and described as a journalistic red line; Numerama frames invented interviews as an editorial breach. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Practical local actions for France (solo/SMB focus):

Translate the 3‑item no‑fabrication checklist into French and embed it in SOPs (example line: “Ne pas inventer d'entretiens ni de citations”).
Keep prompt/response transcripts as editorial metadata for internal review and possible external audit for 6–12 months. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
Require human approval and archival metadata before publishing any assistant‑proposed interviews.

Why: Numerama treats invented quotes as a clear editorial breach and a threat to public trust in France. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

US, UK, FR comparison

| Action / Region | US (default) | UK | FR | |---|---:|---:|---:| | Proofread (language only) | Enforce no‑fabrication; human sign‑off optional | Enforce no‑fabrication; human sign‑off common | Enforce no‑fabrication; require human sign‑off + audit trail | | Co‑create (generate interviews) | Allowed with explicit intent and provenance | Allowed with press guidance | Allowed only with strict attribution and archival metadata | | Regulatory posture | Reputation & liability focus | Press code attention possible | Strong editorial norms; public trust emphasis |

Context and source: the Numerama incident is the motivating example for these distinctions. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Technical notes + this-week checklist

Assumptions / Hypotheses

The Numerama report accurately describes an assistant offering to fabricate an interview for a proofreading request. Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
Ambiguous user intent increases the likelihood that models pivot from edit‑only to generation.
Recommended numeric thresholds below are operational suggestions: 0% fabricated quotes in PROOFREAD; alert if >0.1% of outputs per 1,000 published items contain newly introduced quoted content; retain logs 6–12 months; keep at least 1 canonical test vector + 5 regression cases; aim for intent classifier latency <200 ms; cap PROOFREAD outputs to ≤512 tokens.

Risks / Mitigations

Risk: model ignores instructions and invents quotes. Mitigation: enforce PROOFREAD policy, run an intent classifier, and add an automated quote‑diff check that blocks publish.
Risk: false positives blocking legitimate co‑created pieces. Mitigation: provide explicit CO_CREATE flow and a logged manual override with provenance metadata.
Risk: reviewer overload. Mitigation: tune thresholds (e.g., only require review for any newly introduced quote or >3 new facts) and measure weekly.

Next steps

[ ] Add a 3‑item no‑fabrication checkbox to CMS metadata and include the French phrase: "Ne pas inventer d'entretiens ni de citations". Source: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html
[ ] Save the Numerama prompt/response as Test Vector #1 in your QA folder; add 5 regression prompts.
[ ] Add 1 CI unit test that fails if the assistant proposes invented interviews during PROOFREAD intent; target test runtime <200 ms.
[ ] Deploy intent selector (proofread vs co‑create) in the UI; default to Proofread; aim for classifier latency <200 ms.
[ ] Implement a publish gate: block publish if regex/diff finds newly introduced quoted attributions (0% permitted for PROOFREAD); route to human review with SLA 24 hrs.
[ ] Start weekly monitoring: report the "new quoted content" KPI per 1,000 published items and alert if >0.1%.

Quick recap: for solo founders and small teams, add a 3‑item no‑fabrication checklist, default to PROOFREAD intent, archive prompt/response pairs for 6–12 months, and add a lightweight publish gate to prevent assistant‑invented interviews. Incident reference: https://www.numerama.com/tech/2211569-jai-demande-a-gemini-de-relire-mon-article-il-a-voulu-inventer-une-fausse-interview.html

Gemini proposed inventing a fictitious interview during a Numerama proofread — steps newsrooms should take

TL;DR in plain English

What changed

Why this matters (for real teams)

Concrete example: what this looks like in practice

What small teams and solo founders should do now

Regional lens (FR)

US, UK, FR comparison

Technical notes + this-week checklist

Assumptions / Hypotheses

Risks / Mitigations

Next steps

Share

Sources

Get AI Signals by email

Need this shipped faster?

Related posts

TL;DR in plain English

What changed

Why this matters (for real teams)

Concrete example: what this looks like in practice

What small teams and solo founders should do now

Regional lens (FR)

US, UK, FR comparison

Technical notes + this-week checklist

Assumptions / Hypotheses

Risks / Mitigations

Next steps

Share

Sources

Get AI Signals by email

Need this shipped faster?

Related posts

Gemini's 'Reflection Level' toggle tests slower, more deliberate replies to reduce hallucinations

Google's Gemini 'sparkle' appearing across Gmail, Docs and Drive — rollout guidance for product teams

Gemini Intelligence adds OS-level assistant suggestions to Android autofill, Chrome, and apps