A screening methodology for life-science papers. It reverse-engineers how real cases
were caught — the exact panels compared, the transform applied, the statistic recomputed —
and turns that into a reproducible per-paper checklist. It is a **detective's lens, not a
verdict machine**: every output stays at "observed anomaly" or "question for the authors,"
because red flag ≠ proof and an accusation can end a career.
Trigger when:
.xlsx, or a DOI and asks if it's trustworthy.Do NOT trigger when:
Run this as a chain-of-steps. Cheapest, fastest signals first; the expensive image/stat
forensics last (they tell you where to dig is often answered for free by the cheap checks).
Step 1 — Scope & status. Identify the input: single figure, full paper, supplementary
dataset, or a batch. Run the status cascade in parallel (it's free and may hand you the
answer): Retraction Watch Database → PubMed retraction banner → Crossref/Crossmark notice →
PubPeer (search DOI/author) → ORI case index (only if adjudicated US PHS misconduct is the
question). Note what already exists; your job may shift to verifying/extending a prior flag.
Step 2 — Ordered screen. Walk the pipeline, recording each hit; do not stop at the first:
.xlsx → calcChain.For stats-heavy/clinical papers, swap 3 and 4. For a batch question, run M5 (recurrence across papers is the signal).
Step 3 — Match a model & classify. For each hit, Read references/sop_models.md, match the
operation model (M1–M7), and name the sub-type + Bik category. Confirm image matches by
performing the transform yourself (flip/rotate/overlay) and including the result; confirm any
tool flag by human inspection — a large share of automated image hits are benign reuse, so treat
none as a finding until you have reproduced it by hand.
Step 4 — Benign-explanation gate (mandatory before any escalation). Run the benign-explanation
checklist in M6. Record which innocent causes were excluded and why (disclosed splice, JPEG
block, same-experiment loading-control reuse, tiling overlap, figure-assembly slip). No
"looks suspicious → flag." Apply the honest-error discriminators from M1 (directionality,
recurrence, sophistication, provenance, disclosure).
Step 5 — Grade & document. Default every finding to Tier 1 (observed anomaly). Escalate
to Tier 2 (question for authors) only after Step 4, using the disclosed-evidence + hedge +
named-alternative formula. Never originate Tier 3 (adjudicated misconduct) — cite the body
that ruled. Write each finding in the reproducible annotation format (M7) and pick an Output Mode.
| # | Model | Core proposition | Main source |
|---|---|---|---|
| --- | ------- | ------------------ | ------------- |
| M1 | FFP Taxonomy & Honest-Error Discriminators | Classify the anomaly (fabrication/falsification + sub-types); separate honest error from misconduct via 5 tests; only ever assert the "significant departure," never intent. | ORI/42 CFR 93; Bik mBio 2016 |
| M2 | Image Forensics | Every band/field is a fingerprint; catch by eye, confirm by flip/rotate/overlay-Difference; correlated background texture (not band shape) is decisive; Bik Type I/II/III drives escalation. | Bik; ASM/ImageTwin pilot; Proofig |
| M3 | Statistical Forensics | Consistency tests (GRIM/GRIMMER/statcheck) prove impossibility from the text alone; distributional tests (digit/uniformity/duplication) raise flags; .xlsx calcChain exposes moved rows. | Data Colada [98],[109]; Brown & Heathers; Nuijten |
| M4 | Exposure-Site Method Mining + Verification Routing | Treat PubPeer/blog threads as worked detection recipes to replay; map each red flag to the platform that confirms/contextualizes it. | PubPeer; Data Colada; For Better Science |
| M5 | Paper-Mill & Systemic Signals | The fingerprint is recurrence across a batch: tortured phrases, wrong gene reagents (Seek & Blastn), templated "too-clean" figures, sold-authorship network shape. | Cabanac/Labbé; Byrne; Bik Tadpole mill |
| M6 | Graded-Evidence & Red-Line Discipline | Three-tier language with a banned-word filter; mandatory benign-explanation gate; the Data Colada disclosed-facts+hedge+alternative formula is both the ethics and the legal safe harbor. | COPE; Gino v. Data Colada; Sarkar v. Doe |
| M7 | Reproducible Screening Workflow & Annotation | Cheapest-signal-first ordering; a finding is real only if a stranger with the PDF can repeat your exact check; 7-field annotation (locator+comparison+transform+result+category+exclusions+neutral wording). | Bik; PubPeer FAQ; STM Integrity Hub |
Full cards (inputs, action steps, evidence, failure modes, boundaries, confidence) live in
references/sop_models.md. Read the matching card before acting; do not paste the card back to the user.
| Mode | Trigger | Output structure |
|---|---|---|
| ------ | --------- | ------------------ |
| Figure check | One figure/blot/panel shared | Per-panel: observation → transform performed + result → Bik category → benign causes excluded → tier + neutral wording |
| Full-paper screen | A paper/DOI to screen | Status-cascade result, then ordered-pipeline findings by layer, a triage summary, and an overall "monitor / clarify / already-flagged" disposition |
| Stats recompute | Means/SDs/p-values or .xlsx | Per-stat: test (GRIM/GRIMMER/statcheck/SPRITE/calcChain) → input → verdict (impossible/consistent/implausible) → cannot-prove line |
| Paper-mill / batch | "Is this a mill?" / multiple papers | Per-layer firing (text/reagent/image/network) + recurrence/batch evidence + advisory composite, human-review gate |
| Verification routing | "Where do I check this?" | The red-flag → platform routing table: which site, how to query, what it confirms |
| Annotation draft | "Write a PubPeer-grade comment" | The 7-field reproducible annotation, neutral and hedged, with the transform result attached |
| File | What | When to read |
|---|---|---|
| ------ | ------ | -------------- |
references/sop_models.md | Full M1–M7 operation cards: inputs, action steps, evidence, failure modes, boundaries, confidence | Step 3 — read the matching card before acting |
references/research_notes.md | Human-readable evidence summary + the red-flag→platform routing table + tortured-phrase / banned-word seed lists | When you need the routing table or a source citation |
references/R01..R07-*.md | Primary research dossiers with real cases and URLs (audit trail) | When you need to trace a claim to its source case |
examples/demo_screening.md | Worked screening transcripts (figure check, stats recompute, boundary refusal) | To see the expected output shape |
共 1 个版本