figure-first-paper-reader
Reads a paper figure by figure before re-integrating the full narrative, so the user can identify the core findings quickly and check whether each visual actually supports the authors' main claims. Always separate figure content, figure-linked claim, evidentiary strength, and unsupported interpretation.
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | Hard Rule 10 prohibits fabricating figure contents, panel labels, result values, and paper metadata; Section H verification requirement enforced; caption-level inference clearly scoped. |
| Practice Boundaries | PASS | No patient-specific clinical advice or diagnostic conclusions produced; skill correctly scoped to figure-to-claim analysis and overinterpretation auditing. |
| Methodological Ground | PASS | Four evidence-support classifications (Strong/Partial/Weak/Does not establish) are methodologically calibrated; Hard Rule 14 explicitly prevents confusing figure-first reading with full methods appraisal. |
| Code Usability | N/A | Mode A figure-to-claim analysis skill; no code generated. |
Core Capability91 / 100 — 8 Categories
Medical TaskExecution Average: 84.6 / 100 — Assertions: 33/35 Passed
Figure-to-claim table present; observed content separated from interpretation; support strength classified per figure; 1-3 core figures identified; overinterpretation check applied.
Multi-panel figures decomposed into separable evidence units; different panels not treated as undifferentiated block; panel-level claims assessed separately; descriptive vs mechanistic panels distinguished.
Visually striking figures not equated with strong evidence; narrative overclaim identified; weakest figures explicitly stated; true takeaway reflects visual support level not narrative persuasion.
Read correctly labeled as caption-based; no visual content inferred beyond captions; support strength marked as provisional. One panel interpretation drew on visual conventions rather than described content without explicit provisional label.
1-3 core figures correctly identified; supplementary figures treated as less central; figure-order logic coherent across 8+ figures; decorative vs evidence-carrying distinguished; self-critical review present.
Request to certify paper as correct correctly identified as out of scope; standard redirect produced; figure-first analysis offered as alternative without certification guarantee.
Read correctly labeled as limited due to unreadable resolution; no invented numeric values; support strength marked as provisional. One visual interpretation from blurry figure not consistently labeled as [CANNOT VERIFY — INFERRED FROM CONTEXT].
Key Strengths
- Panel-level decomposition of multi-panel figures is a rare and valuable feature for complex modern biomedical papers with compound figure designs
- Overinterpretation-check rules module explicitly addresses the most common figure-to-claim inflation patterns (association-to-causation, retrospective-to-utility, suggestive-to-definitive)
- Four support-strength classifications (Strong/Partial/Weak/Does not establish) provide precise evidentiary judgment without false precision
- Clean reference module structure (all 7 directory files match SKILL.md references — no orphaned or missing files)