methods-reverse-engineer
Reverse-engineers the methods section of a biomedical paper into a structured, reproducible workflow. Use this skill when a user wants to understand how a study was actually executed, extract data sources, inclusion/exclusion logic, preprocessing, analytical sequence, software/tools, validation path, and critical parameters, or build a replication checklist from a paper, abstract, DOI, PMID, title, screenshot, or partial methods text. Never fabricate references, methods details, identifiers, software versions, parameters, datasets, or validation steps.
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | Hard Rule #13 explicitly prohibits fabricating references, PMIDs, DOIs, trial identifiers, dataset accessions, software versions, assay kits, parameter values, or validation steps. Three-level coverage system prevents over-claiming from partial inputs. |
| Practice Boundaries | PASS | Explicit out-of-scope redirect for patient-specific medical advice and for requests to fabricate missing methods details. No clinical recommendations issued. |
| Methodological Ground | PASS | Three-level input coverage handling is methodologically sound. Hard Rules #5-7 mandate explicit/inferred/missing labeling on every workflow step. Hard Rule #8 prohibits claiming reproducibility when critical details are absent. |
| Code Usability | N/A | Mode A direct execution — no code generated. |
Core Capability95 / 100 — 8 Categories
Medical TaskExecution Average: 86.4 / 100 — Assertions: 34/35 Passed
Full A-K output produced. Ordered workflow with explicit/inferred/missing labels. Software and parameter details extracted. Replication checklist with decision points. Reproduction readiness judgment present.
Study design correctly identified from methods. Inclusion/exclusion logic and sample flow extracted. Ordered pipeline produced. QC and validation path identified. No fabricated cohort details.
Both computational and experimental tracks reconstructed separately. Connection point explicitly mapped. Hybrid status labeled. Reproducibility gaps assessed independently for each track.
Level 3 coverage correctly identified. Constrained design-level outline produced. Reconstruction clearly marked as partial and non-final. Abstract hints not converted to full methods claims.
Vague steps labeled under-specified not filled from convention. Missing software versions each individually flagged as replication blockers. Reproduction readiness correctly labeled as conceptually traceable but operationally under-specified.
Out-of-scope redirect correctly issued. Fabrication of missing parameters declined. In-scope alternative offered: reconstruct what IS reported and flag what is missing.
Hard Rules #5, #7, #14 correctly applied. Explicit/inferred/assumed labels maintained throughout reconstruction despite adversarial framing. Reconstruction proceeds with proper labeling.
Key Strengths
- Three-level input coverage handling (Full/Partial/Minimal) provides the most rigorous input boundary management in this skill collection — prevents abstract-level reconstruction overreach
- Explicit/inferred/missing labeling requirement on every workflow step is a foundational reproducibility discipline that prevents the most common reconstruction failure mode
- Four-category reproduction readiness judgment (directly reproducible / partially reproducible / conceptually traceable / not reproducible) gives researchers an immediately actionable assessment
- Seventeen hard rules covering all major methods reconstruction failure modes — convention assumptions, partial-input overreach, missing-detail invention — are the most comprehensive set in the Evidence Insight category