reporting-guideline-compliance-checker
Checks biomedical manuscripts against reporting guidelines such as CONSORT, STROBE, PRISMA, and TRIPOD to identify missing or weak reporting elements before submission or revision.
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | No fabricated participant flow counts, endpoint definitions, registration numbers, or guideline requirements invented. Hard rules 1, 3, 7 explicitly prohibit fabrication and are consistently enforced. |
| Practice Boundaries | PASS | No diagnostic or prescriptive clinical conclusions. Skill is limited to reporting completeness review, not clinical recommendation. |
| Methodological Ground | PASS | Correct guideline-to-design mapping (CONSORT=RCT, STROBE=observational, PRISMA=SR/MA, TRIPOD=prediction). Hybrid boundary rules correctly flag multi-framework situations. |
| Code Usability | N/A | Mode A skill — no code generated. |
Core Capability93 / 100 — 8 Categories
Medical TaskExecution Average: 90.1 / 100 — Assertions: 34/34 Passed
All five assertions passed. CONSORT correctly identified. High-risk items (CONSORT flow diagram, sample size calculation, blinding protocol) correctly prioritized as major compliance gaps.
All five assertions passed. STROBE correctly identified. Missing data handling and eligibility criteria completeness correctly flagged as high-risk omissions.
All five assertions passed. Clarification-first gate triggered correctly. Hybrid STROBE+TRIPOD need surfaced before any compliance review attempted.
All five assertions passed. PRISMA correctly identified. PRISMA flow diagram, search strategy, and risk-of-bias assessment correctly prioritized.
All five assertions passed. Hybrid boundary rule correctly triggered. TRIPOD primary + REMARK secondary recommended. Dual-framework reporting gaps clearly identified.
All four assertions passed. Correctly declined journal-specific formatting scope. Pivoted to offer valid reporting guideline compliance review.
All five assertions passed. Hard rule 1 applied correctly. Partial review produced with explicit scope limitations, compliance not certified.
Key Strengths
- Five-tier severity model with a formal 'unclear due to missing manuscript material' tier is more nuanced than typical compliance checklist tools — prevents false reassurance
- Dedicated hybrid-study-boundary-rules.md file explicitly addresses the most common real-world misclassification problem (prediction + observational, biomarker + clinical)
- Section F (Submission-Risk Assessment) adds practical submission-oriented prioritization beyond the standard major/moderate/minor split
- Hard rule 5 — never confuse 'missing' with 'not applicable' and never label 'present' what is only weakly reported — targets the three most commonly conflated compliance states
- Four guideline families with clear selection logic plus hybrid escalation path covers the vast majority of biomedical manuscript types