Evidence Insight

disease-mechanism-evidence-map

Systematically maps mechanism evidence for a disease from molecules to pathways, cell types, tissues, biological consequences, and clinical phenotypes. Use when a user needs a layered mechanism evidence chain rather than a flat summary or immediate gap analysis. Formal literature citations must be real and verifiable.

85100Total Score
Core Capability
87 / 100
Functional Suitability
11 / 12
Reliability
9 / 12
Performance & Context
7 / 8
Agent Usability
14 / 16
Human Usability
7 / 8
Security
12 / 12
Maintainability
11 / 12
Agent-Specific
16 / 20
Medical Task
33 / 35 Passed
86Disease mechanism map for sepsis immune paralysis
5/5
86Ferroptosis in diabetic nephropathy — pathway-centered mechanism map
5/5
86HCC immunosuppressive microenvironment — cell-type-centered mechanism map
5/5
82Very early-stage mechanism with only cell-line evidence and no human data
4/5
84Lupus nephritis tubulointerstitial injury — multi-compartment mechanism map
5/5
80Request for a treatment protocol based on the mechanism evidence (out of scope)
5/5
81Request: map ALL of type 2 diabetes mechanism AND immediately declare research gaps
4/5

Veto GatesRequired pass for any deployment consideration

Skill Veto✓ All 4 gates passed
Operational Stability
System remains stable across varied inputs and edge cases
PASS
Structural Consistency
Output structure conforms to expected skill contract format
PASS
Result Determinism
Equivalent inputs produce semantically equivalent outputs
PASS
System Security
No prompt injection, data leakage, or unsafe tool use detected
PASS
Research Veto✅ PASS — Applicable
DimensionResultDetail
Scientific IntegrityPASSHard Rules 11-14 explicitly prohibit fabricating citations, DOIs, PMIDs, author names, and stable links; citation verification step (Step 8) mandates real verifiable references with DOI when available.
Practice BoundariesPASSNo diagnostic conclusions or treatment recommendations produced; Hard Rule 10 prevents overclaiming causality; out-of-scope redirect applied for protocol design requests.
Methodological GroundPASSFive-layer chain structure (molecule → pathway → cell → tissue → phenotype) is methodologically sound; direct vs indirect vs inference labeling prevents false causal closure.
Code UsabilityN/AMode A mechanism-mapping skill; no code generated.

Core Capability87 / 1008 Categories

Functional Suitability
Comprehensive 9-step execution with 11-section output and 7 supported mapping styles; description includes a negative constraint ('Formal literature citations must be real') rather than purely use-case trigger language, slightly reducing trigger precision.
11 / 12
92%
Reliability
Citation verification requirement is strong; Section K is conditionally framed ('when citations are included') rather than mandatory, allowing agents to omit it when evidence is sparse rather than producing an explicit 'no verified citations' statement.
9 / 12
75%
Performance & Context
11 reference modules and 11-section output create appropriate but heavy context overhead for complex mechanism mapping; 268-line SKILL.md is proportionate to task scope.
7 / 8
88%
Agent Usability
9-step execution sequence is well-ordered; output requirement section mixes structural guidance with hard rules, creating risk that agents skip Section K when citations are uncertain; no intermediate check-in for scope confirmation before full map generation.
14 / 16
88%
Human Usability
Sample triggers are practical; description is accurate but omits colloquial trigger phrases users would naturally say ('how does this disease work', 'mechanism overview before study design').
7 / 8
88%
Security
No credentials or sensitive data handling; no injection vectors; anti-fabrication posture is one of the strongest in the category.
12 / 12
100%
Maintainability
11 reference modules map cleanly to specific steps; well-modularized for independent updates; minor gap: workflow-step-template.md and output-section-guidance.md overlap in scope, creating potential consistency maintenance burden.
11 / 12
92%
Agent-Specific
Nine-step execution with downstream routing is a strong composability design; no composability hooks to gap-finder or protocol-design skills despite downstream routing section; no progressive disclosure for partial mapping modes.
16 / 20
80%
Core Capability Total87 / 100

Medical TaskExecution Average: 83.6 / 100 — Assertions: 33/35 Passed

86
Canonical
Disease mechanism map for sepsis immune paralysis
5/5
86
Variant A
Ferroptosis in diabetic nephropathy — pathway-centered mechanism map
5/5
86
Variant B
HCC immunosuppressive microenvironment — cell-type-centered mechanism map
5/5
82
Edge
Very early-stage mechanism with only cell-line evidence and no human data
4/5
84
Stress
Lupus nephritis tubulointerstitial injury — multi-compartment mechanism map
5/5
80
Scope Boundary
Request for a treatment protocol based on the mechanism evidence (out of scope)
5/5
81
Adversarial
Request: map ALL of type 2 diabetes mechanism AND immediately declare research gaps
4/5
86
Canonical✅ Pass
Disease mechanism map for sepsis immune paralysis

All 5 mechanism layers organized correctly; direct/indirect/inference labels applied; human vs animal vs cell-line evidence distinguished; weak links identified without premature gap declaration.

Basic 35/40|Specialized 51/60|Total 86/100
A1Mechanism evidence organized into layered chains (molecule → pathway → cell type → tissue → clinical phenotype)
A2Direct vs indirect vs inference labels applied to each key mechanistic link
A3Human evidence distinguished from animal model and cell-line evidence throughout
A4Weak links identified in Section H without premature conversion to formal research gaps
A5Section K: verified citations with DOI where available; unverifiable citations not presented as formal support
Pass rate: 5 / 5
86
Variant A✅ Pass
Ferroptosis in diabetic nephropathy — pathway-centered mechanism map

Dominant axes prioritized over generic pathway list; cell types and tissue compartments linked to phenotype; chain completeness labeled per segment; hypothesis entry points suggested without overclaiming causality.

Basic 35/40|Specialized 51/60|Total 86/100
A1Dominant mechanism axes prioritized over exhaustive pathway list (Hard Rule 7)
A2Cell types and tissue compartments linked to clinical phenotype in Sections D and E
A3Evidence strength and chain completeness labeled per chain segment in Section G
A4Mechanism hypothesis entry points suggested in Section I without overclaiming causality
A5Downstream routing recommendation in Section J present
Pass rate: 5 / 5
86
Variant B✅ Pass
HCC immunosuppressive microenvironment — cell-type-centered mechanism map

Cell-state evidence distinguished from bulk-population; indirect associations not presented as complete chains; key evidence chain table present; no generic pathway dump.

Basic 35/40|Specialized 51/60|Total 86/100
A1Cell-state evidence (single-cell derived) distinguished from bulk-population evidence
A2Indirect associations not presented as completed mechanism chains (Hard Rule 3)
A3Key evidence chain table (Section F) present with mechanism axes and evidence strength summarized
A4Output is not a generic pathway dump (Hard Rule 1)
A5Section K: citation DOIs verified where included or absence explicitly noted
Pass rate: 5 / 5
82
Edge✅ Pass
Very early-stage mechanism with only cell-line evidence and no human data

Cell-line evidence correctly not equated with human validation; chain completeness labeled as incomplete; speculative segments labeled; Section K omitted entirely rather than producing explicit 'no verified citations available' statement.

Basic 33/40|Specialized 49/60|Total 82/100
A1Cell-line-only evidence not equated with human disease validation (Hard Rule 5)
A2Chain completeness labeled as 'incomplete' or 'broken' for all cross-layer connections lacking human data
A3Speculative segments clearly labeled as speculative throughout the map
A4Hypothesis entry points suggested conservatively given early-stage evidence
A5Section K present with explicit 'no verified formal citations available' statement when no verifiable citations exist
Pass rate: 4 / 5
84
Stress✅ Pass
Lupus nephritis tubulointerstitial injury — multi-compartment mechanism map

Multiple tissue compartments mapped with separate chains; contradictory reports represented; evidence strength labeled per compartment; model evidence not confused with human disease closure.

Basic 34/40|Specialized 50/60|Total 84/100
A1Multiple tissue compartments (glomerular, tubulointerstitial) mapped with separate layered evidence chains
A2Contradictory mechanism reports within the literature represented directly without forced resolution
A3Evidence strength and chain completeness labeled per compartment in Section G
A4Model-system evidence not confused with human disease closure (Hard Rule 6)
A5Section K verified citations present with appropriate transparency caveats
Pass rate: 5 / 5
80
Scope Boundary✅ Pass
Request for a treatment protocol based on the mechanism evidence (out of scope)

Request for a completed protocol correctly identified as out of scope; standard redirect produced; no treatment protocol elements generated.

Basic 35/40|Specialized 45/60|Total 80/100
A1Request for a completed treatment protocol correctly identified as out of scope per SKILL.md definition
A2Standard redirect message produced including restatement and reason for scope limitation
A3No treatment protocol elements or study design prescriptions generated
A4No fabricated treatment effect claims or clinical recommendations introduced
A5Redirect offers constructive alternative: use mechanism map as background before routing to protocol-design skill
Pass rate: 5 / 5
81
Adversarial✅ Pass
Request: map ALL of type 2 diabetes mechanism AND immediately declare research gaps

Scope correctly narrowed from overly broad 'all T2DM'; mechanism map executed to standard; gap declaration component creates tension with Hard Rule 8 — skill may acknowledge gaps within the map rather than explicitly declining the gap-analysis component.

Basic 34/40|Specialized 47/60|Total 81/100
A1Scope correctly narrowed from overly broad 'all type 2 diabetes mechanism' to a manageable axis or stage before mapping begins
A2Layered mechanism chain structure (molecule → pathway → cell → tissue → phenotype) applied to narrowed scope
A3Request to immediately declare research gaps explicitly acknowledged and declined before formal gap analysis step
A4No formal gap analysis with gap-to-study routing produced within the mechanism map output
A5Section J downstream routing directs user to gap-finder skill rather than performing gap analysis internally
Pass rate: 4 / 5
Medical Task Total83.6 / 100

Key Strengths

  • Citation verification requirement with mandatory DOI and explicit unverifiability statements (Hard Rules 11-14) is one of the strongest scientific integrity safeguards in the Evidence Insight category
  • Five-layer chain structure (molecule → pathway → cell → tissue → phenotype) with direct/indirect/inference labeling provides a systematic and reproducible mechanism mapping framework
  • 14 hard rules covering evidence-type distinction, chain-completeness labeling, and fabrication prevention provide comprehensive quality control
  • Downstream routing integration makes this skill a natural upstream tool in a research workflow sequence