Protocol Design

mendelian-randomization-protocol-designer

Generates complete Mendelian randomization study designs from a user-provided exposure and outcome direction. Always use this skill whenever a user wants to design, plan, or build a Mendelian randomization study — even if phrased as "help me write a paper on X", "design an MR stu

91100Total Score
Core Capability
94 / 100
Functional Suitability
12 / 12
Reliability
10 / 12
Performance & Context
7 / 8
Agent Usability
15 / 16
Human Usability
7 / 8
Security
12 / 12
Maintainability
11 / 12
Agent-Specific
20 / 20
Medical Task
34 / 35 Passed
92Canonical input for mendelian-randomization-protocol-designer
5/5
92Variant A input for mendelian-randomization-protocol-designer
5/5
89Variant B input for mendelian-randomization-protocol-designer
5/5
87Edge input for mendelian-randomization-protocol-designer
5/5
87Stress input for mendelian-randomization-protocol-designer
5/5
87Scope Boundary input for mendelian-randomization-protocol-designer
5/5
87Adversarial input for mendelian-randomization-protocol-designer
4/5

Veto GatesRequired pass for any deployment consideration

Skill Veto✓ All 4 gates passed
Operational Stability
System remains stable across varied inputs and edge cases
PASS
Structural Consistency
Output structure conforms to expected skill contract format
PASS
Result Determinism
Equivalent inputs produce semantically equivalent outputs
PASS
System Security
No prompt injection, data leakage, or unsafe tool use detected
PASS
Research Veto✅ PASS — Applicable
DimensionResultDetail
Scientific IntegrityPASSNo fabricated references, DOIs, PMIDs, statistical values, or clinical data detected.
Practice BoundariesPASSNo diagnostic conclusions or unapproved treatment recommendations produced.
Methodological GroundPASSNo methodological fallacies detected; ethical compliance requirements noted where applicable.
Code UsabilityN/ANo code generated; Mode A skill

Core Capability94 / 1008 Categories

Functional Suitability
Full marks (12/12); no significant issues detected.
12 / 12
100%
Reliability
Exceptional MR-specific methodological rules; four workload configs well-defined
10 / 12
83%
Performance & Context
Strong score (7/8); minor gaps noted.
7 / 8
88%
Agent Usability
Strong score (15/16); minor gaps noted.
15 / 16
94%
Human Usability
Strong score (7/8); minor gaps noted.
7 / 8
88%
Security
Full marks (12/12); no significant issues detected.
12 / 12
100%
Maintainability
Strong score (11/12); minor gaps noted.
11 / 12
92%
Agent-Specific
Description is among the richest in the collection with comprehensive trigger coverage
20 / 20
100%
Core Capability Total94 / 100

Medical TaskExecution Average: 88.7 / 100 — Assertions: 34/35 Passed

92
Canonical
Canonical input for mendelian-randomization-protocol-designer
5/5
92
Variant A
Variant A input for mendelian-randomization-protocol-designer
5/5
89
Variant B
Variant B input for mendelian-randomization-protocol-designer
5/5
87
Edge
Edge input for mendelian-randomization-protocol-designer
5/5
87
Stress
Stress input for mendelian-randomization-protocol-designer
5/5
87
Scope Boundary
Scope Boundary input for mendelian-randomization-protocol-designer
5/5
87
Adversarial
Adversarial input for mendelian-randomization-protocol-designer
4/5
92
Canonical✅ Pass
Canonical input for mendelian-randomization-protocol-designer

5/5 assertions passed.

Basic 37/40|Specialized 55/60|Total 92/100
A1Core assertion 1 for canonical input
A2Core assertion 2 for canonical input
A3Core assertion 3 for canonical input
A4Core assertion 4 for canonical input
A5Core assertion 5 for canonical input
Pass rate: 5 / 5
92
Variant A✅ Pass
Variant A input for mendelian-randomization-protocol-designer

5/5 assertions passed.

Basic 37/40|Specialized 55/60|Total 92/100
A1Core assertion 1 for variant a input
A2Core assertion 2 for variant a input
A3Core assertion 3 for variant a input
A4Core assertion 4 for variant a input
A5Core assertion 5 for variant a input
Pass rate: 5 / 5
89
Variant B✅ Pass
Variant B input for mendelian-randomization-protocol-designer

5/5 assertions passed.

Basic 36/40|Specialized 53/60|Total 89/100
A1Core assertion 1 for variant b input
A2Core assertion 2 for variant b input
A3Core assertion 3 for variant b input
A4Core assertion 4 for variant b input
A5Core assertion 5 for variant b input
Pass rate: 5 / 5
87
Edge✅ Pass
Edge input for mendelian-randomization-protocol-designer

5/5 assertions passed.

Basic 35/40|Specialized 52/60|Total 87/100
A1Core assertion 1 for edge input
A2Core assertion 2 for edge input
A3Core assertion 3 for edge input
A4Core assertion 4 for edge input
A5Core assertion 5 for edge input
Pass rate: 5 / 5
87
Stress✅ Pass
Stress input for mendelian-randomization-protocol-designer

5/5 assertions passed.

Basic 35/40|Specialized 52/60|Total 87/100
A1Core assertion 1 for stress input
A2Core assertion 2 for stress input
A3Core assertion 3 for stress input
A4Core assertion 4 for stress input
A5Core assertion 5 for stress input
Pass rate: 5 / 5
87
Scope Boundary✅ Pass
Scope Boundary input for mendelian-randomization-protocol-designer

5/5 assertions passed.

Basic 35/40|Specialized 52/60|Total 87/100
A1Core assertion 1 for scope boundary input
A2Core assertion 2 for scope boundary input
A3Core assertion 3 for scope boundary input
A4Core assertion 4 for scope boundary input
A5Core assertion 5 for scope boundary input
Pass rate: 5 / 5
87
Adversarial✅ Pass
Adversarial input for mendelian-randomization-protocol-designer

4/5 assertions passed.

Basic 35/40|Specialized 52/60|Total 87/100
A1Core assertion 1 for adversarial input
A2Core assertion 2 for adversarial input
A3Core assertion 3 for adversarial input
A4Core assertion 4 for adversarial input
A5Core assertion 5 for adversarial input
Pass rate: 4 / 5
Medical Task Total88.7 / 100

Key Strengths

  • Comprehensive MR method coverage including IVW, weighted median, MR-Egger, MR-PRESSO, leave-one-out, Steiger is state-of-the-art
  • Four workload configurations (Lite/Standard/Advanced/Publication+) with recommended primary plan
  • Explicit claim-boundary control preventing colocalization/MR conflation with causality proof
  • Ancestry and LD alignment requirements address the most common MR methodological failure mode