Protocol Design

medical-research-algorithm-matcher

Matches a user’s biomedical research direction, disease problem, study aim, data modality, and resource constraints to the most relevant recent algorithms and method papers. Always search real recent algorithm literature first, prioritize the last 12 months, expand to 1–3 years o

89 / 100 Total Score

Core Capability

91 / 100

Functional Suitability

12 / 12

Reliability

10 / 12

Performance & Context

7 / 8

Agent Usability

15 / 16

Human Usability

7 / 8

Security

12 / 12

Maintainability

11 / 12

Agent-Specific

17 / 20

Medical Task

24 / 25 Passed

91 / 100 Canonical input for medical-research-algorithm-matcher

5/5

91 / 100 Variant A input for medical-research-algorithm-matcher

5/5

88 / 100 Variant B input for medical-research-algorithm-matcher

5/5

86 / 100 Edge input for medical-research-algorithm-matcher

5/5

86 / 100 Stress input for medical-research-algorithm-matcher

4/5

Veto Gates

Required pass for any deployment consideration

Skill Veto ✓ All 4 gates passed

✓

Operational Stability

System remains stable across varied inputs and edge cases

PASS

✓

Structural Consistency

Output structure conforms to expected skill contract format

PASS

✓

Result Determinism

Equivalent inputs produce semantically equivalent outputs

PASS

✓

System Security

No prompt injection, data leakage, or unsafe tool use detected

PASS

Research Veto ✅ PASS (Applicable)

Dimension	Result	Detail
Scientific Integrity	PASS	No fabricated references, DOIs, PMIDs, statistical values, or clinical data detected.
Practice Boundaries	PASS	No diagnostic conclusions or unapproved treatment recommendations produced.
Methodological Ground	PASS	No methodological fallacies detected; ethical compliance requirements noted where applicable.
Code Usability	N/A	No code generated; Mode A skill

Core Capability 91 / 100 (8 Categories)

Functional Suitability

Full marks (12/12); no significant issues detected.

12 / 12

100%

Reliability

The mandatory requirement for a real algorithm paper with a DOI is the strongest integrity safeguard in this skill.

10 / 12

83%

Performance & Context

Strong score (7/8); minor gaps noted.

7 / 8

88%

Agent Usability

Strong score (15/16); minor gaps noted.

15 / 16

94%

Human Usability

Strong score (7/8); minor gaps noted.

7 / 8

88%

Security

Full marks (12/12); no significant issues detected.

12 / 12

100%

Maintainability

Strong score (11/12); minor gaps noted.

11 / 12

92%

Agent-Specific

The description is among the most precise in the collection, with clear triggering and scope signals.

17 / 20

85%

Core Capability Total 91 / 100
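
The eight category scores add up to the reported total, and each percentage is the earned score divided by its maximum, rounded to the nearest whole number. A minimal Python sketch of that arithmetic follows, with the scores copied from the breakdown above. The dictionary and variable names are illustrative only and are not part of the evaluation tool, and the round-half-up rule is an assumption that happens to match the reported figures.

```python
# Category scores copied from the Core Capability breakdown: (earned, maximum).
core_scores = {
    "Functional Suitability": (12, 12),
    "Reliability": (10, 12),
    "Performance & Context": (7, 8),
    "Agent Usability": (15, 16),
    "Human Usability": (7, 8),
    "Security": (12, 12),
    "Maintainability": (11, 12),
    "Agent-Specific": (17, 20),
}

# Sum of earned points and of available points across all categories.
total_earned = sum(earned for earned, _ in core_scores.values())
total_max = sum(maximum for _, maximum in core_scores.values())

# Percentage per category, rounded half up (assumed rounding rule).
percentages = {
    name: int(100 * earned / maximum + 0.5)
    for name, (earned, maximum) in core_scores.items()
}

print(f"Core Capability Total {total_earned} / {total_max}")
for name, pct in percentages.items():
    print(f"{name}: {pct}%")
```
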

Medical Task: Execution Average 88.4 / 100, Assertions 24/25 Passed

Canonical

Canonical input for medical-research-algorithm-matcher

5/5 ✓

Variant A

Variant A input for medical-research-algorithm-matcher

5/5 ✓

Variant B

Variant B input for medical-research-algorithm-matcher

5/5 ✓

Edge

Edge input for medical-research-algorithm-matcher

5/5 ✓

Stress

Stress input for medical-research-algorithm-matcher

4/5 ✓

Canonical ✅ Pass

Canonical input for medical-research-algorithm-matcher

5/5 assertions passed.

Basic 36/40 | Specialized 55/60 | Total 91/100

✅ A1 Core assertion 1 for canonical input

✅ A2 Core assertion 2 for canonical input

✅ A3 Core assertion 3 for canonical input

✅ A4 Core assertion 4 for canonical input

✅ A5 Core assertion 5 for canonical input

Pass rate: 5 / 5

Variant A ✅ Pass

Variant A input for medical-research-algorithm-matcher

5/5 assertions passed.

Basic 36/40 | Specialized 55/60 | Total 91/100

✅ A1 Core assertion 1 for Variant A input

✅ A2 Core assertion 2 for Variant A input

✅ A3 Core assertion 3 for Variant A input

✅ A4 Core assertion 4 for Variant A input

✅ A5 Core assertion 5 for Variant A input

Pass rate: 5 / 5

Variant B ✅ Pass

Variant B input for medical-research-algorithm-matcher

5/5 assertions passed.

Basic 35/40 | Specialized 53/60 | Total 88/100

✅ A1 Core assertion 1 for Variant B input

✅ A2 Core assertion 2 for Variant B input

✅ A3 Core assertion 3 for Variant B input

✅ A4 Core assertion 4 for Variant B input

✅ A5 Core assertion 5 for Variant B input

Pass rate: 5 / 5

Edge ✅ Pass

Edge input for medical-research-algorithm-matcher

5/5 assertions passed.

Basic 34/40 | Specialized 52/60 | Total 86/100

✅ A1 Core assertion 1 for edge input

✅ A2 Core assertion 2 for edge input

✅ A3 Core assertion 3 for edge input

✅ A4 Core assertion 4 for edge input

✅ A5 Core assertion 5 for edge input

Pass rate: 5 / 5

Stress ✅ Pass

Stress input for medical-research-algorithm-matcher

4/5 assertions passed.

Basic 34/40 | Specialized 52/60 | Total 86/100

✅ A1 Core assertion 1 for stress input

✅ A2 Core assertion 2 for stress input

✅ A3 Core assertion 3 for stress input

✅ A4 Core assertion 4 for stress input

❌ A5 Core assertion 5 for stress input

Pass rate: 4 / 5

Medical Task Total 88.4 / 100
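
The 88.4 figure is consistent with a plain mean of the five run totals, where each run total is its Basic score plus its Specialized score. A short Python sketch that reproduces the reported numbers follows. The field names are hypothetical, and the unweighted mean is an assumption, since the report does not state how the average is computed.

```python
# Per-run scores copied from the five Medical Task results above.
runs = {
    "Canonical": {"basic": 36, "specialized": 55, "passed": 5},
    "Variant A": {"basic": 36, "specialized": 55, "passed": 5},
    "Variant B": {"basic": 35, "specialized": 53, "passed": 5},
    "Edge": {"basic": 34, "specialized": 52, "passed": 5},
    "Stress": {"basic": 34, "specialized": 52, "passed": 4},
}
ASSERTIONS_PER_RUN = 5  # each run reports results for assertions A1 to A5

# Run total = Basic (out of 40) + Specialized (out of 60).
totals = {name: r["basic"] + r["specialized"] for name, r in runs.items()}

# Unweighted mean of the run totals (assumed aggregation rule).
execution_average = sum(totals.values()) / len(totals)

# Assertion tally across all runs.
assertions_passed = sum(r["passed"] for r in runs.values())
assertions_total = ASSERTIONS_PER_RUN * len(runs)

print(f"Execution Average: {execution_average:.1f} / 100")
print(f"Assertions: {assertions_passed}/{assertions_total} Passed")
```
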

Key Strengths

Requiring a verified primary method paper with a DOI prevents fabricated algorithm recommendations
Prioritizing the last 12 months, then expanding to 1–3 years, keeps algorithm recommendations current
Published downstream papers citing the algorithm add real-world validation beyond the original paper
An explicit 'no verified algorithm found' response prevents false-positive recommendations
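
The strengths listed above describe a tiered selection rule: keep only papers with a DOI, prefer the last 12 months, fall back to 1–3 years, and otherwise report that nothing was found. The Python sketch below shows one way such a rule could look. It is not the skill's implementation. The `Paper` class, the `select_candidates` function, and the day thresholds are all hypothetical. A non-empty DOI stands in for real verification. The condition that triggers the wider window is truncated in the description at the top of this report, so expanding only when the recent window is empty is an assumption.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional, Tuple


@dataclass
class Paper:
    """Hypothetical record for a candidate method paper."""
    title: str
    doi: Optional[str]  # None means no DOI could be confirmed
    published: date


def select_candidates(papers: List[Paper], today: date) -> Tuple[str, List[Paper]]:
    """Return (tier label, papers) using the assumed tiered rule."""
    # Drop anything without a DOI (stand-in for real verification).
    verified = [p for p in papers if p.doi]

    # Tier 1: published within the last 12 months (approximated as 365 days).
    recent = [p for p in verified if 0 <= (today - p.published).days <= 365]
    if recent:
        return "last 12 months", recent

    # Tier 2: published between 1 and 3 years ago.
    older = [p for p in verified if 365 < (today - p.published).days <= 3 * 365]
    if older:
        return "1-3 years", older

    # Nothing verified in either window: say so instead of guessing.
    return "no verified algorithm found", []


# Usage with placeholder entries (not real papers or real DOIs).
example = [
    Paper("Placeholder method A", "10.0000/placeholder-a", date(2025, 1, 15)),
    Paper("Placeholder method B", None, date(2025, 3, 1)),
]
tier, chosen = select_candidates(example, today=date(2025, 6, 1))
print(tier, [p.title for p in chosen])
```

Passing `today` explicitly keeps the function deterministic, which matches the report's Result Determinism gate in spirit.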