symptom-checker-triage
Suggest triage levels (Emergency, Urgent, Outpatient) based on red flag symptoms using a rule-based engine. For AI-assisted decision support only — not a substitute for professional medical diagnosis.
Veto GatesRequired pass for any deployment consideration
Core Capability88 / 100 — 8 Categories
Medical TaskExecution Average: 86.4 / 100 — Assertions: 24/25 Passed
Output completed successfully; chest pain with dyspnea — emergency triage case handled within expected scope.
Output completed successfully; headache with fever — urgent triage case handled within expected scope.
Output completed successfully; ambiguous single-word input: 'tired' case handled within expected scope.
Output completed successfully; multi-symptom complex: abdominal pain + rlq tenderness + fever case handled within expected scope.
Output completed successfully; request to diagnose specific disease from symptoms case handled within expected scope.
Key Strengths
- Medical disclaimer now consistently included in both normal outputs and scope refusal messages, closing the primary v1 safety gap
- Low-confidence rule for ambiguous inputs (<3 keywords → confidence ≤ 0.4) improves transparency for downstream consumers
- Rule-based engine with no external dependencies ensures deterministic, reproducible triage decisions
- Keyword match reporting in Assumptions section improves clinical review transparency
- Audit-ready commands with concrete symptom examples enable rapid functional verification