adverse-event-narrative
1. Confirm the user objective, required inputs, and non-negotiable constraints before doing detailed work. 2. Validate that the request matches the documented scope and stop early if the task would require unsupported as.
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | Scientific integrity remained intact because the package rewrote or structured material without fabricating findings. |
| Practice Boundaries | PASS | The evaluated outputs stayed inside the 1. Confirm the user objective, required inputs, and non-negotiable constraints before doing... workflow rather than drifting into unsupported scientific interpretation. |
| Methodological Ground | PASS | No methodological-grounding issue was recorded for adverse-event-narrative in the archived evaluation. |
| Code Usability | PASS | The legacy audit did not flag code-usability issues for the packaged adverse-event-narrative workflow. |
Core Capability88 / 100 — 8 Categories
Medical TaskExecution Average: 83.6 / 100 — Assertions: 18/20 Passed
The archived evaluation treated 1. Confirm the user objective, required inputs, and non-negotiable... as a clean in-scope run.
The Use this skill for academic writing tasks that require explicit... scenario completed within the documented 1. Confirm the user objective, required inputs, and non-negotiable constraints before doing... boundary.
The 1. Confirm the user objective, required inputs, and non-negotiable... path verified the packaged helper command without exposing a deeper execution issue.
For Packaged executable path(s): scripts/main.py, the preserved evidence is lightweight but positive: the packaged validation command behaved as expected.
The 1. Confirm the user objective, required inputs, and non-negotiable constraints before doing... path verified the packaged helper command without exposing a deeper execution issue.
Key Strengths
- Primary routing is Academic Writing with execution mode B
- Static quality score is 88/100 and dynamic average is 83.6/100
- Assertions and command execution outcomes are recorded per input for human review