imagegenskill
Veto GatesRequired pass for any deployment consideration
Core Capability87 / 100 — 8 Categories
Medical TaskExecution Average: 94.6 / 100 — Assertions: 20/20 Passed
The You need scientific-looking diagrams/posters (laboratory poster... scenario completed within the documented Generate renderable, scientific-style SVG graphics directly from natural-language... boundary.
The The user requests SVG output specifically (e.g., “output SVG”,... scenario completed within the documented Generate renderable, scientific-style SVG graphics directly from natural-language... boundary.
The archived evaluation treated Converts a natural-language brief into a renderable SVG with a... as a clean in-scope run.
Multiple built-in styles via STYLE: remained well-aligned with the documented contract in the preserved audit.
The archived evaluation treated End-to-end case for Converts a natural-language brief into a... as a clean in-scope run.
Key Strengths
- Primary routing is Other with execution mode B
- Static quality score is 87/100 and dynamic average is 81.6/100
- Assertions and command execution outcomes are recorded per input for human review
- Execution verification summary: Script verification 1/1; adjustment=5. svg_gen.py: OK