graph-interpretation
Use when interpreting scientific graphs and charts, explaining data visualizations for research presentations, writing figure captions for publications, or analyzing trends in clinical research data. Converts complex visual data into clear, accurate explanations for academic papers, clinical reports, and public presentations.
Veto Gates
Required pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | The archived evaluation preserved source-faithful writing behavior without adding unsupported results or conclusions. |
| Practice Boundaries | PASS | The archived review kept this package within its declared scope ("Use when interpreting scientific graphs and charts, explaining data visualizations for..."), not result fabrication or expert advice. |
| Methodological Ground | PASS | No methodological-grounding issue was recorded for graph-interpretation in the archived evaluation. |
| Code Usability | N/A | This package is judged mainly on writing behavior, so code usability is not a central evaluation target here. |
Core Capability
87 / 100 (8 categories)
Medical Task
Execution Average: 83.6 / 100; Assertions: 18/20 passed
The archived evaluation treated "Use when interpreting scientific graphs and charts, explaining data..." as a clean in-scope run.
The archived evaluation treated "Use this skill for academic writing tasks that require explicit..." as a clean in-scope run.
"Use when interpreting scientific graphs and charts, explaining data..." remained well aligned with the documented contract in the preserved audit.
"Packaged executable path(s): scripts/main.py" remained well aligned with the documented contract in the preserved audit.
This stress case was mostly intact; the archived review's concern centered on the assertion "The output stays within declared skill scope and target objective."
Key Strengths
- Primary routing is Academic Writing with execution mode B
- Static quality score is 87/100 and dynamic average is 83.6/100
- Assertions and command execution outcomes are recorded per input for human review
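
As a rough illustration of how per-input assertion outcomes could be recorded and rolled up into the 18/20 figure reported above, here is a minimal Python sketch. The `AssertionRecord` schema and the `summarize` and `pass_rate` helpers are hypothetical names introduced for this example; the page does not describe the evaluator's actual data format.

```python
from dataclasses import dataclass


@dataclass
class AssertionRecord:
    """One assertion outcome for one evaluation input (hypothetical schema)."""
    input_id: str
    assertion: str
    passed: bool


def summarize(records: list[AssertionRecord]) -> tuple[int, int]:
    """Count (passed, total) across per-input assertion records."""
    return sum(r.passed for r in records), len(records)


def pass_rate(passed: int, total: int) -> float:
    """Return the percentage of assertions that passed."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * passed / total


# Counts reported on this page: 18 of 20 assertions passed.
print(f"{pass_rate(18, 20):.1f}%")  # prints 90.0%
```

Keeping one record per assertion, rather than only the totals, is what lets a human reviewer trace a failed assertion back to the input that produced it.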