Academic Writing

meta-results-funnel-plot-generator

86100Total Score
Core Capability
77 / 100
Functional Suitability
9 / 12
Reliability
9 / 12
Performance & Context
8 / 8
Agent Usability
12 / 16
Human Usability
7 / 8
Security
8 / 12
Maintainability
9 / 12
Agent-Specific
15 / 20
Medical Task
20 / 20 Passed
96Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report
4/4
92Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report
4/4
90Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs
4/4
90Packaged executable path(s): scripts/main.py
4/4
90End-to-end case for Scope-focused workflow aligned to: Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report
4/4

Veto GatesRequired pass for any deployment consideration

Skill Veto✓ All 4 gates passed
Operational Stability
System remains stable across varied inputs and edge cases
PASS
Structural Consistency
Output structure conforms to expected skill contract format
PASS
Result Determinism
Equivalent inputs produce semantically equivalent outputs
PASS
System Security
No prompt injection, data leakage, or unsafe tool use detected
PASS
Research Veto✅ PASS — Applicable
DimensionResultDetail
Scientific IntegrityPASSThe archived evaluation preserved source-faithful writing behavior without adding unsupported results or conclusions.
Practice BoundariesPASSThe evaluated outputs stayed inside the Generates a Meta-analysis results section description for funnel plots, including... workflow rather than drifting into unsupported scientific interpretation.
Methodological GroundPASSThe legacy audit preserved a method-grounded interpretation of the Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs workflow.
Code UsabilityPASSNo code-usability failure was preserved for meta-results-funnel-plot-generator in the legacy evaluation.

Core Capability77 / 1008 Categories

Functional Suitability
The archived deduction in functional suitability traces back to: Improve stress-case output rigor. Stress and boundary scenarios show weaker consistency
9 / 12
75%
Reliability
Reliability was softened by the legacy issue 'Improve stress-case output rigor'. Stress and boundary scenarios show weaker consistency
9 / 12
75%
Performance & Context
The legacy audit gave full marks to performance context for this package.
8 / 8
100%
Agent Usability
The package guides agents reasonably well, while still leaving a little room for crisper trigger wording.
12 / 16
75%
Human Usability
The writing package is readable, though the archived score suggests slightly cleaner presentation would help.
7 / 8
88%
Security
The workflow stayed safe overall, with only a small remaining deduction around boundary signaling.
8 / 12
67%
Maintainability
The archived review treated the package as maintainable overall, while still leaving some cleanup headroom.
9 / 12
75%
Agent-Specific
Agent specific was softened by the legacy issue 'Improve stress-case output rigor'. Stress and boundary scenarios show weaker consistency
15 / 20
75%
Core Capability Total77 / 100

Medical TaskExecution Average: 91.6 / 100 — Assertions: 20/20 Passed

96
Canonical
Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report
4/4
92
Variant A
Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report
4/4
90
Edge
Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs
4/4
90
Variant B
Packaged executable path(s): scripts/main.py
4/4
90
Stress
End-to-end case for Scope-focused workflow aligned to: Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report
4/4
96
Canonical✅ Pass
Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report

The archived run for Generates a Meta-analysis results section description for funnel... stayed on the narrative-deliverable path rather than a code path.

Basic 35/40|Specialized 60/60|Total 96/100
A1The meta-results-funnel-plot-generator output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
92
Variant A✅ Pass
Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report

Generates a Meta-analysis results section description for funnel... remained a writing-first workflow and was evaluated without depending on a runnable helper script.

Basic 33/40|Specialized 59/60|Total 92/100
A1The meta-results-funnel-plot-generator output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
90
Edge✅ Pass
Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs

The archived run for Generates a Meta-analysis results section description for funnel... stayed on the narrative-deliverable path rather than a code path.

Basic 32/40|Specialized 58/60|Total 90/100
A1The meta-results-funnel-plot-generator output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
90
Variant B✅ Pass
Packaged executable path(s): scripts/main.py

The archived run for Packaged executable path(s): scripts/main.py stayed on the narrative-deliverable path rather than a code path.

Basic 31/40|Specialized 59/60|Total 90/100
A1The meta-results-funnel-plot-generator output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
90
Stress✅ Pass
End-to-end case for Scope-focused workflow aligned to: Generates a Meta-analysis results section description for funnel plots, including statistical tables (Egger's, Begg's, Trim & Fill) and figure legends. Supports English and Chinese outputs. Use when user provides a funnel plot image and statistics and wants a formatted report

The archived run for Generates a Meta-analysis results section description for funnel plots, including... stayed on the narrative-deliverable path rather than a code path.

Basic 28/40|Specialized 60/60|Total 90/100
A1The meta-results-funnel-plot-generator output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
Medical Task Total91.6 / 100

Key Strengths

  • Primary routing is Academic Writing with execution mode B
  • Static quality score is 77/100 and dynamic average is 78.6/100
  • Assertions and command execution outcomes are recorded per input for human review
  • Execution verification summary: Script verification 1/1; adjustment=5. main.py: OK