file-management
Organize, back up, compress, split, and merge files/folders using rule-driven plans; use when you need safe previews, conflict handling, and verification before executing file operations.
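As a rough illustration of the "safe preview, conflict handling, verification" flow described above, the sketch below builds a move plan from extension rules without touching any file, flags targets that already exist, and then executes only the conflict-free entries. The names `plan_moves`, `execute`, and the `rules` mapping are hypothetical and are not taken from the packaged skill.

```python
import shutil
from pathlib import Path


def plan_moves(src_dir, dest_dir, rules):
    """Build a preview plan: a list of (source, target, status) tuples.

    `rules` maps a lowercase file extension (e.g. ".txt") to a folder name
    under `dest_dir`. Nothing on disk is modified here.
    """
    plan = []
    for src in sorted(Path(src_dir).iterdir()):
        if not src.is_file():
            continue
        folder = rules.get(src.suffix.lower())
        if folder is None:
            continue  # no rule matches, so the file stays where it is
        target = Path(dest_dir) / folder / src.name
        # An existing target is reported as a conflict instead of overwritten.
        status = "conflict" if target.exists() else "ok"
        plan.append((src, target, status))
    return plan


def execute(plan):
    """Apply only the conflict-free entries and verify each move."""
    moved = []
    for src, target, status in plan:
        if status != "ok":
            continue
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(src), str(target))
        # Verification step: the target must exist and the source must be gone.
        if not target.exists() or src.exists():
            raise RuntimeError(f"move could not be verified: {src}")
        moved.append(target)
    return moved
```

Keeping planning and execution separate means the plan can be printed and reviewed first; conflicting entries are skipped here rather than resolved, which is only one of several possible policies.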
Veto Gates: Required pass for any deployment consideration
Core Capability: 88 / 100 (8 categories)
Medical Task: Execution Average 90.6 / 100; Assertions 20/20 passed
You need to reorganize a directory (move/copy/rename) based on... remained tied to the documented analysis contract even when the preserved evidence centered on instructions instead of a full rerun.
You want a repeatable backup workflow (copy or archive) with... remained tied to the documented analysis contract even when the preserved evidence centered on instructions instead of a full rerun.
This edge case stayed within the packaged analysis boundary and kept a reviewable task contract.
This variant B case stayed within the packaged analysis boundary and kept a reviewable task contract.
This stress case stayed within the packaged analysis boundary and kept a reviewable task contract.
Key Strengths
- Primary routing is Other with execution mode B
- Static quality score is 88/100 and dynamic average is 77.6/100
- Assertions and command execution outcomes are recorded per input for human review
- Execution verification summary: Script verification 2/2; adjustment=5. merge_parts.py: OK; split_file.py: OK
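The verification summary names `split_file.py` and `merge_parts.py`, but their contents are not shown here. The following is a minimal sketch of how a split, merge, and checksum verification round trip might look, assuming fixed-size parts and SHA-256 for the integrity check. The function names, the part-naming scheme, and the `expected_sha256` parameter are illustrative assumptions, not the packaged scripts' actual interface.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for block in iter(lambda: fh.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def split_file(path, part_size, out_dir):
    """Split `path` into parts of at most `part_size` bytes; return part paths."""
    if part_size <= 0:
        raise ValueError("part_size must be positive")
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    parts = []
    with open(path, "rb") as fh:
        index = 0
        while True:
            data = fh.read(part_size)
            if not data:
                break
            # Hypothetical naming scheme: <name>.part0000, <name>.part0001, ...
            part = out_dir / f"{Path(path).name}.part{index:04d}"
            part.write_bytes(data)
            parts.append(part)
            index += 1
    return parts


def merge_parts(parts, out_path, expected_sha256=None):
    """Concatenate parts in the given order and optionally verify the result."""
    with open(out_path, "wb") as out:
        for part in parts:
            out.write(Path(part).read_bytes())
    if expected_sha256 is not None and sha256_of(out_path) != expected_sha256:
        raise ValueError("checksum mismatch after merge")
    return Path(out_path)
```

A typical round trip would record `sha256_of(original)` before splitting and pass it to `merge_parts` afterwards, so that a missing or reordered part surfaces as a checksum error rather than a silently corrupted file.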