time-dependent-roc
Compute time-dependent AUC and ROC curves at multiple clinical time points for survival model evaluation. Inputs: survival time, event status, continuous biomarker or risk score. Outputs: time-AUC line plot, AUC table at 1/3/5 years, iAUC summary.
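As a rough illustration of what an AUC table at fixed horizons represents, the sketch below computes a cumulative/dynamic AUC at 1, 3, and 5 years in Python. It is not the skill's R implementation: the function name `naive_time_dependent_auc` and the toy data are hypothetical, and the estimator simply drops subjects censored before the horizon rather than applying the inverse-probability-of-censoring weights that dedicated survival packages typically use.

```python
from typing import Dict, Sequence


def naive_time_dependent_auc(
    times: Sequence[float],
    events: Sequence[int],
    markers: Sequence[float],
    horizon: float,
) -> float:
    """Cumulative/dynamic AUC at `horizon` without censoring weights (assumption).

    Cases:    subjects with an observed event at or before `horizon`.
    Controls: subjects still under observation after `horizon`.
    Subjects censored at or before `horizon` are excluded.
    """
    rows = list(zip(times, events, markers))
    cases = [m for t, e, m in rows if t <= horizon and e == 1]
    controls = [m for t, e, m in rows if t > horizon]
    if not cases or not controls:
        raise ValueError(f"no cases or no controls at horizon {horizon}")

    # Probability that a random case has a higher marker than a random control,
    # counting ties as one half.
    score = 0.0
    for case_marker in cases:
        for control_marker in controls:
            if case_marker > control_marker:
                score += 1.0
            elif case_marker == control_marker:
                score += 0.5
    return score / (len(cases) * len(controls))


# Hypothetical toy data: follow-up in years, event flag (1 = event), risk score.
follow_up = [0.5, 0.8, 2.0, 2.5, 4.0, 4.5, 6.0, 7.0]
status = [1, 1, 1, 0, 1, 0, 0, 0]
risk = [0.9, 0.4, 0.7, 0.3, 0.6, 0.5, 0.2, 0.1]

auc_by_year: Dict[int, float] = {
    year: naive_time_dependent_auc(follow_up, status, risk, year)
    for year in (1, 3, 5)
}
for year, auc in auc_by_year.items():
    print(f"AUC at {year} years: {auc:.3f}")
```

With heavy censoring this naive version can be biased, so it is meant only to make the case and control definitions concrete.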
Veto Gates
Required pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | Outputs are computed from supplied data and no fabricated study claims, identifiers, or effect estimates were introduced by the skill. |
| Practice Boundaries | PASS | The skill computes ROC and AUC artifacts only and does not cross into diagnosis, treatment advice, or prescriptive clinical recommendations. |
| Methodological Ground | PASS | The workflow correctly targets time-dependent ROC analysis for survival data and rejects unusable time specifications before running analysis. |
| Code Usability | PASS | The R entry point executed successfully on valid inputs in this environment and produced the documented tables, figure, and session metadata. |
Core Capability: 88 / 100 (8 Categories)
Medical Task
Execution Average: 86 / 100; Assertions: 20/20 Passed
Executed cleanly and produced every documented artifact in the expected directory structure.
The non-default marker path worked without adjustment and preserved the requested time points.
This is an expected validation failure path; the skill rejected the request cleanly and printed help text.
This is an expected validation failure path; parameter checks ran before any data loading or analysis work.
A higher-parameter run completed successfully, including TXT exports, figure generation, and session metadata.
Key Strengths
- The skill cleanly separates concise user-facing instructions from implementation details in scripts and references.
- CLI validation is strong: missing inputs and invalid parameters fail early with stable SKILL_* messages and help output.
- The implementation is executable as shipped and produces reproducible table, figure, and session metadata artifacts on valid runs.
- Marker-column flexibility, multiple file formats, and standardized output directories make the skill practical for repeated analysis tasks.
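
The fail-early behavior noted above (parameter checks before any data loading, with stable `SKILL_*` messages) can be pictured with a minimal Python sketch. Everything here is an assumption for illustration: the function `parse_time_points`, the specific code `SKILL_INVALID_TIMES`, and the comma-separated input format are hypothetical and are not taken from the skill's R scripts.

```python
import math
from typing import List


def parse_time_points(spec: str) -> List[float]:
    """Validate a comma-separated time specification such as "1,3,5".

    Raises ValueError with a stable, prefixed message (hypothetical code
    SKILL_INVALID_TIMES) so callers can fail before touching any data.
    """
    tokens = [tok.strip() for tok in spec.split(",")]
    if not spec.strip() or any(tok == "" for tok in tokens):
        raise ValueError(
            "SKILL_INVALID_TIMES: expected a comma-separated list such as '1,3,5'"
        )

    points = []
    for tok in tokens:
        try:
            value = float(tok)
        except ValueError:
            raise ValueError(
                f"SKILL_INVALID_TIMES: '{tok}' is not a number"
            ) from None
        if not math.isfinite(value) or value <= 0:
            raise ValueError(
                f"SKILL_INVALID_TIMES: '{tok}' must be a positive, finite time"
            )
        points.append(value)

    # Sort and de-duplicate so downstream tables have a predictable order.
    return sorted(set(points))


print(parse_time_points("1, 3,5"))
```

Running the check before reading any input file keeps failures cheap and makes the error text easy to match in automated tests.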