rf-model-importance-analysis
Train random forest classifiers and rank biomarker features by importance using mean decrease in accuracy and Gini impurity. Inputs: feature matrix, class labels. Outputs: trained model, importance table, OOB error curve, partial dependence plots.
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | No fabricated scientific claims, unverifiable statistics, or invented study results appeared in any audited execution. |
| Practice Boundaries | PASS | The skill stayed inside offline statistical execution boundaries and did not issue diagnostic, treatment, or clinical decision advice. |
| Methodological Ground | PASS | The workflow remained aligned with binary random-forest feature ranking, and the documentation clearly warns that preprocessing, imputation, and multiclass use are out of scope. |
| Code Usability | PASS | The CLI help check, packaged tests, canonical runs, plot-only rerender, stress run, and repeated seeded comparison all executed successfully in the audited environment. |
Core Capability97 / 100 — 8 Categories
Medical TaskExecution Average: 88.2 / 100 — Assertions: 20/20 Passed
Executed cleanly and produced the full documented artifact set, including session metadata, model bundle, ranked tables, and both plots.
Handled a tuned forest and Gini-based ranking without instability, and the filtered output remained coherent and within the requested cap.
The expected validation failure was returned immediately with a standardized message, which is correct behavior for this boundary case.
The plot-only branch reused the saved model bundle and regenerated plots without retraining.
A heavier forest configuration completed within the timeout and still produced an interpretable ranked table.
Key Strengths
- The skill is operationally stable across standard, tuned, plot-only, stress, and expected-failure workflows.
- Input validation, path confinement, and offline-only execution give the CLI a strong safety profile.
- Deterministic behavior is explicit and was confirmed by a repeated seeded run with byte-for-byte matching output.
- Documentation, examples, troubleshooting, and packaged tests align closely with the actual implementation.