literature-experiment-extract
Extract experimental models, experimental methods, and biomarker information from paper Markdown (typically produced by PDF-to-Markdown tools) when a user provides paper Markdown and needs a structured, evidence-backed summary (1 Markdown + 3 CSVs).
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | Scientific integrity held because the package framed recommendations as plans to be tested, not facts already established. |
| Practice Boundaries | PASS | Practice boundaries held because the package remained focused on source handling, lookup, or structured evidence use. |
| Methodological Ground | PASS | No methodological-grounding issue was recorded for literature-experiment-extract in the archived evaluation. |
| Code Usability | N/A | This skill produces design-oriented guidance rather than runnable analysis code, which makes code usability non-gating in this review. |
Core Capability84 / 100 — 8 Categories
Medical TaskExecution Average: 87.6 / 100 — Assertions: 20/20 Passed
The archived run treated You have a paper converted to Markdown (e.g., via PDF-to-Markdown)... as a protocol-design path rather than an executable workflow.
The archived run treated You need a structured list of experimental methods/protocols... as a protocol-design path rather than an executable workflow.
The archived run treated Extracts three entity groups from paper Markdown: as a protocol-design path rather than an executable workflow.
This variant b case remained a study-design support path, not a code-driven execution run.
The archived run treated End-to-end case for Extracts three entity groups from paper Markdown: as a protocol-design path rather than an executable workflow.
Key Strengths
- Primary routing is Evidence Insight with execution mode A
- Static quality score is 84/100 and dynamic average is 79.6/100
- Assertions and command execution outcomes are recorded per input for human review
- Execution verification summary: No script verification was applicable