Evidence Insight

ena-database

86100Total Score
Core Capability
84 / 100
Functional Suitability
11 / 12
Reliability
9 / 12
Performance & Context
7 / 8
Agent Usability
14 / 16
Human Usability
8 / 8
Security
10 / 12
Maintainability
9 / 12
Agent-Specific
16 / 20
Medical Task
20 / 20 Passed
92Access the European Nucleotide Archive (ENA) via REST APIs and FTP/Aspera to search and retrieve sequences, raw reads (FASTQ), assemblies, and metadata when you have accession IDs or need metadata-driven discovery for genomics pipelines
4/4
88Access the European Nucleotide Archive (ENA) via REST APIs and FTP/Aspera to search and retrieve sequences, raw reads (FASTQ), assemblies, and metadata when you have accession IDs or need metadata-driven discovery for genomics pipelines
4/4
86Multi-object ENA coverage: studies/projects, samples, experiments, runs, assemblies, sequences, analyses, taxonomy records
4/4
86Two primary API styles:
4/4
86End-to-end case for Multi-object ENA coverage: studies/projects, samples, experiments, runs, assemblies, sequences, analyses, taxonomy records
4/4

Veto GatesRequired pass for any deployment consideration

Skill Veto✓ All 4 gates passed
Operational Stability
System remains stable across varied inputs and edge cases
PASS
Structural Consistency
Output structure conforms to expected skill contract format
PASS
Result Determinism
Equivalent inputs produce semantically equivalent outputs
PASS
System Security
No prompt injection, data leakage, or unsafe tool use detected
PASS
Research Veto✅ PASS — Applicable
DimensionResultDetail
Scientific IntegrityPASSScientific content remained anchored to fetched metadata or source-linked evidence in the legacy review.
Practice BoundariesPASSThe package stayed in retrieval, extraction, or evidence-organization scope rather than drifting into unsupported interpretation.
Methodological GroundPASSThe older review treated the package logic as methodologically aligned with its stated workflow.
Code UsabilityN/AThe audited artifact centers on document or reasoning outputs, so code usability is not the main evaluation target here.

Core Capability84 / 1008 Categories

Functional Suitability
Functional suitability was softened by the legacy issue 'Improve stress-case output rigor'. Stress and boundary scenarios show weaker consistency
11 / 12
92%
Reliability
Related legacy finding for ena-database: Improve stress-case output rigor. Stress and boundary scenarios show weaker consistency
9 / 12
75%
Performance & Context
The legacy audit deducted points for ena-database in performance context.
7 / 8
88%
Agent Usability
The legacy audit deducted points for ena-database in agent usability.
14 / 16
88%
Human Usability
Human usability reached full score in the archived evaluation.
8 / 8
100%
Security
The archived evaluation left some headroom for ena-database under security.
10 / 12
83%
Maintainability
A modest deduction remained in maintainability for ena-database in the archived review.
9 / 12
75%
Agent-Specific
Agent specific was softened by the legacy issue 'Improve stress-case output rigor'. Stress and boundary scenarios show weaker consistency
16 / 20
80%
Core Capability Total84 / 100

Medical TaskExecution Average: 87.6 / 100 — Assertions: 20/20 Passed

92
Canonical
Access the European Nucleotide Archive (ENA) via REST APIs and FTP/Aspera to search and retrieve sequences, raw reads (FASTQ), assemblies, and metadata when you have accession IDs or need metadata-driven discovery for genomics pipelines
4/4
88
Variant A
Access the European Nucleotide Archive (ENA) via REST APIs and FTP/Aspera to search and retrieve sequences, raw reads (FASTQ), assemblies, and metadata when you have accession IDs or need metadata-driven discovery for genomics pipelines
4/4
86
Edge
Multi-object ENA coverage: studies/projects, samples, experiments, runs, assemblies, sequences, analyses, taxonomy records
4/4
86
Variant B
Two primary API styles:
4/4
86
Stress
End-to-end case for Multi-object ENA coverage: studies/projects, samples, experiments, runs, assemblies, sequences, analyses, taxonomy records
4/4
92
Canonical✅ Pass
Access the European Nucleotide Archive (ENA) via REST APIs and FTP/Aspera to search and retrieve sequences, raw reads (FASTQ), assemblies, and metadata when you have accession IDs or need metadata-driven discovery for genomics pipelines

This canonical case stayed inside the documented workflow and remained instruction-led.

Basic 36/40|Specialized 56/60|Total 92/100
A1The ena-database output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
88
Variant A✅ Pass
Access the European Nucleotide Archive (ENA) via REST APIs and FTP/Aspera to search and retrieve sequences, raw reads (FASTQ), assemblies, and metadata when you have accession IDs or need metadata-driven discovery for genomics pipelines

This variant a case stayed inside the documented workflow and remained instruction-led.

Basic 34/40|Specialized 54/60|Total 88/100
A1The ena-database output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
86
Edge✅ Pass
Multi-object ENA coverage: studies/projects, samples, experiments, runs, assemblies, sequences, analyses, taxonomy records

Multi-object ENA coverage: studies/projects, samples, experiments,... was evaluated as a bounded documentation path, not as a runnable script workflow.

Basic 33/40|Specialized 53/60|Total 86/100
A1The ena-database output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
86
Variant B✅ Pass
Two primary API styles:

Two primary API styles: was evaluated as a bounded documentation path, not as a runnable script workflow.

Basic 32/40|Specialized 54/60|Total 86/100
A1The ena-database output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
86
Stress✅ Pass
End-to-end case for Multi-object ENA coverage: studies/projects, samples, experiments, runs, assemblies, sequences, analyses, taxonomy records

This stress case stayed inside the documented workflow and remained instruction-led.

Basic 29/40|Specialized 57/60|Total 86/100
A1The ena-database output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
Medical Task Total87.6 / 100

Key Strengths

  • Primary routing is Evidence Insight with execution mode A
  • Static quality score is 84/100 and dynamic average is 79.6/100
  • Assertions and command execution outcomes are recorded per input for human review
  • Execution verification summary: No script verification was applicable