Evidence Insight
alphafold-db
92 / 100 Total Score
Core Capability
85 / 100
Functional Suitability
11 / 12
Reliability
9 / 12
Performance & Context
8 / 8
Agent Usability
14 / 16
Human Usability
8 / 8
Security
9 / 12
Maintainability
9 / 12
Agent-Specific
17 / 20
Medical Task
20 / 20 Passed
100
You have a UniProt accession (e.g., P00520) and need to download its AlphaFold-predicted 3D structure in mmCIF or PDB format
4/4
97
You want to assess prediction reliability using per-residue pLDDT confidence scores
4/4
95
Fetch AlphaFold DB predicted structures by UniProt accession
4/4
94
Download structure files in mmCIF (default) or PDB
4/4
94
End-to-end case for Fetch AlphaFold DB predicted structures by UniProt accession
4/4
Veto Gates
Required pass for any deployment consideration
Skill Veto: ✓ All 4 gates passed

| Gate | Criterion | Result |
|---|---|---|
| Operational Stability | System remains stable across varied inputs and edge cases | PASS |
| Structural Consistency | Output structure conforms to expected skill contract format | PASS |
| Result Determinism | Equivalent inputs produce semantically equivalent outputs | PASS |
| System Security | No prompt injection, data leakage, or unsafe tool use detected | PASS |

Research Veto: ✅ PASS (Applicable)

| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | Scientific content remained anchored to fetched metadata or source-linked evidence in the legacy review. |
| Practice Boundaries | PASS | The package stayed in retrieval, extraction, or evidence-organization scope rather than drifting into unsupported interpretation. |
| Methodological Ground | PASS | The older review treated the package logic as methodologically aligned with its stated workflow. |
| Code Usability | PASS | Code usability passed because the search or lookup workflow still exposed a usable entrypoint and output expectation. |
Core Capability: 85 / 100 (8 Categories)
Functional Suitability
The legacy audit deducted 1 point from alphafold-db for functional suitability.
11 / 12
92%
Reliability
The archived review deducted 3 points from alphafold-db for reliability.
9 / 12
75%
Performance & Context
The legacy audit gave this package full marks for performance and context.
8 / 8
100%
Agent Usability
The legacy audit deducted 2 points from alphafold-db for agent usability.
14 / 16
88%
Human Usability
Human usability reached full score in the archived evaluation.
8 / 8
100%
Security
The archived review deducted 3 points from alphafold-db for security.
9 / 12
75%
Maintainability
The legacy audit deducted 3 points from alphafold-db for maintainability.
9 / 12
75%
Agent-Specific
The archived review deducted 3 points from alphafold-db in the agent-specific category.
17 / 20
85%
Core Capability Total: 85 / 100
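The percentages and the total above can be rechecked from the raw category scores. The following is a minimal sketch, not part of the evaluated package: the dictionary name `CORE_CAPABILITY` and the helper `percent` are hypothetical, and the rule that halves round up is an assumption chosen to match the displayed 88% for 14 / 16.

```python
# Category scores as (earned, maximum), copied from the listing above.
CORE_CAPABILITY = {
    "Functional Suitability": (11, 12),
    "Reliability": (9, 12),
    "Performance & Context": (8, 8),
    "Agent Usability": (14, 16),
    "Human Usability": (8, 8),
    "Security": (9, 12),
    "Maintainability": (9, 12),
    "Agent-Specific": (17, 20),
}


def percent(earned: int, maximum: int) -> int:
    """Whole-number percentage using integer math; halves round up (assumed rule)."""
    return (200 * earned + maximum) // (2 * maximum)


total_earned = sum(earned for earned, _ in CORE_CAPABILITY.values())
total_maximum = sum(maximum for _, maximum in CORE_CAPABILITY.values())

for name, (earned, maximum) in CORE_CAPABILITY.items():
    print(f"{name}: {earned} / {maximum} ({percent(earned, maximum)}%)")
print(f"Total: {total_earned} / {total_maximum}")
```

Integer arithmetic is used so the rounding does not depend on floating-point representation.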
Medical Task: Execution Average 96 / 100, Assertions 20/20 Passed
100
Canonical
You have a UniProt accession (e.g., P00520) and need to download its AlphaFold-predicted 3D structure in mmCIF or PDB format
4/4 ✓
97
Variant A
You want to assess prediction reliability using per-residue pLDDT confidence scores
4/4 ✓
95
Edge
Fetch AlphaFold DB predicted structures by UniProt accession
4/4 ✓
94
Variant B
Download structure files in mmCIF (default) or PDB
4/4 ✓
94
Stress
End-to-end case for Fetch AlphaFold DB predicted structures by UniProt accession
4/4 ✓
100
Canonical ✅ Pass
You have a UniProt accession (e.g., P00520) and need to download its AlphaFold-predicted 3D structure in mmCIF or PDB format
The archived evaluation treated "You have a UniProt accession (e.g., P00520) and need to download..." as a clean in-scope run.
Basic 38/40 | Specialized 60/60 | Total 100/100
✅ A1: The alphafold-db output structure matches the documented deliverable
✅ A2: The script execution path completed successfully for the documented case
✅ A3: The output stays fully within the documented skill boundary
✅ A4: The response quality is acceptable for the documented path
Pass rate: 4 / 4
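To make the canonical scenario concrete, here is a minimal sketch of turning a UniProt accession into an AlphaFold DB file URL. It is not the package's `fetch_structure.py`, whose contents are not shown in this report. The function name `structure_url`, the `AF-<accession>-F<fragment>-model_v<version>` file naming, and the default `version=4` are assumptions that should be checked against the current AlphaFold DB release before use. The sketch only builds the URL and makes no network request.

```python
import re

# Accession pattern as published in the UniProt help pages.
UNIPROT_ACCESSION = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)

# Assumed base path for AlphaFold DB file downloads.
BASE_URL = "https://alphafold.ebi.ac.uk/files"


def structure_url(accession: str, fmt: str = "cif", version: int = 4, fragment: int = 1) -> str:
    """Build a download URL for one predicted structure (hypothetical helper).

    fmt is "cif" (mmCIF, the default) or "pdb". The version and fragment
    defaults are assumptions, not values taken from the evaluated package.
    """
    accession = accession.strip().upper()
    if not UNIPROT_ACCESSION.fullmatch(accession):
        raise ValueError(f"not a valid UniProt accession: {accession!r}")
    if fmt not in ("cif", "pdb"):
        raise ValueError("fmt must be 'cif' or 'pdb'")
    return f"{BASE_URL}/AF-{accession}-F{fragment}-model_v{version}.{fmt}"


print(structure_url("P00520"))
print(structure_url("P00520", fmt="pdb"))
```

Validating the accession before building the URL keeps malformed input from reaching the remote service.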
97
Variant A ✅ Pass
You want to assess prediction reliability using per-residue pLDDT confidence scores
The archived evaluation treated "You want to assess prediction reliability using per-residue pLDDT..." as a clean in-scope run.
Basic 36/40 | Specialized 60/60 | Total 97/100
✅ A1: The alphafold-db output structure matches the documented deliverable
✅ A2: The script execution path completed successfully for the documented case
✅ A3: The output stays fully within the documented skill boundary
✅ A4: The response quality is acceptable for the documented path
Pass rate: 4 / 4
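As an illustration of the pLDDT scenario, the sketch below reads per-residue scores from a PDB-format file. AlphaFold writes pLDDT into the B-factor column of each ATOM record, so one value per residue can be taken from the C-alpha atom. The function names are hypothetical and this is not the package's own code. The band thresholds (90, 70, 50) follow the AlphaFold DB colour legend, and the handling of values that fall exactly on a threshold is an assumption.

```python
from statistics import mean


def plddt_by_residue(pdb_text: str) -> dict[int, float]:
    """Map residue number to pLDDT, read from C-alpha ATOM records.

    Uses the fixed PDB columns: atom name 13-16, residue number 23-26,
    B-factor 61-66 (which holds pLDDT in AlphaFold models).
    """
    scores = {}
    for line in pdb_text.splitlines():
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            scores[int(line[22:26])] = float(line[60:66])
    return scores


def confidence_band(plddt: float) -> str:
    """Classify one score; boundary handling is an assumption."""
    if plddt > 90:
        return "very high"
    if plddt > 70:
        return "high"
    if plddt > 50:
        return "low"
    return "very low"


def mean_plddt(pdb_text: str) -> float:
    """Average pLDDT over all residues that have a C-alpha atom."""
    return mean(plddt_by_residue(pdb_text).values())
```

A file fetched in mmCIF format would need a different parser, since mmCIF is not column-based.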
95
Edge ✅ Pass
Fetch AlphaFold DB predicted structures by UniProt accession
The archived evaluation treated "Fetch AlphaFold DB predicted structures by UniProt accession" as a clean in-scope run.
Basic 35/40 | Specialized 60/60 | Total 95/100
✅ A1: The alphafold-db output structure matches the documented deliverable
✅ A2: The script execution path completed successfully for the documented case
✅ A3: The output stays fully within the documented skill boundary
✅ A4: The response quality is acceptable for the documented path
Pass rate: 4 / 4
94
Variant B ✅ Pass
Download structure files in mmCIF (default) or PDB
The "Download structure files in mmCIF (default) or PDB" scenario completed within the documented "Access over 200M protein structures from AlphaFold DB" boundary.
Basic 34/40 | Specialized 60/60 | Total 94/100
✅ A1: The alphafold-db output structure matches the documented deliverable
✅ A2: The script execution path completed successfully for the documented case
✅ A3: The output stays fully within the documented skill boundary
✅ A4: The response quality is acceptable for the documented path
Pass rate: 4 / 4
94
Stress ✅ Pass
End-to-end case for Fetch AlphaFold DB predicted structures by UniProt accession
The archived evaluation treated "End-to-end case for Fetch AlphaFold DB predicted structures by..." as a clean in-scope run.
Basic 31/40 | Specialized 60/60 | Total 94/100
✅ A1: The alphafold-db output structure matches the documented deliverable
✅ A2: The script execution path completed successfully for the documented case
✅ A3: The output stays fully within the documented skill boundary
✅ A4: The response quality is acceptable for the documented path
Pass rate: 4 / 4
Medical Task Total: 96 / 100
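The 96 / 100 execution average and the 20/20 assertion count follow from the five scenario results. A short sketch of that arithmetic, with the hypothetical name `SCENARIOS` holding the values copied from the scenario blocks above:

```python
# (total score, assertions passed, assertions run) per scenario, from the report.
SCENARIOS = {
    "Canonical": (100, 4, 4),
    "Variant A": (97, 4, 4),
    "Edge": (95, 4, 4),
    "Variant B": (94, 4, 4),
    "Stress": (94, 4, 4),
}

execution_average = sum(score for score, _, _ in SCENARIOS.values()) / len(SCENARIOS)
assertions_passed = sum(passed for _, passed, _ in SCENARIOS.values())
assertions_run = sum(run for _, _, run in SCENARIOS.values())

print(f"Execution Average: {execution_average:.0f} / 100")
print(f"Assertions: {assertions_passed}/{assertions_run} Passed")
```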
Key Strengths
- Primary routing is Evidence Insight with execution mode B
- Static quality score is 85/100 and dynamic average is 83.6/100
- Assertions and command execution outcomes are recorded per input for human review
- Execution verification summary: Script verification 1/1; adjustment=5. fetch_structure.py: OK