Academic Writing

patent-assistant

88100Total Score
Core Capability
82 / 100
Functional Suitability
11 / 12
Reliability
10 / 12
Performance & Context
8 / 8
Agent Usability
13 / 16
Human Usability
7 / 8
Security
8 / 12
Maintainability
9 / 12
Agent-Specific
16 / 20
Medical Task
20 / 20 Passed
96Assists R&D teams with patent technical disclosure drafting and patent/novelty search analysis; use when users ask to write a patent disclosure, structure an invention description, search related patents, or assess novelty
4/4
92Assists R&D teams with patent technical disclosure drafting and patent/novelty search analysis; use when users ask to write a patent disclosure, structure an invention description, search related patents, or assess novelty
4/4
90Converts colloquial technical descriptions into a structured patent technical disclosure document
4/4
90Uses a guided information-collection checklist to fill gaps (problem, prior art defects, core solution, features, effects)
4/4
90End-to-end case for Converts colloquial technical descriptions into a structured patent technical disclosure document
4/4

Veto GatesRequired pass for any deployment consideration

Skill Veto✓ All 4 gates passed
Operational Stability
System remains stable across varied inputs and edge cases
PASS
Structural Consistency
Output structure conforms to expected skill contract format
PASS
Result Determinism
Equivalent inputs produce semantically equivalent outputs
PASS
System Security
No prompt injection, data leakage, or unsafe tool use detected
PASS
Research Veto✅ PASS — Applicable
DimensionResultDetail
Scientific IntegrityPASSThe legacy review did not flag invented scientific claims in the package's writing-oriented output.
Practice BoundariesPASSThe evaluated outputs stayed inside the Assists R&D teams with patent technical disclosure drafting and patent/novelty search analysis workflow rather than drifting into unsupported scientific interpretation.
Methodological GroundPASSNo methodological-grounding issue was recorded for patent-assistant in the archived evaluation.
Code UsabilityPASSThe archived review found the packaged execution path for patent-assistant usable in its intended context.

Core Capability82 / 1008 Categories

Functional Suitability
Related legacy finding for patent-assistant: Improve stress-case output rigor. Stress and boundary scenarios show weaker consistency
11 / 12
92%
Reliability
Related legacy finding for patent-assistant: Improve stress-case output rigor. Stress and boundary scenarios show weaker consistency
10 / 12
83%
Performance & Context
Performance context reached full score in the archived evaluation.
8 / 8
100%
Agent Usability
The archived score suggests slightly clearer routing would help an agent choose the right dissemination path faster.
13 / 16
81%
Human Usability
The archived review left a small gap in how smoothly users can inspect the final dissemination artifacts.
7 / 8
88%
Security
A modest security gap remained because the package could make its safe-use limits even clearer.
8 / 12
67%
Maintainability
The archived review treated the package as maintainable overall, while still leaving some cleanup headroom.
9 / 12
75%
Agent-Specific
Agent specific was softened by the legacy issue 'Improve stress-case output rigor'. Stress and boundary scenarios show weaker consistency
16 / 20
80%
Core Capability Total82 / 100

Medical TaskExecution Average: 91.6 / 100 — Assertions: 20/20 Passed

96
Canonical
Assists R&D teams with patent technical disclosure drafting and patent/novelty search analysis; use when users ask to write a patent disclosure, structure an invention description, search related patents, or assess novelty
4/4
92
Variant A
Assists R&D teams with patent technical disclosure drafting and patent/novelty search analysis; use when users ask to write a patent disclosure, structure an invention description, search related patents, or assess novelty
4/4
90
Edge
Converts colloquial technical descriptions into a structured patent technical disclosure document
4/4
90
Variant B
Uses a guided information-collection checklist to fill gaps (problem, prior art defects, core solution, features, effects)
4/4
90
Stress
End-to-end case for Converts colloquial technical descriptions into a structured patent technical disclosure document
4/4
96
Canonical✅ Pass
Assists R&D teams with patent technical disclosure drafting and patent/novelty search analysis; use when users ask to write a patent disclosure, structure an invention description, search related patents, or assess novelty

The archived run for Assists R&D teams with patent technical disclosure drafting and... stayed on the narrative-deliverable path rather than a code path.

Basic 35/40|Specialized 60/60|Total 96/100
A1The patent-assistant output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
92
Variant A✅ Pass
Assists R&D teams with patent technical disclosure drafting and patent/novelty search analysis; use when users ask to write a patent disclosure, structure an invention description, search related patents, or assess novelty

The archived run for Assists R&D teams with patent technical disclosure drafting and... stayed on the narrative-deliverable path rather than a code path.

Basic 33/40|Specialized 59/60|Total 92/100
A1The patent-assistant output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
90
Edge✅ Pass
Converts colloquial technical descriptions into a structured patent technical disclosure document

Converts colloquial technical descriptions into a structured patent... remained a writing-first workflow and was evaluated without depending on a runnable helper script.

Basic 32/40|Specialized 58/60|Total 90/100
A1The patent-assistant output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
90
Variant B✅ Pass
Uses a guided information-collection checklist to fill gaps (problem, prior art defects, core solution, features, effects)

Uses a guided information-collection checklist to fill gaps... remained a writing-first workflow and was evaluated without depending on a runnable helper script.

Basic 31/40|Specialized 59/60|Total 90/100
A1The patent-assistant output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
90
Stress✅ Pass
End-to-end case for Converts colloquial technical descriptions into a structured patent technical disclosure document

This stress case was handled as a bounded writing workflow, not as an executable pipeline.

Basic 28/40|Specialized 60/60|Total 90/100
A1The patent-assistant output structure matches the documented deliverable
A2The instruction path remains actionable for the documented case
A3The output stays fully within the documented skill boundary
A4The response quality is acceptable for the documented path
Pass rate: 4 / 4
Medical Task Total91.6 / 100

Key Strengths

  • Primary routing is Academic Writing with execution mode B
  • Static quality score is 82/100 and dynamic average is 78.6/100
  • Assertions and command execution outcomes are recorded per input for human review
  • Execution verification summary: Script verification 2/2; adjustment=5. generate_disclosure.py: OK; patent_search.py: OK