litbase
Academic paper reading and research development system for biomedical researchers. Finds papers via Semantic Scholar, reads with structured notes, tracks discussion insights, and synthesizes literature into a Research Foundation Document (RFD) for downstream protocol design skills. 8 commands: /setup /feed /read /discuss /recap /update /sync /propose
Veto GatesRequired pass for any deployment consideration
| Dimension | Result | Detail |
|---|---|---|
| Scientific Integrity | PASS | LITERATURE_HARD_RULES.md explicitly prohibits fabricating PMIDs, DOIs, citation counts, sample sizes, and study data; enforced across all 8 commands. [GAP] and [Unverified] labeling conventions replace absent citations rather than fabricating them. |
| Practice Boundaries | PASS | Skill manages literature only; no clinical recommendations or diagnostic conclusions issued at any point in the 8-command workflow. |
| Methodological Ground | PASS | RFD generation uses sound PECOT framework; GAP markers replace unsupported claims rather than fabricating evidence; /sync command enforces retrospective citation integrity. |
| Code Usability | PASS | Optional Python scripts (recommend.py, lookup_paper.py, rename_pdfs.py) present with WebFetch fallbacks for all tiers. Skill operates fully without scripts; optional code does not block operation. |
Core Capability88 / 100 — 8 Categories
Medical TaskExecution Average: 86.7 / 100 — Assertions: 34/35 Passed
Full setup workflow executed. MEMORY.md, search_config.json, and reading_list.md generated from user answers. Tier C auto-configuration completes without requiring manual terminal commands.
Paper discovery workflow executed. Rate limiting, open-access flagging, and recommendations.md output confirmed. Complete Semantic Scholar API outage path (non-429) undefined.
Full 4-section note generated from Semantic Scholar metadata. Author h-index queries succeeded. Auto-sync confirmed across all three target files.
Abstract-only note correctly labeled and bounded. Section I (Paper Weight) limited to available metadata. Auto-sync still executed. User advised to re-run with full text.
Full RFD generated section-by-section with user confirmation gates. All citations traceable to reading_list.md [x] entries. GAP markers inserted for unsupported claims. ASCII framework diagram with citation labels produced.
Training-data knowledge about unread paper correctly labeled as field context (not citation). Discussion Log auto-recorded insights. Proposal readiness check triggered.
Hard rule fires correctly. Refusal is constructive rather than abrupt: [GAP] marker inserted, /feed offered as actionable alternative. User not left without a path forward.
Key Strengths
- LITERATURE_HARD_RULES.md is exemplary: comprehensive, specific, and non-negotiable citation integrity with [GAP] and [Unverified] labeling conventions enforced across all 8 commands
- Three-tier capability adaptation (Web Claude / Manus / Claude Code) is elegant — graceful degradation at every step with no dead ends for any runtime environment
- Eight commands each have exhaustive step-by-step logic in separate files; no ambiguity on agent execution path, tier branching, or auto-sync rules
- Composability built-in: downstream_skills metadata and standardized RFD handoff format make /propose output directly consumable by protocol design skills
- Auto-sync rules (reading_list + search_config + MEMORY.md) prevent state drift across sessions with zero user overhead