evidence-beliefs-ablation

Status: IN

Beliefs alone outperform beliefs + expert prompt: Opus 100% vs 94.2% (+5.8pp), Sonnet 94.2% vs 91.8% (+2.4pp). Adding expert prompt hurts — agent trusts its 'expertise' instead of consulting the knowledge base

Source: repo:beliefs-pi/entries/2026/03/15/beliefs-ablation-results-structured-knowledge-beats-prompt-engineering-expert-prompts-may-hurt.md

Depended on by

JSON