BOHM: Zero-cost attribution for compound AI systems from routing weights
New method extracts attribution from existing routing weights, no extra calls needed.
Get AI news that actually matters
One email a day. Zero fluff. Join 10,000+ professionals.
Compound AI systems route tasks through hierarchies of specialized components, but attributing output to individual components has relied on Shapley-based methods (SHAP) that require evaluating the system on arbitrary subsets of components. That requirement fails for third-party APIs, opaque endpoints, and agentic orchestrators that concentrate routing on a few tools. Enter BOHM, a new method from researcher Joss Armstrong that extracts a hierarchical attribution tree directly from the routing weights these systems already maintain.
BOHM's key insight is that leaf attribution equals the path product of root-to-leaf routing weights, and level-k attribution is the induced distribution over depth-k nodes. This means zero marginal cost and no access to component internals. BOHM provides multi-resolution attribution at every level simultaneously — something flat methods like SHAP cannot offer at any evaluation budget. On 18 LLMs in a 3-level hierarchy over 880 LiveCodeBench problems, BOHM yielded Kendall tau=0.928, while SHAP reached tau=0.980 but required 9,000x more coalition evaluations per seed. On a 5-driver, 7-benchmark agentic study, drivers concentrated routing on a single tool (top-share median 0.65), and cell-level agreement between BOHM and SHAP predicted whether the driver's top pick was empirically best. On a US Census hierarchy with 475 leaves and 4 levels, BOHM recovered ground-truth rankings at every level (tau up to 0.722). BOHM satisfies efficiency, monotonicity, symmetry, and weak suppression but not Shapley's additivity — it is a complementary primitive for multi-resolution decomposition wherever routing state exists.
- BOHM extracts attribution from existing routing weights with zero additional cost, unlike SHAP requiring 9,000x more evaluations.
- On 18 LLMs over 880 problems, BOHM achieved Kendall tau=0.928 vs SHAP's 0.980.
- Provides multi-resolution attribution at every hierarchy level simultaneously, which flat methods cannot offer.
Why It Matters
BOHM enables practical attribution for black-box compound AI systems, making agent orchestration more transparent with zero overhead.