IBM researchers cut agentic coding retry costs by 51.7% with runtime-structured decomposition
New architecture slashes retry tokens from 904 to 436 in code debugging tasks
Researchers Asthana, Zhang, DeLuca, Patel, and Mahindru propose runtime-structured task decomposition for agentic coding systems. Instead of monolithic prompts, they use executable control logic with LLMs only for focused judgments. Tested on Kubernetes root cause analysis and multi-file debugging, the approach achieved up to 51.7% lower retry cost than monolithic systems and 73.2% lower retry cost than static decomposition baselines, improving efficiency, debuggability, and operational reliability.
- IBM's runtime-structured decomposition uses executable control logic instead of monolithic prompts for agentic coding tasks
- Achieved retry cost of 436 tokens vs 904 tokens monolithic (51.7% lower) and 1,632 tokens static (73.2% lower) on Kubernetes root cause analysis
- Only failed subtasks are rerun; LLMs used solely for focused judgment with schema-validated outputs
Why It Matters
Makes AI coding assistants cheaper and more reliable by cutting wasteful retries and improving debuggability.