Predictive coding networks match backpropagation in wide models
New proof shows PCNs converge to BP when width dwarfs depth
Get AI news that actually matters
One email a day. Zero fluff. Join 10,000+ professionals.
A new paper from Francesco Innocenti, El Mehdi Achour, and Rafal Bogacz (University of Oxford) tackles a fundamental question in neuro-inspired AI: can predictive coding (PC) — a biologically plausible learning algorithm that minimizes an energy function locally — scale as effectively as backpropagation (BP)?
The authors rigorously analyze the infinite width and depth limits of predictive coding networks (PCNs). For linear residual networks, they prove that the set of width- and depth-stable feature-learning parameterizations is exactly the same for PC and BP. Moreover, they show that when model width is much larger than depth, the equilibrated PC energy converges to the quadratic BP loss, meaning PC computes identical gradients to BP. Experiments with nonlinear models (convolutional networks and transformers) confirm that the convergence holds as long as activity equilibrium is reached.
This work constrains which parameterizations are scalable for PC while revealing a plausible mechanism for how the brain might implement error backpropagation using only local synaptic updates — a key step toward reconciling deep learning theory with biological realism.
- Linear residual PCNs have identical width- and depth-stable parameterizations to backpropagation
- PC energy converges to quadratic BP loss when width >> depth, yielding equal gradients
- Convergence holds empirically for nonlinear models including convnets and transformers
Why It Matters
Bridges the gap between biologically plausible learning and modern deep learning's workhorse algorithm.