Linear residual PCNs have identical width- and depth-stable parameterizations to backpropagation?

Linear residual PCNs have identical width- and depth-stable parameterizations to backpropagation

PC energy converges to quadratic BP loss when width >> depth, yielding equal gradients?

PC energy converges to quadratic BP loss when width >> depth, yielding equal gradients

Convergence holds empirically for nonlinear models including convnets and transformers?

Convergence holds empirically for nonlinear models including convnets and transformers

Research & Papers

Predictive coding networks match backpropagation in wide models

arXiv cs.NE May 25, 2026

⚡New proof shows PCNs converge to BP when width dwarfs depth

Deep Dive

A new paper from Francesco Innocenti, El Mehdi Achour, and Rafal Bogacz (University of Oxford) tackles a fundamental question in neuro-inspired AI: can predictive coding (PC) — a biologically plausible learning algorithm that minimizes an energy function locally — scale as effectively as backpropagation (BP)?

The authors rigorously analyze the infinite width and depth limits of predictive coding networks (PCNs). For linear residual networks, they prove that the set of width- and depth-stable feature-learning parameterizations is exactly the same for PC and BP. Moreover, they show that when model width is much larger than depth, the equilibrated PC energy converges to the quadratic BP loss, meaning PC computes identical gradients to BP. Experiments with nonlinear models (convolutional networks and transformers) confirm that the convergence holds as long as activity equilibrium is reached.

This work constrains which parameterizations are scalable for PC while revealing a plausible mechanism for how the brain might implement error backpropagation using only local synaptic updates — a key step toward reconciling deep learning theory with biological realism.

Key Points

Linear residual PCNs have identical width- and depth-stable parameterizations to backpropagation
PC energy converges to quadratic BP loss when width >> depth, yielding equal gradients
Convergence holds empirically for nonlinear models including convnets and transformers

Why It Matters

Bridges the gap between biologically plausible learning and modern deep learning's workhorse algorithm.

Read Original Article

Predictive coding networks match backpropagation in wide models

Why It Matters

Related Articles

🚀 Stay Ahead in AI