Absolute abstraction: a renormalisation group approach
New theory argues AI needs broad training data, not just depth, to achieve human-like abstraction.
A team of researchers including Carlo Orientale Caputo, Elias Seiffert, and Matteo Marsili has published a groundbreaking paper titled 'Absolute abstraction: a renormalisation group approach' that challenges fundamental assumptions about how AI systems develop abstract representations. The paper argues that while neural network depth helps create abstraction by combining lower-level features (like edges into shapes), depth alone is insufficient for developing truly abstract representations. Instead, the researchers propose that the breadth of training data is equally critical, using a renormalisation group approach—a mathematical framework borrowed from physics—to show how representations expand to encompass broader datasets.
The researchers identify a unique fixed point in this transformation called the 'Hierarchical Feature Model,' which they propose as a candidate for absolutely abstract representations. They tested this theoretical framework through numerical experiments using Deep Belief Networks and auto-encoders trained on datasets of varying breadth. The experiments confirmed that neural network representations approach the Hierarchical Feature Model as both data breadth increases and network depth grows, validating their theoretical predictions. This work bridges theoretical physics concepts with practical machine learning, suggesting that future AI systems might require both architectural depth and exceptionally diverse training data to achieve human-like abstraction capabilities.
- Challenges conventional wisdom that neural network depth alone creates abstraction, arguing data breadth is equally important
- Introduces 'Hierarchical Feature Model' as theoretical fixed point for absolute abstraction using renormalisation group approach
- Validated through experiments with Deep Belief Networks and auto-encoders showing representations approach the model with broader data
Why It Matters
Could guide development of more capable AI systems by emphasizing both architectural design and comprehensive data collection strategies.