Jointly prunes irrelevant features and outliers using a single Bayesian marginal likelihood objective?

Jointly prunes irrelevant features and outliers using a single Bayesian marginal likelihood objective

Preserves conjugacy and enables closed-form updates, unlike many robust regression extensions?

Preserves conjugacy and enables closed-form updates, unlike many robust regression extensions

Demonstrated across eight regression tasks, achieving sparser and more robust predictions than standard Sparse Bayesian Learning?

Demonstrated across eight regression tasks, achieving sparser and more robust predictions than standard Sparse Bayesian Learning

Research & Papers

Joint ARD method prunes features and outliers in one Bayesian step

arXiv stat.ML May 29, 2026

⚡ICML 2026 paper from Timans et al. makes sparse recovery robust to data contamination.

Deep Dive

Sparse Bayesian Learning (ARD) has long been a staple for feature selection in linear systems, but its assumption of homoscedastic noise makes it brittle when data contains outliers or misspecifications. In a paper published at ICML 2026, Alexander Timans and co-authors introduce a symmetric pruning framework that jointly optimizes relevance weights for both features and individual data points. By extending the marginal likelihood objective, the method learns which samples to discard alongside which predictors to keep, effectively performing robust regression and model sparsification in a single pass.

The key technical insight is that the new formulation preserves conjugacy and admits closed-form update rules, making it computationally attractive for standard optimization pipelines. Empirical results across eight regression datasets show that joint ARD consistently outperforms standard Sparse Bayesian Learning and popular robust baselines, achieving sparser models with lower prediction error. The approach also aligns with influence function theory, providing a principled Bayesian justification for outlier removal. This work opens the door to more resilient sparse recovery in high-dimensional settings where both feature selection and data quality are critical.

Key Points

Jointly prunes irrelevant features and outliers using a single Bayesian marginal likelihood objective
Preserves conjugacy and enables closed-form updates, unlike many robust regression extensions
Demonstrated across eight regression tasks, achieving sparser and more robust predictions than standard Sparse Bayesian Learning

Why It Matters

A practical extension to ARD that automatically cleans data while selecting features, improving reliability in noisy real-world applications.

Read Original Article

Joint ARD method prunes features and outliers in one Bayesian step

Why It Matters

Related Articles

🚀 Stay Ahead in AI