Research & Papers

Fortress framework stabilizes search recommendations by pruning volatile features

New method cuts prediction volatility while keeping accuracy in app search rankings.

Deep Dive

In large-scale search and recommendation systems, predictive models often suffer from temporal instability when certain input features—particularly engagement-based ones—cause scores to fluctuate unpredictably over time. This degrades user experience and reliability, especially in multi-stage systems where consistent predictions are critical. The paper presents Fortress, a general framework that stabilizes predictions by pruning volatility-inducing features while keeping the model's predictive accuracy intact.

Fortress operates in four steps: (1) collect historical snapshots of score fluctuations, (2) identify samples with unstable predictions, (3) isolate and remove the offending features, and (4) retrain using only stable signals. The authors validate Fortress on a real-world query-to-app relevance model from a large app marketplace. Offline results show significant improvements in both stability (measured by Coefficient of Variation) and classification performance (PR-AUC), demonstrating that suppressing engagement signal volatility doesn't come at the cost of accuracy.

Key Points
  • Fortress uses temporal data augmentation (historical snapshots) to detect score fluctuations for the same entity over time.
  • The framework prunes instability-inducing features while retaining the predictive value of engagement signals.
  • Validated on a large-scale app marketplace: improved prediction stability (Coefficient of Variation) and classification (PR-AUC) simultaneously.

Why It Matters

Stable search recommendations mean fewer erratic results for users and more reliable downstream systems for platforms.