Online Quantile Regression for Nonparametric Additive Models
New algorithm processes streaming data without storing history, achieving optimal statistical rates.
Haoran Zhan's new paper introduces a projected functional gradient descent (P-FGD) algorithm specifically designed for online quantile regression with nonparametric additive models. This represents a significant advancement in statistical machine learning, extending the functional stochastic gradient descent framework to handle the pinball loss function used in quantile estimation. The algorithm's key innovation lies in its ability to process streaming data without needing to store historical observations, addressing a major limitation in traditional online learning approaches.
Technically, P-FGD maintains computational complexity of O(J_t ln J_t) per step, where J_t denotes the number of basis functions at time t, while requiring only O(J_t) time for quantile function predictions. Through a novel Hilbert space projection identity, Zhan proves the estimator achieves the minimax optimal consistency rate of O(t^{-2s/(2s+1)}), where s represents the smoothness degree of the quantile function. The paper also establishes extensions to mini-batch learning scenarios, making the approach practical for various real-world applications.
The algorithm demonstrates clear advantages over commonly used Reproducing Kernel Hilbert Space (RKHS) methods in online learning contexts, particularly in computational efficiency and memory requirements. By eliminating the need for historical data storage while maintaining statistical optimality, P-FGD enables more scalable implementations of quantile regression for time-sensitive applications where data arrives continuously and storage constraints exist.
- P-FGD algorithm processes streaming data without storing historical observations, reducing memory requirements
- Achieves O(J_t ln J_t) computational complexity per step and O(t^{-2s/(2s+1)}) minimax optimal consistency rate
- Extends functional SGD to pinball loss for quantile regression with nonparametric additive models
Why It Matters
Enables real-time statistical modeling on streaming data for finance, IoT, and monitoring systems where storage is constrained.