New algorithm learns to bid optimally despite shill manipulation in auctions
Shill bids distort feedback without changing outcomes—new algorithm achieves near-optimal regret bounds.
A new paper by Luigi Foscari, Matilde Tullii, and Vianney Perchet tackles a classic problem in auction theory: shilling. In repeated first-price auctions, a seller may insert fake bids (shills) to make competition appear stiffer, driving up prices. Crucially, the paper studies a subtle form where shilling only affects what the learner observes after losing (the max of the real bid and a shill), not whether the learner wins or loses. This makes learning—and bidding—much harder because the feedback is systematically distorted.
The authors propose a dual-algorithm strategy. The first branch uses robust interval elimination that ignores shill reports entirely, achieving a regret rate of O(T^{2/3})—the dynamic pricing baseline. The second branch is optimistic: it debiases losing-side reports when possible, exploiting low-shill events to achieve the faster O(√T) rate. A validation and racing procedure lets the algorithm switch between branches without knowing the correct scale or feedback geometry in advance. The paper also proves a matching lower bound, showing the approach is tight up to logarithmic factors. This work has direct implications for automated bidding systems in programmatic advertising, where shilling is a known challenge.
- Shill bids distort feedback (observed losing bids) but do not change the auction allocation.
- The algorithm achieves O(T^{2/3}) regret via robust interval elimination and O(√T) via optimistic debiasing.
- Upper bounds are matched by a lower bound, proving optimality within logarithmic factors for single-active-region cases.
Why It Matters
This research gives automated bidders a provably good defence against feedback manipulation, critical for real-time bidding in online advertising.