Research & Papers

JD-BP: A Joint-Decision Generative Framework for Auto-Bidding and Pricing

New AI model jointly optimizes bids and pricing corrections, delivering 6.48% better cost efficiency in live tests.

Deep Dive

A research team from JD.com and collaborating institutions has introduced JD-BP, a novel generative AI framework designed to solve a critical flaw in real-time advertising auctions. Current auto-bidding services use machine learning to set bids for advertisers aiming for goals like target return on investment (ROI). However, these systems suffer from 'ex-post' inefficiency—the optimal bid in hindsight differs from the real-time decision due to model prediction errors and system feedback latency. JD-BP's breakthrough is a joint architecture that doesn't just output a bid; it simultaneously generates a 'pricing correction term.' This additive correction works with auction payment rules (like the Generalized Second-Price, or GSP, mechanism) to adjust the final cost, effectively allowing the system to course-correct for past mistakes and uncertainties in real-time.

The framework employs several advanced techniques to make this work. It uses a 'memory-less Return-to-Go' design to encourage future value maximization without being crippled by historical constraint violations. A trajectory augmentation algorithm allows it to generate training data from any existing bidding policy, enabling easy 'plug-and-play' deployment. For training, the team combined a cross-attention module with an Energy-Based Direct Preference Optimization (DPO) method to refine the joint learning of bidding and correction. The results are significant: offline tests on the AuctionNet dataset showed state-of-the-art performance, but the real proof came from live A/B tests on JD.com's massive advertising platform. The online deployment confirmed a 4.70% lift in ad platform revenue and a 6.48% improvement in hitting target cost goals for advertisers.

Key Points
  • JD-BP jointly generates a bid and a pricing correction term to fix real-time errors from prediction and latency, a first in auto-bidding AI.
  • The framework uses Energy-Based DPO and trajectory augmentation for training, allowing deployment on top of existing RL/generative bidding models.
  • Online A/B tests on JD.com's platform proved its effectiveness, increasing ad revenue by 4.70% and improving target cost performance by 6.48%.

Why It Matters

This directly boosts profitability for digital ad platforms and improves ROI for advertisers by making AI-driven bidding more accurate and efficient.