Research & Papers

Tencent Advertising Algorithm Challenge 2025: All-Modality Generative Recommendation

Tencent releases two massive, real-world datasets with 1M and 10M user sequences to benchmark generative recommendation models.

Deep Dive

A research team from Tencent has introduced a major new benchmark for generative recommender systems (GR) through the Tencent Advertising Algorithm Challenge 2025. The core of the challenge is two novel, large-scale datasets—TencentGR-1M and TencentGR-10M—constructed from de-identified Tencent Ads logs. These datasets are designed to address a critical gap in public research: the lack of realistic, industrial-scale data that includes both collaborative identifiers (like user and item IDs) and rich multi-modal content representations (extracted using state-of-the-art embedding models) for the GR paradigm.

The TencentGR-1M dataset serves as a preliminary track with 1 million user sequences, where each sequence contains up to 100 interacted items labeled with exposure and click signals. The final track, TencentGR-10M, scales this to 10 million users and introduces a more complex, business-critical layer by explicitly distinguishing between click and high-value conversion events. The challenge task focuses on multi-modal sequence generation in an advertising context and employs a weighted evaluation metric that prioritizes predicting conversions. By releasing these datasets and baseline model implementations, Tencent aims to catalyze research into building more powerful and practical generative recommenders that can handle the full complexity of real-world digital advertising.

Key Points
  • Introduces two massive public datasets: TencentGR-1M (1M user sequences) and TencentGR-10M (10M sequences), sourced from real Tencent ad logs.
  • Focuses on 'all-modality' generative recommendation, combining collaborative IDs with multi-modal content embeddings (text, image, etc.) for sequence modeling.
  • Includes business-critical signals like clicks and conversions, with a weighted evaluation protocol that values predicting high-value user actions.

Why It Matters

Provides the first industrial-scale benchmark to advance generative AI for ads, moving research beyond simple clicks to predicting real revenue events.