AI Safety

New framework uses risk tiers to audit gender bias in AI image generation

FAccT 2026 paper introduces THUMB cards to standardise bias auditing for T2I models

Deep Dive

Text-to-image (T2I) models are increasingly used for education, media, and public communication, but they tend to reinforce gender stereotypes, leading to representational erasure and skewed perceptions of role suitability. Existing bias metrics are fragmented and reported without a shared understanding of assumptions or context, limiting their usefulness for technical auditing and governance. Researchers from the paper 'Context Matters: Auditing Gender Bias in T2I Generation through Risk-Tiered Use-Case Profiles' (FAccT 2026) address this by proposing a comprehensive framework that aligns risk categories with evaluation metrics and harms.

The framework consists of three constituents: risk-tiered use-case profiles aligned with the EU AI Act's risk categories (unacceptable, high, limited) to vary auditing expectations; a metric catalog consolidating gender bias evaluation methods into gender prediction, embedding similarity, and downstream task categories; and a harm typology mapping context-dependent harms (e.g., representational, quality-of-service) to specific risk scenarios. The paper introduces THUMB cards (Text-to-image Harms-informed Use-case-aligned Metrics of gender Bias), which systematise auditing by incorporating context, scenario, bias manifestation, harm hypotheses, and audit strategy. This enables a standardised, risk-aware approach to gender bias auditing across different deployment contexts.

Key Points
  • Framework aligns with EU AI Act risk categories (unacceptable, high, limited) to adjust auditing expectations per deployment context.
  • Metric catalog organises evaluation methods into three categories: gender prediction, embedding similarity, and downstream task.
  • THUMB cards systematise audits by integrating context, scenario, harm hypotheses, and audit strategy for reproducible bias evaluation.

Why It Matters

Standardises gender bias auditing for T2I models, enabling responsible deployment in education, media, and high-stakes applications.