A new study reveals how AI judges its own work in financial risk assessment
AI models show surprising bias when judging each other's financial risk analysis.
Researchers tested five leading AI models on their ability to evaluate merchant payment risk. The models showed significant bias, with some consistently scoring themselves too high and others too low. When the models' identities were hidden, this bias decreased by about 26%. The study found that models with a 'negative bias' were actually more aligned with human expert judgment, and four models showed strong correlation with real payment network data.
Why It Matters
This highlights the need for careful oversight when using AI to make critical financial decisions.