Limited Persona Coverage
The study acknowledges testing only a limited set of personas, focusing on sex, ethnicity, and migrant status while excluding other attributes such as gender identity, sexual orientation, religion, and age. This narrow scope may not fully capture the complexity of bias in LLMs and could miss other forms of discrimination.
Limited Benchmark and Language
The evaluation relies solely on the MMLU benchmark and a salary negotiation scenario, both in English. A single benchmark and a single language may not be representative of LLM performance and bias across the diverse tasks and languages in which these models are used.
Noisy Generative Evaluation
Although statistical tests were applied, the generative evaluation method used in Experiments 1 and 2 is known to be noisy and sensitive to minor prompt changes. Because each combination was run only once, this noise could undermine the reliability of the reported differences.
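As an illustration of the kind of robustness check that would address this concern, the sketch below repeats each persona and model combination over several prompt paraphrases and seeds and reports the spread of scores rather than a single point estimate. The `query_model` function and the paraphrase templates are hypothetical placeholders, not the paper's actual evaluation code.

```python
# Minimal sketch of a variance check for the single-run concern: repeat each
# persona x model combination over several prompt paraphrases and seeds, then
# report the mean and spread of scores instead of one point estimate.
import statistics

def query_model(model: str, persona: str, prompt: str, seed: int) -> float:
    """Hypothetical call returning an accuracy score for one run."""
    raise NotImplementedError

# Hypothetical paraphrases of the persona instruction.
PARAPHRASES = [
    "You are {persona}. Answer the following question.",
    "Answer the following question as {persona}.",
    "Adopt the persona of {persona} and respond.",
]

def score_with_variance(model: str, persona: str, seeds=range(5)):
    scores = [
        query_model(model, persona, template.format(persona=persona), seed)
        for template in PARAPHRASES
        for seed in seeds
    ]
    # A standard deviation comparable to the persona gaps themselves would
    # suggest that single-run differences may be noise rather than bias.
    return statistics.mean(scores), statistics.stdev(scores)
```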
Limited Generalizability of Salary Negotiation
The salary negotiation scenario in Experiment 3 is limited to a single US city (Denver) and a specific job title ('Specialist'). The results might not generalize to other locations, job titles, or cultural contexts, limiting their external validity.
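To probe external validity, the same negotiation prompt could be swept over a grid of cities and job titles. The sketch below is illustrative only: `ask_for_salary`, the city list, and the job-title list are assumptions rather than details from the paper.

```python
# Illustrative sweep over cities and job titles to test whether persona-related
# salary gaps generalise beyond the Denver/'Specialist' setting.
from itertools import product

CITIES = ["Denver", "New York", "Berlin", "Bangalore"]        # assumed examples
JOB_TITLES = ["Specialist", "Software Engineer", "Nurse"]      # assumed examples

PROMPT = (
    "You are a {persona} applying for a {job} position in {city}. "
    "What starting salary should you ask for in the negotiation?"
)

def ask_for_salary(model: str, prompt: str) -> float:
    """Hypothetical helper that parses a salary figure from the model's reply."""
    raise NotImplementedError

def sweep(model: str, personas: list[str]) -> dict:
    results = {}
    for persona, city, job in product(personas, CITIES, JOB_TITLES):
        prompt = PROMPT.format(persona=persona, job=job, city=city)
        results[(persona, city, job)] = ask_for_salary(model, prompt)
    # Comparing persona gaps across (city, job) cells would show whether the
    # effect holds beyond a single location and title.
    return results
```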
Limited Socio-Economic Factors
The study focuses on a single socio-economic factor (the pay gap) and does not explore other relevant factors, such as wealth, education, or social status, which could also influence or reflect LLM bias.
Limited Model Selection
The study is limited to five commercially available large language models, which may affect the generalizability of the results.