Proxy Variable
A seemingly neutral data point that correlates with a protected characteristic and can perpetuate discrimination.
What Is Proxy Variable?
A proxy variable is a feature or data point that, while not directly measuring a protected characteristic, is strongly correlated with one. Common proxy variables in AI hiring include zip code (proxy for race), name (proxy for ethnicity/gender), graduation year (proxy for age), university attended (proxy for socioeconomic status), and gaps in employment history (proxy for gender/disability). AI models can learn to use proxy variables to make predictions that effectively discriminate based on protected characteristics, even when those characteristics are not included as direct inputs. Identifying and mitigating proxy variables is a critical part of AI bias auditing.
Related Terms
Adverse Impact
A substantially different rate of selection in hiring that disadvantages members of a protected group.
Read moreBias Audit
An impartial evaluation of an AI hiring tool to assess whether it produces discriminatory outcomes across protected groups.
Read moreDisparate Impact
Employment practices that are facially neutral but have a disproportionately negative effect on a protected group.
Read moreAlgorithmic Fairness
The principle that algorithms should produce equitable outcomes across different demographic groups.
Read moreReady to Audit Your AI Hiring Tools?
Get your compliance report in minutes. No consulting engagement needed.
Start Your Free Audit