🚧

Note

Model Fairness is in preview mode. Contact us for early access.

Fiddler provides powerful visualizations and metrics to detect model bias. We support structured (tabular) models for classification tasks in both the Fiddler UI and the API client. These visualizations are available for both production and dataset queries.

Definitions of Fairness

Models are trained on real-world examples to mimic past outcomes on unseen data. The training data could be biased, which means the model will perpetuate the biases in the decisions it makes.

While there is not a universally agreed upon definition of fairness, we define a ‘fair’ ML model as a model that does not favor any group of people based on their characteristics.

Ensuring fairness is key before deploying a model in production. For example, in the US, the government prohibited discrimination in credit and real-estate transactions with fair lending laws like the Equal Credit Opportunity Act (ECOA) and the Fair Housing Act (FHAct).

The Equal Employment Opportunity Commission (EEOC) acknowledges 12 factors of discrimination:[1] age, disability, equal pay/compensation, genetic information, harassment, national origin, pregnancy, race/color, religion, retaliation, sex, sexual harassment. These are what we call protected attributes.

Fairness Metrics

Fiddler provides the following fairness metrics:

  • Disparate Impact
  • Group Benefit
  • Equal Opportunity
  • Demographic Parity

The choice of the metric is use case-dependent. An important point to make is that it's impossible to optimize all the metrics at the same time. This is something to keep in mind when analyzing fairness metrics.

Disparate Impact

Disparate impact is a form of indirect and unintentional discrimination, in which certain decisions disproportionately affect members of a protected group.

Mathematically, disparate impact compares the pass rate of one group to that of another.

The pass rate is the rate of positive outcomes for a given group. It's defined as follows:

pass rate = passed / (num of ppl in the group)

Disparate impact is calculated by:

DI = (pass rate of group 1) / (pass rate of group 2)

Groups 1 and 2 are interchangeable. Therefore, the following formula can be used to calculate disparate impact:

DI = min{pr_1, pr_2} / max{pr_1, pr_2}.

The disparate impact value is between 0 and 1. The Four-Fifths rule states that the disparate impact has to be greater than 80%.

For example:

pass-rate_1 = 0.3, pass-rate_2 = 0.4, DI = 0.3/0.4 = 0.75
pass-rate_1 = 0.4, pass-rate_2 = 0.3, DI = 0.3/0.4 = 0.75

📘

Info

Disparate impact is the only legal metric available. The other metrics are not yet codified in US law.

Demographic Parity

Demographic parity states that the proportion of each segment of a protected class should receive the positive outcome at equal rates.

Mathematically, demographic parity compares the pass rate of two groups.

The pass rate is the rate of positive outcome for a given group. It is defined as follow:

pass rate = passed / (num of ppl in the group)

If the decisions are fair, the pass rates should be the same.

Group Benefit

Group benefit aims to measure the rate at which a particular event is predicted to occur within a subgroup compared to the rate at which it actually occurs.

Mathematically, group benefit for a given group is defined as follows:

Group Benefit = (TP+FP) / (TP + FN).

Group benefit equality compares the group benefit between two groups. If the two groups are treated equally, the group benefit should be the same.

Equal Opportunity

Equal opportunity means that all people will be treated equally or similarly and not disadvantaged by prejudices or bias.

Mathematically, equal opportunity compares the true positive rate (TPR) between two groups. TPR is the probability that an actual positive will test positive. The true positive rate is defined as follows:

TPR = TP/(TP+FN)

If the two groups are treated equally, the TPR should be the same.

Intersectional Fairness

We believe fairness should be ensured to all subgroups of the population. We extended the classical metrics (which are defined for two classes) to multiple classes. In addition, we allow multiple protected features (e.g. race and gender). By measuring fairness along overlapping dimensions, we introduce the concept of intersectional fairness.

To understand why we decided to go with intersectional fairness, we can take a simple example. In the figure below, we observe that equal numbers of black and white people pass. Similarly, there is an equal number of men and women passing. However, this classification is unfair because we don’t have any black women and white men that passed, and all black men and white women passed. Here, we observe bias within subgroups when we take race and gender as protected attributes.

175175

The EEOC provides examples of past intersectional discrimination/harassment cases.[2]

In the context of intersectional fairness, we compute the previous fairness metrics for each subgroup. The values should be similar among subgroups. To get an overall idea if there exists some bias in the model, we display the min-max ratio, which takes the minimum value divided by the maximum value for a given metric. If this ratio is close to 1, then the metric is very similar among subgroups. The figure below gives an example for two protected attributes, Gender and Education, and the Equal Opportunity metric.

300300

For the Disparate Impact metric, we don’t display a min-max ratio but an absolute min. The intersectional version of this metric is a little different. For a given subgroup, take all possible permutations of 2 subgroups and then display the minimum. If the absolute minimum is greater than 80%, then all combinations are greater than 80%.

Model Behavior

In addition to the fairness metrics, we provide information about model outcomes and model performance for each subgroup. In the platform, you can see a visualization like the one below by default. You have the option to display the same numbers in a table for a deeper analysis.

300300 300300

Dataset Fairness

Finally, we provide a section for dataset fairness, with a mutual information matrix and a label distribution. Note that this is a pre-modeling step.

300300

Mutual information gives information about existing dependence in your dataset between the protected attributes and the remaining features. We are displaying Normalized Mutual Information (NMI). This metric is symmetric, and has values between 0 and 1, where 1 means perfect dependency.

300300

For more details about the implementation of the intersectional framework, please refer to this research paper.

Reference

[^1]: USEEOC article on Discrimination By Type
[^2]: USEEOC article on Intersectional Discrimination/Harassment


Did this page help you?