Final Project

Mutual information, Fairness and Robustness

Based on FR-Train (Y. Roh et al., 2020): a mutual information-based approach to training fair and robust models

What is Fairness?

(Li, Qi, Liu, et al., 2022)

Figure: Relationship between different aspects of AI trustworthiness

An Interesting Example: Gender Classification

Figure: Overall accuracy

http://gendershades.org/

Breakdown

Figure: Accuracy by skin color and gender

Fairness

  • Hiring / Admission
  • Risk management
  • Face recognition for law enforcement

Under-represented groups are more likely to be misclassified or experience systematic disadvantage.

Goal of Fairness: Eliminate or mitigate the effects of biases.

Metric for Fairness

Definition: Disparate Impact (DI):

The ratio of the probability of a positive outcome for one group to the probability of a positive outcome for the other group, with the smaller probability placed in the numerator so that DI lies between 0 and 1.

Example: if our model predicts the probability of getting a loan to be 0.8 for a white applicant and 0.4 for a Black applicant, then DI = 0.4 / 0.8 = 0.5.
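
In symbols, with a binary sensitive attribute $S$ and predicted label $\hat{Y}$ (the same notation used later in the talk), one standard way to write this metric, keeping the smaller probability in the numerator, is

\[
\mathrm{DI} \;=\; \frac{\min\bigl(P(\hat{Y}=1 \mid S=0),\, P(\hat{Y}=1 \mid S=1)\bigr)}{\max\bigl(P(\hat{Y}=1 \mid S=0),\, P(\hat{Y}=1 \mid S=1)\bigr)} \;\in\; [0,1],
\]

so in the loan example above DI = 0.4 / 0.8 = 0.5, and DI = 1 corresponds to equal positive rates for the two groups.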

Robustness

The definition of contamination varies in different contexts. Here we consider a specific case of label flipping.

Name    Gender    Age    Give Loan
John    Male      25     No
Kate    Female    22     Yes
Brian   Male      20     No
...     ...       ...    ...

Robustness

Let $y_i$ denote the true label of sample $i$ and $\tilde{y}_i$ the observed (possibly flipped) label. Then $\{\tilde{y}_i\}_{i=1}^{n}$ is an $\epsilon$-replacement of $\{y_i\}_{i=1}^{n}$ if

\[
\frac{1}{n} \sum_{i=1}^{n} \mathbb{1}\{\tilde{y}_i \neq y_i\} \;\leq\; \epsilon .
\]

Under such contaminated conditions, we want our model to still perform relatively well.
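
As a concrete illustration, here is a minimal Python sketch of producing an $\epsilon$-replacement of a binary label vector by flipping a uniformly random fraction of the labels; the uniform choice of which labels to flip is an assumption made for illustration (an adversary could flip labels in a more targeted way).

import numpy as np

def epsilon_replace(y, eps, rng=None):
    """Flip at most a fraction eps of the binary labels in y."""
    rng = np.random.default_rng(rng)
    y = np.asarray(y).copy()
    n_flip = int(np.floor(eps * len(y)))            # number of labels to flip
    idx = rng.choice(len(y), size=n_flip, replace=False)
    y[idx] = 1 - y[idx]                             # flip 0 <-> 1
    return y

# Example: flip 10% of 1000 labels.
y_clean = np.random.default_rng(0).integers(0, 2, size=1000)
y_noisy = epsilon_replace(y_clean, eps=0.1, rng=0)
print((y_clean != y_noisy).mean())                  # ~0.1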

Observation: Pursuing fairness can compromise robustness. (H. Xu et al., 2021)

Recap of Definition of Mutual Information:

Given two random variables $X$ and $Y$, the mutual information between $X$ and $Y$ is defined as

\[
I(X;Y) \;=\; \sum_{x,y} p(x,y) \log \frac{p(x,y)}{p(x)\,p(y)} .
\]

It is a measure of dependence between the two random variables. If $I(X;Y) = 0$, then $X$ and $Y$ are independent.
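
The link between zero mutual information and independence follows from writing mutual information as a KL divergence (the standard argument, see Cover and Thomas, 2006):

\[
I(X;Y) \;=\; D_{\mathrm{KL}}\bigl(p(x,y) \,\|\, p(x)\,p(y)\bigr) \;\ge\; 0,
\]

with equality if and only if $p(x,y) = p(x)\,p(y)$ for every $(x,y)$, i.e. exactly when $X$ and $Y$ are independent.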

Synthetic dataset and problem

(Y. Roh et al., 2020)

  • Two non-sensitive features $x_1$ and $x_2$.
  • One binary sensitive feature $s$.
  • One binary label $y$.
  • Some data points have their labels flipped (a toy generation sketch follows).
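
The exact generation procedure of Roh et al. (2020) is not reproduced here; the following is a hypothetical Python sketch of a dataset with the same structure (two Gaussian features, a binary sensitive attribute correlated with the label, a binary label, and an $\epsilon$ fraction of flipped labels), just to make the setting concrete.

import numpy as np

def make_synthetic(n=2000, eps=0.1, seed=0):
    """Toy dataset: features x1, x2, sensitive attribute s, clean label y, noisy label y_tilde."""
    rng = np.random.default_rng(seed)
    # Two non-sensitive features.
    x = rng.normal(size=(n, 2))
    # Label depends on the features through a linear rule plus noise.
    y = (x[:, 0] + x[:, 1] + rng.normal(scale=0.5, size=n) > 0).astype(int)
    # Sensitive attribute is correlated with the label (source of unfairness).
    s = (rng.random(n) < np.where(y == 1, 0.7, 0.3)).astype(int)
    # Flip roughly an eps fraction of the labels (the epsilon-replacement above).
    flip = rng.random(n) < eps
    y_tilde = np.where(flip, 1 - y, y)
    return x, s, y, y_tilde

x, s, y, y_tilde = make_synthetic()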

FR-Train architecture

(Y. Roh et al., 2020)

  • Similar to generative adversarial networks (GANs) (Goodfellow et al., 2014).
  • The generator provides label predictions; the fairness discriminator uses the predicted label to predict the sensitive attribute.
  • Intuition: if the discriminator can perform well, then the model is not fair (a minimal sketch of this adversarial component follows).
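
A minimal PyTorch sketch of this adversarial fairness component, assuming linear models, a binary sensitive attribute, and a simple alternating training loop; the learning rates and the weight lambda_f are illustrative choices, and this is not the authors' exact implementation or loss.

import torch
import torch.nn as nn

# Toy data: x are features, s is the binary sensitive attribute,
# y is the (possibly noisy) binary label.
x = torch.randn(512, 2)
s = torch.randint(0, 2, (512,)).float()
y = torch.randint(0, 2, (512,)).float()

generator = nn.Linear(2, 1)   # classifier: produces a logit for y
fair_disc = nn.Linear(1, 1)   # fairness discriminator: predicts s from the prediction
opt_g = torch.optim.Adam(generator.parameters(), lr=1e-2)
opt_d = torch.optim.Adam(fair_disc.parameters(), lr=1e-2)
bce = nn.BCEWithLogitsLoss()
lambda_f = 0.5                # illustrative weight on the fairness term

for step in range(200):
    # 1) Train the fairness discriminator to recover s from the prediction.
    y_hat = torch.sigmoid(generator(x)).detach()
    loss_d = bce(fair_disc(y_hat).squeeze(1), s)
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    # 2) Train the generator: fit y while making the discriminator fail.
    logit = generator(x)
    y_hat = torch.sigmoid(logit)
    loss_pred = bce(logit.squeeze(1), y)
    loss_fair = bce(fair_disc(y_hat).squeeze(1), s)
    loss_g = loss_pred - lambda_f * loss_fair   # adversarial sign flip
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()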

FR-Train architecture (cont.)

  • The robust discriminator gives feedback to the generator through its loss and a sample-reweighting process.
  • It requires a clean dataset to train the robust discriminator (sometimes unavailable).

Details of robust discriminator

  • It acts more like a "privacy" mechanism.
  • The robust discriminator predicts the probability that a sample comes from the training dataset (a toy sketch of turning this into sample weights follows).
  • The generator needs to mask the individual information of each sample.
  • No theoretical guarantee is provided.
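
One plausible way to turn such a discriminator into sample weights, shown here only as a hypothetical Python sketch (the actual reweighting rule in Roh et al. (2020) may differ): assume we can read off, per sample, a probability p_clean that the sample looks like clean data, and up-weight accordingly.

import numpy as np

def reweight(p_clean, floor=1e-3):
    """Turn a per-sample 'probability of being clean' into
    training weights that average to 1."""
    w = np.clip(p_clean, floor, 1.0)   # avoid zero weights
    return w / w.mean()                # keep the overall loss scale

# p_clean would come from the robust discriminator; here it is made up.
p_clean = np.array([0.95, 0.90, 0.10, 0.85])
print(reweight(p_clean))               # the suspicious sample gets a small weight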

Cross entropy and mutual information

Theorem 1 (Y. Roh et al., 2020)

Suppose $S$ (the sensitive attribute) is a discrete random variable and $\hat{Y}$ (the model's output) can be continuous or discrete. The mutual information $I(S;\hat{Y})$ can be shown to be the value of an optimization problem over discriminators.

The term being maximized is the cross entropy of the discriminator with the sign flipped. Thus, by minimizing the discriminator loss, we can estimate the mutual information $I(S;\hat{Y})$.
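
One standard way to write the equivalence described above, using $S$ for the sensitive attribute and $\hat{Y}$ for the model output (the exact statement in Roh et al. (2020) may be phrased differently):

\[
I(S;\hat{Y}) \;=\; H(S) \;+\; \max_{D}\; \mathbb{E}_{(s,\hat{y})}\bigl[\log D(s \mid \hat{y})\bigr],
\]

where $D(\cdot \mid \hat{y})$ ranges over conditional distributions on the values of $S$. The expectation being maximized is exactly the negative cross-entropy loss of a discriminator that predicts $S$ from $\hat{Y}$; it is maximized at the true posterior $D(s \mid \hat{y}) = p(s \mid \hat{y})$, where it equals $-H(S \mid \hat{Y})$. Since $H(S)$ is a constant of the data, minimizing the discriminator's cross-entropy loss yields an estimate of $I(S;\hat{Y})$.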

Discriminator loss → mutual information → fairness: minimizing the discriminator loss lets us estimate and reduce $I(S;\hat{Y})$, which in turn promotes fairness.

Structure of our experiments

(i) Can discriminator losses converge to mutual information efficiently?

(ii) Can we use mutual information as loss for the generator directly?

(iii) Can mutual information guarantee good fairness performance even when the dataset is unbalanced?

Exp 1: Original Model - Convergence of discriminator losses

kNN estimator for mutual information (Kraskov et al., 2004)
The knncmi package (Mesner et al., 2019) in Python
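
Rather than assuming the exact knncmi API, here is a small sketch using scikit-learn's mutual_info_classif, which implements a related kNN-based estimator in the spirit of Kraskov et al. (2004); estimating $I(S;\hat{Y})$ between the model's continuous output and the binary sensitive attribute looks roughly like this.

import numpy as np
from sklearn.feature_selection import mutual_info_classif

rng = np.random.default_rng(0)
s = rng.integers(0, 2, size=1000)                # binary sensitive attribute
logits = 0.8 * s + rng.normal(size=1000)         # model outputs correlated with s

# Estimate I(logits; s) in nats with a kNN-based estimator
# (continuous feature, discrete target).
mi = mutual_info_classif(logits.reshape(-1, 1), s,
                         discrete_features=False, n_neighbors=3, random_state=0)
print(mi[0])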

Exp 1: Original model - DI and accuracy vs. iteration

The fluctuation of DI and accuracy is due to the competitive nature of the generator-discriminator game. Can we instead use mutual information directly as a loss for the generator?

Exp 2: Mutual information as loss for the generator

How can we estimate the mutual information while keeping the estimate differentiable?

  • Naive approach: replace the hard, indicator-based plug-in estimate of $I(\hat{Y};S)$ with a smooth approximation based on sigmoid functions, so it can be used as a differentiable loss (a sketch follows the figure below).

Figure: Sigmoid functions
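
A hypothetical PyTorch sketch of such a differentiable plug-in estimate, assuming binary $s$ and $\hat{y}$ and using the sigmoid output as a soft assignment; this is one way to realize the idea, not necessarily the exact estimator used in the experiments.

import torch

def soft_mutual_information(logits, s, eps=1e-8):
    """Differentiable plug-in estimate of I(Y_hat; S) for binary Y_hat and S.
    The sigmoid of the logit is used as a soft probability of predicting 1."""
    q = torch.sigmoid(logits)                                  # soft P(y_hat = 1) per sample
    s = s.float()
    p_y = torch.stack([1 - q.mean(), q.mean()])                # marginal of y_hat
    p_s = torch.stack([1 - s.mean(), s.mean()])                # marginal of s
    # Soft joint distribution over (y_hat, s), shape (2, 2).
    joint = torch.stack([
        torch.stack([((1 - q) * (1 - s)).mean(), ((1 - q) * s).mean()]),
        torch.stack([(q * (1 - s)).mean(),        (q * s).mean()]),
    ])
    ratio = joint / (p_y.unsqueeze(1) * p_s.unsqueeze(0) + eps)
    return (joint * torch.log(ratio + eps)).sum()

# Example: use as an extra loss term for the generator.
logits = torch.randn(256, requires_grad=True)
s = torch.randint(0, 2, (256,))
mi = soft_mutual_information(logits, s)
mi.backward()                                                  # gradients flow to the logits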

Exp 2: Naive solution - DI and accuracy vs. iteration

Exp 2: Comparison of performance

Exp 3: Problem with unbalanced datasets

  • In extreme cases, a small mutual information does not necessarily mean good fairness performance (see the bound after the figure below).

Figure: Mutual information vs. DI for 1000 randomly generated samples
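
A one-line reason why this can happen: mutual information is bounded by the entropy of the sensitive attribute, so on a very unbalanced dataset it is small no matter how unfair the classifier is:

\[
I(\hat{Y};S) \;\le\; H(S) \;=\; -p\log p - (1-p)\log(1-p),
\]

where $p$ is the proportion of the minority sensitive group; as $p \to 0$, $H(S) \to 0$, so $I(\hat{Y};S)$ is forced toward zero even when the disparate impact is far from 1.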

Exp 3: Dataset generation

Figure: Visualization of datasets under different parameters: one parameter controls the proportion of the positive group, the other is a knob controlling the proportion of the sensitive group.

Exp 3: Performance on unbalanced datasets

Work in progress / Future work

  • Normalizing the mutual information to mitigate the effect of unbalanced datasets (one candidate normalization is sketched below).
  • Benchmarking on unbalanced datasets against other fairness methods.
  • Theoretical guarantee for the robust discriminator.
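
One common normalization, listed here only as a candidate for the first item above (whether it behaves well in this setting is exactly the open question):

\[
\tilde{I}(\hat{Y};S) \;=\; \frac{I(\hat{Y};S)}{H(S)} \;\in\; [0,1],
\]

which divides out the entropy of the sensitive attribute and so removes the vanishing upper bound on unbalanced data.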

References

[1] Bo Li, Peng Qi, Bo Liu, Shuai Di, Jingen Liu, Jiquan Pei, Jinfeng Yi, and Bowen Zhou. Trustworthy AI: From principles to practices, 2022.
[2] Han Xu, Xiaorui Liu, Yaxin Li, Anil Jain, and Jiliang Tang. To be robust or to be fair: Towards fairness in adversarial training. In International Conference on Machine Learning (ICML), PMLR, 18–24 Jul 2021.
[3] Yuji Roh, Kangwook Lee, Steven Whang, and Changho Suh. FR-Train: A mutual information-based approach to fair and robust training. In International Conference on Machine Learning (ICML), PMLR, 13–18 Jul 2020.
[4] Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial networks, 2014.
[5] Thomas M. Cover and Joy A. Thomas. Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing). Wiley-Interscience, USA, 2006.

[6] Octavio César Mesner and Cosma Rohilla Shalizi. Conditional mutual information estimation for mixed discrete and continuous variables with nearest neighbors, 2019.
[7] Alexander Kraskov, Harald Stögbauer, and Peter Grassberger. Estimating mutual information. Phys. Rev. E, 69:066138, Jun 2004.

Thank you!

Good evening, everyone. Today I'm going to present this paper by Yuji Roh et al. (2020). FR-Train is a mutual information-based approach to training fair and robust models. I'm also going to present some of my own experiments; hopefully you will find them interesting.

So, for those who are not familiar with fairness and robustness: what are fairness and robustness? In a broad sense, both are related to the trustworthiness of AI, which has been a hot topic in recent years. Here is a graph by Li et al. (2022) showing the relationship between different aspects of AI trustworthiness. As we all know, AI and machine learning models are now very powerful, and we wish to deploy them in many different scenarios. However, we also want to make sure the AI is trustworthy. This includes many aspects, as we can see in this graph. Technically, we want AI to be reliable, and robustness is one of the key aspects of reliability. We also want AI to follow human values, which is where fairness comes in.

The idea of this comes from the connection between differential privacy and robust statistics.

To better understand why fairness is important, let's look at this example. Here are three state-of-the-art face recognition models, and the task is to predict the gender of a person based on a face image. We can see that the overall accuracy of these models is really high. But can we simply say these models are good enough and put them into production?

The answer is no. Let's break down the performance of these models by skin color and gender and see what happens. They predict well on males and on lighter-skinned females, but much worse on darker-skinned females. The gap in accuracy is significant, so we cannot say these models are fair towards different groups.

Such observations can also be found in ..., where under-represented groups are more likely to be misclassified or to experience systematic disadvantage. What we want to achieve is to eliminate or mitigate this kind of inherent bias.

First, I want to introduce the metric we use for fairness in this project. Let's say we have a sensitive attribute S and a binary label Y. We define ... We put the smaller probability in the numerator to make sure the value is between 0 and 1.

OK, that's it for fairness. Now let's talk about robustness. Overall, robustness is about how well the model performs under different conditions, like what if there are outliers in the dataset, or what if the dataset is contaminated. The definition of contamination varies across contexts; in the paper, the authors consider the following setting, where some of the labels are artificially flipped.

Here we show a formal definition of such label flipping using the epsilon-replacement notation. We say our observed labels are an epsilon-replacement of the true labels if the number of flipped labels is at most epsilon times the total number of samples. Under such contaminated conditions, we want our model to still perform relatively well. It's also important to study fairness and robustness together, because pursuing fairness can compromise robustness, according to the paper by Han Xu et al. (2021).

You may ask: how are these things related to our course, information theory? First, let's see how mutual information is related to fairness. Here is a quick recap of the definition of mutual information; I'm not going into the details. The idea is that mutual information is a measure of dependence between two random variables: if the mutual information is 0, then the two random variables are independent.

Now we are ready to get into the details of the model, FR-Train. Let's take a look at the synthetic data we are using and the problem we want to solve.

The architecture of FR-Train consists of a generator that makes predictions and two discriminators, one for fairness and one for robustness. During training, the generator and the two discriminators compete with each other, really similar to GANs. Let's first look at the fairness discriminator, which is basically the first row of the architecture. The fairness discriminator takes the logit as input and predicts the sensitive attribute. The intuition is that if the discriminator can perform well, then the model is not fair.

The robust discriminator is a bit different. It achieves robustness both by reweighting samples and by giving its loss as feedback to the generator. This combines two different ideas of robust training: one is to simply get rid of the contaminated samples, and the other is to make the model capable of handling contaminated samples. It also requires a clean dataset to train the robust discriminator, which is not really realistic in practice; that's probably one of the major drawbacks of this model.

But anyway, we will take a look at the details of this discriminator.

OK, so that wraps up the architecture of FR-Train. You may ask again: where is the mutual information we just talked about, and how do we minimize it? That's the second connection the paper makes with information theory: the relationship between cross entropy and mutual information. Here's the key theorem of the paper.

Based on what we just learned, here is a summary of the key idea of the original paper.