Clarification about bias checking and latent space

Jaspier · June 21, 2023, 1:32pm

Hello,

In the C2W2 assingment, it is showing a technique to identify bias. When the encoder outputs the latent space with the means and stds for each feature, it is calculated the covariance matrix. Then the covariance matrix is used to identify the risk of bias.

I would like to know better how is the correct interpretation after generating the images with the decoder. Should the images show the same pattern for covariance as we have in the original covariance matrix ? But if we want our generations to “escape” those patterns for the sake of diversity?

In essence, my question is how to interpret the risk of bias having these informations: the latent space, the matrix of covariance, the means and stds for the features, and the generated image set.

Thanks in advance!

carlosrl · August 31, 2023, 8:44pm

Hi @Jaspier , here are some thoughts about your questions.

The latent space is a high-dimensional space representing the data’s underlying distribution. It is possible that the latent space may be biased, for instance, if there are more points in the latent space that correspond to images of one protected class than another. This could lead to the generator generating more images of that protected class, even if the discriminator is not biased.
The matrix of covariance is a measure of the correlation between different features in the data. It is possible that the matrix of covariance may be biased, for instance, if there is a stronger correlation between two features in one protected class than in another. Again, this could lead to the generator generating images with more correlated features, even if the discriminator is not biased.
The means and standards for the features are the average and standard deviation of each feature in the data. It is possible that the means and stds may be biased, for instance, if the average age of images in one protected class is different from the average age of images in another protected class. This could lead to the generator generating images that are older or younger, even if the discriminator is not biased.
The generated image set is the set of images that the generator has generated. It is possible that the generated image set may be biased, for instance, if there are more images of one protected class than another. This could be due to the latent space, the matrix of covariance, the means and standards, or the discriminator.

So, there are a number of ways that bias can be introduced into GANs.

Topic		Replies	Views
C5W2 Assignment 1 Debiasing - About "orthogonal axis" Sequence Models	3	550	September 30, 2021
Relating C1W4 lab back to the paper Build Basic Generative Adversarial Networks week-4	1	510	July 11, 2022
Build Better Generative Adversarial Networks (GANs): week2 Assignment, covariance Build Basic Generative Adversarial Networks week-1	3	523	March 7, 2022
Level of diversity within a class versus bias Build Better Generative Adversarial Networks week-2	1	504	June 6, 2022
Week 1 Programming Assignment - Conceptual Question Build Basic Generative Adversarial Networks week-1	8	735	August 25, 2021

Clarification about bias checking and latent space

Related topics