Ethics Capstone: Auditing an AI System
Whether you're exploring the cutting edge of QuantumML or building traditional ETL pipelines, data engineering without ethical auditing is a liability. It's our responsibility to ensure algorithms don't amplify historical discrimination.
The Problem: Proxies and Historical Data
Removing a column like "Race" or "Gender" from your dataset does not make your model fair. This approach, known as "fairness through blindness," fails because of proxy variables: a machine learning model can easily infer demographic information from zip codes, browsing habits, or educational history.
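One quick way to check for proxies is to "probe" the remaining features: if a simple classifier can recover the dropped sensitive attribute from them, blindness has already failed. The sketch below uses an entirely synthetic dataset; the column names (`zip_group`, `income`) and the correlation strengths are illustrative assumptions, not real audit data.

```python
# Minimal sketch: probing for proxy variables after "dropping" a sensitive column.
# The data, column names, and synthetic correlations are illustrative assumptions.
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(42)
n = 5_000
# Hypothetical sensitive attribute and correlated proxies.
sensitive = rng.integers(0, 2, n)
zip_group = np.where(rng.random(n) < 0.8, sensitive, 1 - sensitive)  # strong proxy
income = rng.normal(50_000 + 10_000 * sensitive, 15_000, n)          # weaker proxy

X = pd.DataFrame({"zip_group": zip_group, "income": income})  # sensitive column removed

# If the remaining features predict the sensitive attribute well,
# "fairness through blindness" has already failed.
X_tr, X_te, s_tr, s_te = train_test_split(X, sensitive, random_state=0)
probe = LogisticRegression(max_iter=1_000).fit(X_tr, s_tr)
auc = roc_auc_score(s_te, probe.predict_proba(X_te)[:, 1])
print(f"Sensitive attribute recoverable with AUC = {auc:.2f}")  # well above 0.5 here
```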
Measuring Bias: Fairness Metrics
To audit a model, we actively use sensitive attributes to measure disparities. Common metrics include the following (each is computed in the sketch after this list):
- Disparate Impact Ratio: Compares the positive-outcome rate of the unprivileged group to that of the privileged group.
- Equalized Odds: Ensures that True Positive Rates and False Positive Rates are equal across demographics.
- Demographic Parity: The likelihood of a positive outcome should be identical regardless of demographic membership.
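Here is a minimal sketch computing all three metrics directly with NumPy. The prediction arrays, the group encoding (0 = unprivileged, 1 = privileged), and the amount of injected bias are illustrative assumptions.

```python
# Minimal sketch of the three fairness metrics above, on synthetic predictions.
import numpy as np

rng = np.random.default_rng(7)
n = 10_000
group = rng.integers(0, 2, n)
y_true = rng.integers(0, 2, n)
# Biased predictions: the privileged group receives positive outcomes more often.
y_pred = (rng.random(n) < np.where(group == 1, 0.6, 0.4)).astype(int)

priv, unpriv = group == 1, group == 0

def positive_rate(pred, mask):
    return pred[mask].mean()

# Disparate Impact Ratio: P(pred=1 | unprivileged) / P(pred=1 | privileged).
di_ratio = positive_rate(y_pred, unpriv) / positive_rate(y_pred, priv)

# Demographic Parity: gap in positive-outcome rates between groups.
dp_gap = abs(positive_rate(y_pred, priv) - positive_rate(y_pred, unpriv))

# Equalized Odds: compare True Positive Rates and False Positive Rates per group.
def tpr_fpr(pred, true, mask):
    tpr = pred[mask & (true == 1)].mean()
    fpr = pred[mask & (true == 0)].mean()
    return tpr, fpr

tpr_p, fpr_p = tpr_fpr(y_pred, y_true, priv)
tpr_u, fpr_u = tpr_fpr(y_pred, y_true, unpriv)

print(f"Disparate impact ratio: {di_ratio:.2f}")  # below 0.8 is a common red flag
print(f"Demographic parity gap: {dp_gap:.2f}")
print(f"Equalized odds gaps: TPR {abs(tpr_p - tpr_u):.2f}, FPR {abs(fpr_p - fpr_u):.2f}")
```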
Explainability (XAI): SHAP and LIME
When an audit flags a model, you need to know why. Tools like SHAP (SHapley Additive exPlanations) treat the model as a cooperative game, assigning a contribution value to each feature for every prediction. LIME (Local Interpretable Model-agnostic Explanations) takes a complementary approach, fitting a simple interpretable surrogate model around each individual prediction. Both allow auditors to see exactly which variables are driving biased outcomes.
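A minimal sketch of that workflow with the shap library and a scikit-learn model follows. The dataset and feature names are illustrative assumptions, and a regressor is used so the SHAP output is a simple samples-by-features array.

```python
# Minimal sketch: per-feature SHAP attributions for a tabular model.
# Dataset, feature names, and the synthetic target are illustrative assumptions.
import numpy as np
import pandas as pd
import shap
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
n = 2_000
X = pd.DataFrame({
    "income": rng.normal(50_000, 15_000, n),
    "zip_group": rng.integers(0, 5, n),      # potential proxy variable
    "years_employed": rng.integers(0, 30, n),
})
# Synthetic "approval score" that leans heavily on the proxy.
y = 0.3 * (X["income"] / 1_000) + 5.0 * X["zip_group"] + rng.normal(0, 5, n)

model = RandomForestRegressor(random_state=0).fit(X, y)

# TreeExplainer assigns each feature a Shapley contribution per prediction.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)  # shape: (n_samples, n_features)

# Mean |SHAP| per feature: a large value on zip_group flags a proxy-driven model.
importance = pd.Series(np.abs(shap_values).mean(axis=0), index=X.columns)
print(importance.sort_values(ascending=False))
```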
❓ Frequently Asked Questions (AI Auditing)
What is an AI Ethics Audit?
An AI Ethics Audit is a structured process to evaluate a machine learning model or data pipeline for biases, fairness, transparency, and regulatory compliance. It involves inspecting training data, evaluating fairness metrics, and documenting mitigation strategies.
How do you mitigate bias in Machine Learning?
Bias mitigation can occur at three stages (the first is sketched after this list):
- Pre-processing: Reweighing or resampling the training data to balance representation.
- In-processing: Adding fairness constraints or an adversarial debiasing objective during model training.
- Post-processing: Adjusting the decision thresholds for different groups to achieve equalized outcomes.
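As an example of the pre-processing stage, here is a minimal sketch of reweighing in the style of Kamiran & Calders: each (group, label) combination is weighted so that group membership and the outcome look statistically independent to the model. The arrays and bias rates are synthetic, illustrative assumptions.

```python
# Minimal sketch of pre-processing mitigation via reweighing:
# each (group, label) combination gets weight P(group) * P(label) / P(group, label).
import numpy as np

rng = np.random.default_rng(1)
n = 10_000
group = rng.integers(0, 2, n)
# Biased labels: the privileged group (1) has more positive outcomes.
y = (rng.random(n) < np.where(group == 1, 0.7, 0.4)).astype(int)

weights = np.empty(n)
for g in (0, 1):
    for label in (0, 1):
        mask = (group == g) & (y == label)
        expected = (group == g).mean() * (y == label).mean()  # P(g) * P(y)
        observed = mask.mean()                                # P(g, y)
        weights[mask] = expected / observed

# Pass `weights` as sample_weight when fitting, e.g.:
#   LogisticRegression().fit(X, y, sample_weight=weights)
# After reweighing, the weighted positive rate is equal across groups:
for g in (0, 1):
    m = group == g
    print(f"group {g}: weighted positive rate = "
          f"{np.average(y[m], weights=weights[m]):.2f}")
```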
Why can't we just remove sensitive attributes from the data?
Removing sensitive attributes (like gender or race) leads to "fairness through blindness." Algorithms will still learn these biases through proxy variables (e.g., zip code or income). Keeping sensitive attributes during testing allows auditors to actively measure and correct disparities.
