A MoE Approach to Machine Unlearning

Overview

This repository contains the implementation of the Deep Learning and Applied AI 2024 project: A Mixture of Experts (MoE) Approach to Machine Unlearning. The project proposes an efficient unlearning mechanism by leveraging the modular nature of MoE architectures combined with ideas from the SISA framework.

Introduction

Machine unlearning is the process of removing the influence of specific training data from a trained model to address privacy, security, and regulatory concerns. Traditional methods often require retraining from scratch, which is computationally expensive. This project introduces an approach that:

- Utilizes a Mixture of Experts (MoE) model to distribute learning across specialized sub-networks.
- Identifies and selectively retrains only the most relevant model components to facilitate efficient unlearning.
- Employs a targeted unlearning loss function to reinforce forgetting.
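
As a rough illustration of how the "most relevant components" for a class might be found, the sketch below counts how often each expert is selected for samples of the class to forget. The function name `most_activated_experts` and the shape of the routing log are assumptions made for this example; they are not taken from the repository code.

```python
from collections import Counter

# Hypothetical helper, not part of the repository: the routing log format
# (label, selected_expert_ids) is an assumption made for illustration.
def most_activated_experts(routing_log, target_class, n_experts_to_pick):
    """Return the experts the router chose most often for `target_class`."""
    counts = Counter()
    for label, expert_ids in routing_log:
        if label == target_class:
            counts.update(expert_ids)
    return [expert for expert, _ in counts.most_common(n_experts_to_pick)]

# Toy routing log: class 3 is mostly routed to experts 1 and 2.
log = [(3, (1, 2)), (3, (1, 2)), (3, (1, 0)), (5, (0, 3)), (5, (0, 2))]
print(most_activated_experts(log, target_class=3, n_experts_to_pick=2))  # [1, 2]
```

In this toy log, expert 1 is selected three times and expert 2 twice for class 3, so those two would be the candidates for retraining.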

Methodology

The method combines principles from the SISA (Sharded, Isolated, Sliced, and Aggregated) training framework with an MoE model:

- MoE Layer: Uses multiple convolutional expert networks, each specializing in different feature sets.
- Sparsity Mechanism: A router network selects a subset of experts to process each input, which reduces the computation needed per input.
- Targeted Expert Identification: When a request to forget a specific class is received, the experts most frequently activated for that class are identified.
- Selective Retraining: Instead of retraining the full model, only the identified experts and the router network are updated.
- Custom Unlearning Loss Function: Encourages the model to forget target class information by penalizing correct classifications of that class.
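
The steps above can be sketched end to end in PyTorch. This is a minimal toy under stated assumptions, not the repository's implementation: `TinyMoE`, `unlearning_loss`, and `freeze_all_but` are hypothetical names, the input size (3x8x8) and expert count are arbitrary, and the loss form (cross-entropy on retained samples plus a `-log(1 - p)` penalty on the forgotten class) is one plausible reading of "penalizing correct classifications". The learning rate and number of retraining passes reuse the values reported in the Results section.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# All names below (TinyMoE, unlearning_loss, freeze_all_but) are illustrative
# assumptions, not the repository's actual classes or functions.

class TinyMoE(nn.Module):
    """Toy MoE classifier: a router weights the top-k convolutional experts."""

    def __init__(self, num_experts=4, top_k=2, num_classes=10):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Sequential(nn.Flatten(), nn.Linear(3 * 8 * 8, num_experts))
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
                          nn.AdaptiveAvgPool2d(1), nn.Flatten())
            for _ in range(num_experts)
        ])
        self.head = nn.Linear(8, num_classes)

    def forward(self, x):
        gate = F.softmax(self.router(x), dim=-1)          # (batch, experts)
        weights, chosen = gate.topk(self.top_k, dim=-1)   # (batch, top_k)
        # Every expert is evaluated here for brevity; a truly sparse layer
        # would run only the chosen ones.
        outs = torch.stack([e(x) for e in self.experts], dim=1)
        mask = torch.zeros_like(gate).scatter(1, chosen, weights)
        mixed = (mask.unsqueeze(-1) * outs).sum(dim=1)
        return self.head(mixed), chosen

def unlearning_loss(logits, labels, forget_class):
    """Assumed loss: cross-entropy on retained samples, plus -log(1 - p) on
    the forgotten class so that confident correct predictions are penalized."""
    is_forget = labels == forget_class
    loss = logits.new_zeros(())
    if (~is_forget).any():
        loss = loss + F.cross_entropy(logits[~is_forget], labels[~is_forget])
    if is_forget.any():
        p = F.softmax(logits, dim=-1)[is_forget, forget_class]
        loss = loss - torch.log1p(-p.clamp(max=1 - 1e-6)).mean()
    return loss

def freeze_all_but(model, expert_ids):
    """Leave only the router and the listed experts trainable."""
    for param in model.parameters():
        param.requires_grad = False
    trainable = [model.router] + [model.experts[i] for i in expert_ids]
    for module in trainable:
        for param in module.parameters():
            param.requires_grad = True

torch.manual_seed(0)
model = TinyMoE()
x = torch.randn(16, 3, 8, 8)          # random stand-in for image data
y = torch.randint(0, 10, (16,))
forget_class = 3
y[:4] = forget_class                   # make sure some forget samples exist

# 1) Identify the experts most often routed to for the forgotten class.
with torch.no_grad():
    _, chosen = model(x)
counts = torch.bincount(chosen[y == forget_class].flatten(), minlength=4)
target_experts = counts.argsort(descending=True)[:2].tolist()

# 2) Retrain only those experts and the router with the unlearning loss.
freeze_all_but(model, target_experts)
optimizer = torch.optim.Adam(
    [p for p in model.parameters() if p.requires_grad], lr=1e-4)
for _ in range(5):
    optimizer.zero_grad()
    logits, _ = model(x)
    unlearning_loss(logits, y, forget_class).backward()
    optimizer.step()
```

Freezing the classification head and the untouched experts keeps the update local, which is the point of the selective retraining step; whether the repository freezes exactly these parts is not confirmed by this README.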

Results

Experiments were conducted using the CIFAR-10 and CIFAR-100 datasets. Key findings include:

Optimal Hyperparameters:
- Training Data Used: 100%
- Retraining Epochs: 5
- Learning Rate: 0.0001

Performance:
- Accuracy on the forgotten class dropped to 0.7%.
- Accuracy on the remaining classes stayed above 98%.
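
The two performance figures correspond to two separate accuracies: one over samples of the forgotten class and one over everything else. A minimal sketch of that split is shown below; the function name `forget_retain_accuracy` and the toy labels are assumptions for illustration and do not reproduce the reported numbers.

```python
# Hypothetical evaluation helper, not taken from the repository.
def forget_retain_accuracy(labels, predictions, forget_class):
    """Return (accuracy on the forgotten class, accuracy on all other classes)."""
    forget_hits = forget_total = retain_hits = retain_total = 0
    for label, pred in zip(labels, predictions):
        if label == forget_class:
            forget_total += 1
            forget_hits += int(pred == label)
        else:
            retain_total += 1
            retain_hits += int(pred == label)
    forget_acc = forget_hits / forget_total if forget_total else float("nan")
    retain_acc = retain_hits / retain_total if retain_total else float("nan")
    return forget_acc, retain_acc

# Toy data: class 3 is the forgotten class.
labels      = [3, 3, 3, 3, 0, 1, 2, 4]
predictions = [5, 1, 3, 0, 0, 1, 2, 4]
print(forget_retain_accuracy(labels, predictions, forget_class=3))  # (0.25, 1.0)
```

Successful unlearning shows up as a low first value together with a high second value.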

Installation

To set up the environment, run:

pip install -r requirements.txt

Ensure you have PyTorch and the necessary dependencies installed.
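
One quick way to confirm that PyTorch is importable in the active environment is a short standard-library check. This is only a convenience sketch; the helper name `is_installed` is an assumption and is not part of the repository.

```python
import importlib.util

# Hypothetical helper: reports whether a top-level package can be imported.
def is_installed(package_name):
    return importlib.util.find_spec(package_name) is not None

if is_installed("torch"):
    print("PyTorch is available.")
else:
    print("PyTorch is missing; run: pip install -r requirements.txt")
```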

Future Work

- Extend to transformer-based architectures.
- Develop more granular unlearning techniques at the feature level.
- Establish theoretical guarantees for selective unlearning.

References

Bourtoule et al., 2020: Machine Unlearning (arXiv:1912.03817)
Jiang et al., 2024: Mixtral of Experts (arXiv:2401.04088)
Krizhevsky: The CIFAR-10 and CIFAR-100 Datasets
