Fairness for Cooperative Multi-Agent Learning with Equivariant Policies Reproducibility Study

This repository contains a reproducibility and extension study for the Fairness for Cooperative Multi-Agent Learning with Equivariant Policies Reproducibility Research Paper. The goal was to analyse the paper's reproducibility while studying if the author's claims generalise to a reduced, simpler world, avoiding the requirement of long training times.

Cloning the repository

To clone the repository with the simple_particle_envs submodule:

git clone --recurse-submodules <link_to_repository>

Setup

Installing the Recommended Anaconda environment (Python 3.9.15)

Windows:

conda env create -f environment_windows.yml

Linux:

conda env create -f environment_linux.yml

To setup multi-agent environments:

cd simple_particle_envs
pip install -e .

To verify installation, run:

xvfb-run -a python baselines/baselines.py --mode test --render

Training

To train a Fair-E model, run:

python main.py --env simple_torus --algorithm ddpg_symmetric

To train a Fair-E model with equivariance and shared reward, run:

python main.py --env simple_torus --algorithm ddpg_symmetric --equivariant --collaborative

To train a Fair-ER model, run:

python main.py --env simple_torus --algorithm ddpg_speed_fair --lambda_coeff 0.5

The control parameter of fairness can be adjusted in configs.py.

To resume training from a checkpoint, run:

python main.py --env simple_torus --algorithm ddpg_symmetric --checkpoint_path /path/to/model/checkpoints

To train with a varying number of evaders and pursuers we use the simple_torus.py scenario:

python main.py --env simple_torus --algorithm ddpg_symmetric --nb_agents 5 --nb_prey 1

Evaluation

It is important to set the same flags that you used for training, so if you have used the equivariant and collaborative flag, you should also set them when running the evaluation.

To collect trajectories from a trained model, run eval/collect_actions.py or eval/collect_actions_symmetric.py. Here are a few examples:

Greedy pursuers against random-moving evader:

python eval/collect_actions.py --env simple_torus --pred_policy greedy --prey_policy random --seed 75

CD-DDPG pursuers (Fair-E) against sophisticated evader:

python eval/collect_actions_symmetric.py --env simple_torus --pred_policy ddpg --prey_policy cosine --seed 72 --checkpoint_path /path/to/model/checkpoints

* CD-DDPG pursuers (Fair-ER) against sophisticated evader: 
```eval
python eval/collect_actions.py --env simple_torus --pred_policy ddpg --prey_policy cosine --seed 72 --checkpoint_path /path/to/model/checkpoints

To collect trajectories trained with a varying number of evaders and pursuers we use the simple_torus scenario again. For example, with a Fair-E model:

python eval/collect_actions_symmetric.py --env simple_torus --pred_policy ddpg --prey_policy cosine --seed 72 --checkpoint_path /path/to/model/checkpoints --nb_agents 5 --nb_prey 1

To create the plots, run:

python eval/make_plots.py --fp path/of/trajectories --plot (1-5)

for 4 agents:
python eval/make_plots_4_predators.py --fp path/of/trajectories --plot (1-3)

Name		Name	Last commit message	Last commit date
Latest commit History 155 Commits
algorithms		algorithms
baselines		baselines
eval		eval
models		models
models_evaluation		models_evaluation
simple_particle_envs @ c8a9b1d		simple_particle_envs @ c8a9b1d
trajectories_for_plots		trajectories_for_plots
utils		utils
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE.txt		LICENSE.txt
MLRC_2022_paper.pdf		MLRC_2022_paper.pdf
README.md		README.md
__init__.py		__init__.py
configs.py		configs.py
environment_linux.yml		environment_linux.yml
environment_windows.yml		environment_windows.yml
main.py		main.py
readme_jupyter.ipynb		readme_jupyter.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fairness for Cooperative Multi-Agent Learning with Equivariant Policies Reproducibility Study

Cloning the repository

Setup

Training

Evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Fairness for Cooperative Multi-Agent Learning with Equivariant Policies Reproducibility Study

Cloning the repository

Setup

Training

Evaluation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages