## Main scripts

- `algorithm/marl_ppo.py` for training multi-agent PPO on the target MPE environment.
  - Note: run this script as a python module, i.e. `python -m algorithm.marl_ppo`, for the imports to work properly.
- `envs/target_mpe_env.py`. This is the main class that defines the target MPE environment; see the usage sketch after this list.
  - Also look at `envs/wrapper.py` for the env wrappers.
- `config/mappo_config.py`. This is the one and only file for changing config values to run experiments. It uses python classes instead of a yaml file to get autocompletion, type checking, and easier refactoring when accessing or changing the structure of the config; a sketch of this pattern follows the usage sketch below.
- `visualize_actor.py` for visualizing the trained actor in a local environment.
- `model/actor_critic_rnn.py` has all the flax linen networks used in the PPO.
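As a rough orientation, interacting with the environment plausibly looks like the sketch below, assuming a JaxMARL-style functional interface. The class name `TargetMPEEnvironment`, the constructor argument, and the `reset`/`step`/`action_space` signatures are all assumptions, not the file's confirmed API.

```python
# Hypothetical usage sketch, assuming a JaxMARL-style functional env API;
# the actual names in envs/target_mpe_env.py may differ.
import jax

from envs.target_mpe_env import TargetMPEEnvironment  # class name is an assumption

env = TargetMPEEnvironment(num_agents=3)  # constructor args are assumptions

key = jax.random.PRNGKey(0)
key, reset_key = jax.random.split(key)
obs, state = env.reset(reset_key)  # functional reset: returns obs dict and env state

for _ in range(10):
    key, act_key, step_key = jax.random.split(key, 3)
    # sample a random action per agent; action_space(agent) is an assumption
    actions = {agent: env.action_space(agent).sample(act_key) for agent in env.agents}
    obs, state, rewards, dones, infos = env.step(step_key, state, actions)
```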
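The config-as-python-classes pattern is sketched below. `WandbConfig` and its `mode` field are referenced elsewhere in this README; every other class and field name here is illustrative, not the actual schema of `config/mappo_config.py`.

```python
# Minimal sketch of the class-based config pattern. Only WandbConfig.mode is
# documented in this README; the other fields are hypothetical.
from dataclasses import dataclass, field


@dataclass
class WandbConfig:
    project: str = "JaxInforMARL"  # hypothetical field
    mode: str = "disabled"  # change to "online" to enable wandb logging


@dataclass
class MAPPOConfig:
    num_envs: int = 64  # hypothetical field
    lr: float = 3e-4  # hypothetical field
    wandb: WandbConfig = field(default_factory=WandbConfig)


config = MAPPOConfig()
# Unlike raw yaml dicts, attribute access gets IDE autocompletion and type
# checking, and renaming a field refactors every use site.
config.wandb.mode = "online"
```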
## Typical train and test flow

- Run the `train_with_gpu.ipynb` notebook in a colab with a gpu.
  - Remember to set up the config in `WandbConfig` in `config/mappo_config.py` and change the mode to `online` to get wandb logging.
  - The artifacts are saved under the name "PPO_RNN_Runner_State".
- Visualize the actor with `visualize_actor.py` after changing the `artifact_version` variable in the `if __name__ == "__main__"` block; see the sketch after this list.
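For orientation, a minimal sketch of what that `__main__` block might look like, assuming the saved runner state is fetched from wandb: only `artifact_version` and the "PPO_RNN_Runner_State" artifact name come from this README; the project name and the download/restore steps are assumptions about the script's internals.

```python
# Hypothetical sketch of visualize_actor.py's entry point; only artifact_version
# and the "PPO_RNN_Runner_State" artifact name are documented in this README.
import wandb

if __name__ == "__main__":
    artifact_version = "v0"  # set this to the artifact version you want to visualize

    run = wandb.init(project="JaxInforMARL", job_type="visualization")  # project name is an assumption
    artifact = run.use_artifact(f"PPO_RNN_Runner_State:{artifact_version}")
    checkpoint_dir = artifact.download()

    # ...restore the actor params from checkpoint_dir and roll out an episode locally...
```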
It is recommended to install either `requirements_jax_cpu.txt` or `requirements_jax_cuda.txt` before `requirements.txt` (e.g. `pip install -r requirements_jax_cuda.txt` followed by `pip install -r requirements.txt`), since the packages in `requirements.txt` would otherwise pull in their own jax version for you.
## Citation

If you use JaxInforMARL in your work, please cite it as follows:

```bibtex
@software{JaxInforMARL,
  title = {JaxInforMARL: Multi-Agent Target MPE RL Environments with GNNs in JAX},
  author = {Joseph Selvaraaj},
  year = {2025},
  url = {https://github.com/jselvaraaj/JaxInforMARL},
  version = {1.0.0}
}
```

