A reinforcement learning environment made using Godot and Godot RL Agents. Both players are controlled using a single RL agent (it receives observations from both players, and sends actions for both players).
Demo video: `2p_coop_box_sort_env.mp4`
- Open the project in Godot Engine (made with 4.5.beta6.mono on Windows; it may work with other versions too).
- Find the test scene (`scenes/test_scene/testing_scene.tscn`) and open it in the Godot Editor.
- Press 'F6' to start the scene.
You should see the trained agent solving the environment using ONNX inference. If you wish to train your own agent, refer to the Godot RL Agents repository; check its tutorial section to learn more.
Two players must cooperate to push boxes into correct goals based on the category within a time limit. The boxes must not reach the limit (red area).
The RL agent receives data from position sensors, raycast sensors, and current fraction of the allowed time.
First robot:
- Box checkpoint (not visible, located at the opening between the two rooms)

Second robot:
- Green goal
- Yellow goal
- Box checkpoint
- Collision shape of the limit (red area)
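As a hedged illustration of how these readings could be combined with the raycast distances and time fraction mentioned above, a flattened observation might be assembled like this (sensor counts, ordering, and names are assumptions, not the project's actual layout):

```python
# Illustrative only: assembles one flat observation vector from the three
# source categories described above. Counts and ordering are assumptions.
def build_observation(positions, ray_distances, elapsed, time_limit):
    # positions: list of (x, y) tuples from the position sensors
    # ray_distances: distances reported by the raycast sensors
    # elapsed / time_limit: current fraction of the allowed episode time
    obs = []
    for x, y in positions:
        obs.extend([x, y])
    obs.extend(ray_distances)
    obs.append(elapsed / time_limit)
    return obs

print(build_observation([(0.1, 0.2)], [1.0, 0.5], 30.0, 120.0))
```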
There are multiple raycast sensors attached to each robot. They provide distances to the walls, as well as to the yellow and green boxes.
Each sensor reacts to only a single physics layer, so e.g. the wall raycast sensor reports only the distances to the walls, ignoring any occluding boxes.
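The per-layer filtering can be sketched in plain Python (a conceptual model, not Godot's raycast API): each sensor scans only objects on its own layer and reports the nearest hit, so closer objects on other layers do not shadow its reading.

```python
# Conceptual sketch of layer-filtered raycasts: each sensor considers only
# one physics layer, so objects on other layers never occlude its reading.
def ray_distance(objects_along_ray, layer, max_range=100.0):
    # objects_along_ray: list of (layer_name, distance) pairs in front of
    # the sensor; layer names here are hypothetical.
    hits = [d for l, d in objects_along_ray if l == layer]
    return min(hits) if hits else max_range

along_ray = [("box_yellow", 2.0), ("wall", 5.0)]
print(ray_distance(along_ray, "wall"))       # wall sensor ignores the box
print(ray_distance(along_ray, "box_green"))  # no green box hit: max range
```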
The env uses a multi-discrete action space. For each player, there is a single discrete action which determines the movement direction.
```gdscript
func get_action_space() -> Dictionary:
	return {
		"p1_move": {"size": move_dirs.size(), "action_type": "discrete"},
		"p2_move": {"size": move_dirs.size(), "action_type": "discrete"},
	}
```
```gdscript
var move_dirs = [
	Vector2.LEFT,
	Vector2.RIGHT,
	Vector2.UP,
	Vector2.DOWN,
	Vector2.LEFT + Vector2.UP,
	Vector2.RIGHT + Vector2.UP,
	Vector2.LEFT + Vector2.DOWN,
	Vector2.RIGHT + Vector2.DOWN,
]
```

- +10 for each box entering the checkpoint
- +10 for each box entering the correct goal
- -1 for each box entering the limit
- -1 for each box entering the wrong goal
- -1 for timeout
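Under the scheme above, the per-event rewards can be sketched as a simple lookup (a hypothetical helper with made-up event names, not the environment's actual code):

```python
# Hypothetical reward table mirroring the list above; event names are
# illustrative, not identifiers from the project.
REWARDS = {
    "box_checkpoint": 10.0,
    "box_correct_goal": 10.0,
    "box_limit": -1.0,
    "box_wrong_goal": -1.0,
    "timeout": -1.0,
}

def episode_reward(events):
    # events: sequence of event names emitted during one episode
    return sum(REWARDS[e] for e in events)

# Two boxes pass the checkpoint, one is sorted correctly, episode times out:
print(episode_reward(["box_checkpoint", "box_checkpoint",
                      "box_correct_goal", "timeout"]))  # 29.0
```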
Training was done with the SB3 example training script, modified to use different hyperparameters.
Modified lines include:
```python
model: PPO = PPO(
    "MultiInputPolicy",
    env,
    ent_coef=0.025,
    n_steps=1024,
    batch_size=1024 * env.num_envs,
    target_kl=0.006,
    n_epochs=40,
    vf_coef=0.1,
    tensorboard_log=args.experiment_dir,
    learning_rate=learning_rate,
)
```

Training was started with the following CL args (excluding the arg used for checkpoint saving). Note that you will need to export the game to fill in the env path; do not run the command before filling in the correct executable path:
```
--env_path=PATH_TO_EXPORTED_EXECUTABLE
--n_parallel=4
--onnx_export_path=model.onnx
--timesteps=100_000_0000
--save_model_path=model.zip
--speedup=20
--experiment_name=2p_coop_box_sorting2
```

Training stats (rewards/success rates are during training, not eval):

- Godot - 4.5.beta6.mono on Win10
- Godot RL Agents - One version behind the newest commit at the time of writing, but it should work with the current one (https://github.com/edbeeching/godot_rl_agents/commit/d32518a7a0725b6a3e68ac6cb4ecec26517e18eb)
- Godot RL Agents Plugin - Local version that should be similar to: edbeeching/godot_rl_agents_plugin#53.