P1_Navigation_Submission

The navigation project uses reinforcement learning (RL) to search for an optimal policy in the Banana environment. This environment is a 3D world in which the agent can move around and collect bananas. There are two types of bananas: yellow bananas, which yield a reward of +1, and blue bananas, which yield a reward of -1. The aim is to find a policy that maximises the total reward.

States

The state space has 37 dimensions, including the velocity of the agent and ray-based perception of objects in the agent's forward direction. There are 4 discrete actions in the action space, corresponding to moving forward and backward, and turning left and right.
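
As an illustration of how a 4-way discrete action might be chosen from a set of Q-values, here is a minimal sketch. The epsilon-greedy rule is an assumption (this README does not say which exploration strategy the notebook uses), the index-to-action mapping in `ACTION_NAMES` is assumed from the order the actions are listed above, and `select_action` is a hypothetical helper, not a function from the repository.

```python
import random

STATE_SIZE = 37  # dimensions of the state vector, as described above
# Assumed index order; the README only lists the four actions by name.
ACTION_NAMES = ("forward", "backward", "turn left", "turn right")


def select_action(q_values, epsilon=0.0, rng=random):
    """Pick an action index from one Q-value per action (epsilon-greedy).

    With probability `epsilon` a random action is returned, otherwise the
    index of the largest Q-value.
    """
    if len(q_values) != len(ACTION_NAMES):
        raise ValueError("expected one Q-value per action")
    if rng.random() < epsilon:
        return rng.randrange(len(ACTION_NAMES))
    # Greedy choice: index of the maximum Q-value.
    return max(range(len(q_values)), key=q_values.__getitem__)


if __name__ == "__main__":
    action = select_action([0.1, 0.5, -0.2, 0.3])
    print(action, ACTION_NAMES[action])
```

With `epsilon=0.0` the choice is purely greedy; raising `epsilon` trades exploitation for exploration.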

Solution

The environment is considered solved once the average reward over 100 consecutive episodes is greater than 13.
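
The criterion above can be checked with a sliding window over the per-episode scores. The following is a minimal sketch; `first_solved_episode` and its parameter names are hypothetical and not taken from the notebook.

```python
from collections import deque


def first_solved_episode(scores, window=100, target=13.0):
    """Return the 1-based episode at which the mean of the last `window`
    scores first exceeds `target`, or None if that never happens."""
    recent = deque(maxlen=window)  # keeps only the most recent scores
    for episode, score in enumerate(scores, start=1):
        recent.append(score)
        # Only test once a full window of episodes is available.
        if len(recent) == window and sum(recent) / window > target:
            return episode
    return None
```

For example, 100 episodes that each score 14 satisfy the criterion at episode 100, whereas a run that always scores exactly 13 never does, because the average must be strictly greater than 13.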

Installation

The Jupyter notebooks are written to run on a Windows machine with a CUDA-enabled GPU.

Dependencies

First, install conda: https://www.anaconda.com/distribution/#download-section

Next, create a new conda environment and activate it:

conda create -n Navigation python=3.6.3 anaconda

activate Navigation

Now install PyTorch and Unity ML-Agents:

conda install pytorch=0.4.0 cuda80 -c pytorch

pip install mlagents==0.4.0

Finally, clone the repository, which contains the environment and scripts:

git clone https://github.com/SamJCKnox/P1_Navigation_Submission.git
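
After the steps above, a quick check that the expected packages are importable from the active conda environment can save time before opening the notebook. This is a minimal sketch using only the standard library; `missing_packages` is a hypothetical helper. Only `torch` (the import name of PyTorch) is checked here, because the import name of the ML-Agents package depends on the release and is not stated in this README.

```python
from importlib.util import find_spec


def missing_packages(names):
    """Return the top-level module names in `names` that cannot be found."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(["torch"])
    print("Missing:", missing if missing else "none")
```

Run it with the Navigation environment activated; an empty result means the listed modules can be located.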

Instructions

The Navigation.ipynb notebook includes all the code required to run. Run all cells to train the agent. The cell outputs show how the agent is performing.

Report.pdf describes how the architecture of the Q-network was refined, along with the hyperparameters.
