This repository was archived by the owner on Oct 31, 2023. It is now read-only.
Hi Rutav,

The plots provided in the paper (example given below) show the number of samples versus the success rate. I also see that the code logs the success rate after every iteration, as mentioned here: https://github.com/ShahRutav/mjrl/blob/6cdb8b8c72279abe8d9d8b8a800f8ac396413e42/mjrl/utils/train_agent.py#L119. According to the default configuration file here, https://github.com/facebookresearch/RRL/blob/main/examples/config/hammer_dapg.yaml#L40, the code is run for 200 iterations, and I also see here, https://github.com/facebookresearch/RRL/blob/main/examples/config/hammer_dapg.yaml#L40, that the number of trajectories is 200. So I think the horizon length, say h, has to be 100, assuming training runs for 4 x 10^6 samples, if I am not wrong. I have the following doubt:

Is eval_success logged at each iteration according to the code, and do the plots reported in the paper simply have their axis scaled by an appropriate factor, in our case something like 2 x 10^X (where X depends on the point above), so that the total number of logged eval_success values is 200?

Please do let me know if my understanding is correct in this setting.

Thanks,
Megh
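For reference, a minimal sketch of the sample-count arithmetic in the question above. All numbers here are assumptions taken from the message itself (200 iterations and 200 trajectories per iteration from hammer_dapg.yaml, and a presumed 4 x 10^6 total sample budget), not values confirmed by the paper authors:

```python
# Back-of-the-envelope check of the numbers quoted in the question.
iterations = 200           # num iterations from hammer_dapg.yaml (assumed)
trajectories = 200         # trajectories sampled per iteration (assumed)
total_samples = 4_000_000  # presumed total environment samples, 4 x 10^6

# If all three hold, the episode horizon h is forced to be:
horizon = total_samples // (iterations * trajectories)
print(horizon)  # 100

# With eval_success logged once per iteration, the x-axis factor that
# maps iteration index to sample count would then be:
samples_per_iteration = trajectories * horizon
print(samples_per_iteration)  # 20000, i.e. 2 x 10^4 under these assumptions
```

Under these assumptions the "2 x 10^X" factor in the question would be 2 x 10^4, and 200 logged eval_success values would span the full 4 x 10^6-sample x-axis.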