
Actor Critic #11

Closed
ryanpeach wants to merge 15 commits into main from feature/actor-critic

@ryanpeach (Owner) commented Apr 20, 2024

Great job making a new implementation!

Here are some tasks to complete before merging this PR:

Task List for all Jupyter Notebooks

  • Add the proper headers to the notebook:
      %load_ext autoreload
      %autoreload 2
  • Make sure the Jupyter notebook is importable without running any code. Most cells should look like this:
      def foo():
          pass

      if __name__ == '__main__':
          foo()
  • Add APA-style citations for all sources at the bottom.

  • Actually read the paper you cite, not just the tutorial.

  • Make Logseq-compatible flashcards. Save them to a README in the same folder as the notebook.

  • In natural language, explain your understanding of the solution.

  • In mathematical language, document every implemented equation in the same notation as the source paper. Preferably do this in the function's docstring, and cite the equation number from the paper. If it's a single-line expression, you can add a comment in the code instead.

  • Comment the shapes of any NumPy arrays or torch tensors. Make assertions on the output.

  • Ask ChatGPT to review your work.

  • Functions that have more than one parameter should have a * before the first parameter to force the user to use named arguments.

  • Do not set default values; otherwise you might forget to pass parameters up the stack.

  • Make sure it runs on both CPU and GPU.
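
As a rough illustration of how several of the items above could fit together in one notebook cell, here is a minimal sketch. The function name `discounted_returns`, its parameters `rewards` and `gamma`, and the recursion it documents are hypothetical examples rather than code from this PR, and plain Python lists stand in for NumPy arrays or torch tensors so the sketch has no dependencies. The equation reference in the docstring is a placeholder to be replaced with the number from the paper you cite.

```python
def discounted_returns(*, rewards, gamma):
    """Compute G_t = r_t + gamma * G_{t+1} for every step t, with G_T = 0.

    Placeholder citation: replace this line with the equation number and
    notation from the paper you actually cite.
    """
    # The bare `*` makes every parameter keyword-only, and there are no
    # default values, so callers must pass each argument by name.
    assert 0.0 <= gamma <= 1.0, "gamma must lie in [0, 1]"

    # rewards: shape (T,)
    returns = [0.0] * len(rewards)  # shape (T,)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running

    # Assertion on the output: same shape as the input.
    assert len(returns) == len(rewards)
    return returns


if __name__ == "__main__":
    # Runs only when executed directly, so importing the notebook has no side effects.
    print(discounted_returns(rewards=[1.0, 1.0, 1.0], gamma=0.5))  # [1.75, 1.5, 1.0]
```

Calling `discounted_returns([1.0], 0.5)` positionally, or omitting `gamma`, raises a `TypeError`, which is the behavior the keyword-only and no-defaults items are meant to enforce.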

@ryanpeach force-pushed the feature/actor-critic branch 3 times, most recently from 12b60bc to 13d6569 on April 22, 2024 21:16
@ryanpeach force-pushed the feature/actor-critic branch from 13d6569 to f45318f on June 15, 2024 18:52
@ryanpeach force-pushed the feature/actor-critic branch from 936d4e5 to dc0d159 on June 18, 2024 15:16
@ryanpeach force-pushed the feature/actor-critic branch from 14edc3b to d54a5a5 on February 14, 2025 16:16
@ryanpeach force-pushed the feature/actor-critic branch from d54a5a5 to 8e6cde0 on February 14, 2025 16:17
@ryanpeach force-pushed the feature/actor-critic branch 2 times, most recently from 32c99c8 to ae468fd on February 14, 2025 17:32
@ryanpeach force-pushed the feature/actor-critic branch from ae468fd to 5730b33 on February 14, 2025 17:33
…90.26, 79.8]

Main Results

Results:
{'last_10_percent_mean': [94.82, 99.26, 93.82, 96.94, 90.26, 79.8]}
…5.58, 97.88]

Main Results

Results:
{'last_10_percent_mean': [90.02, 87.16, 98.1, 99.84, 95.58, 97.88]}
…4.8, 95.2]

Main Results

Results:
{'last_10_percent_mean': [74.12, 88.0, 93.96, 96.52, 94.8, 95.2]}
@ryanpeach force-pushed the feature/actor-critic branch from 544e379 to 3a3a3d8 on February 25, 2025 18:42
@ryanpeach closed this on Apr 25, 2025
@ryanpeach reopened this on Apr 25, 2025
@ryanpeach closed this on Mar 6, 2026