Rtau sumeet#1

Open
SUMEETRM wants to merge 3 commits into main from rtau_sumeet

Conversation

@SUMEETRM

Collaborator

Joar's description: Given a reward function R and a transition function \tau, let R^\tau be the reward function R^\tau(s,a,s') = E_{S' ~ \tau(s,a)}[R(s,a,S')], so R^\tau does not depend on the realized next state s'. Moreover, given a reward metric d, let d^\tau(R1, R2) = d(R1^\tau, R2^\tau). The question is: is d^\tau noticeably better than d in practice?
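
To make the definitions concrete, here is a minimal tabular sketch, not the code in this PR. It assumes R and \tau are stored as NumPy arrays of shape (S, A, S); the names `expected_reward`, `d_tau`, and `l2_distance` are hypothetical, and the plain L2 distance only stands in for whichever reward metric d is actually under study.

```python
import numpy as np

def expected_reward(R, tau):
    """Return R^tau for tabular R and tau, both of shape (S, A, S).

    R^tau(s, a, s') = E_{S' ~ tau(s, a)}[R(s, a, S')], which is constant in s'.
    """
    exp_r = np.einsum("sat,sat->sa", tau, R)  # expectation over next states
    # Broadcast back to (S, A, S) so R^tau has the same signature as R.
    return np.broadcast_to(exp_r[:, :, None], R.shape).copy()

def d_tau(d, R1, R2, tau):
    """d^tau(R1, R2) = d(R1^tau, R2^tau) for any metric d on reward arrays."""
    return d(expected_reward(R1, tau), expected_reward(R2, tau))

def l2_distance(R1, R2):
    """Placeholder metric (assumption): Euclidean distance between rewards."""
    return float(np.linalg.norm(R1 - R2))

# Toy example: R2 is R1 plus noise whose expectation under tau is zero.
rng = np.random.default_rng(0)
S, A = 3, 2
tau = rng.random((S, A, S))
tau /= tau.sum(axis=-1, keepdims=True)  # each tau[s, a] is a distribution

R1 = rng.normal(size=(S, A, S))
noise = rng.normal(size=(S, A, S))
noise -= np.einsum("sat,sat->sa", tau, noise)[:, :, None]  # center under tau
R2 = R1 + noise

plain_dist = l2_distance(R1, R2)       # d sees the next-state noise
tau_dist = d_tau(l2_distance, R1, R2, tau)  # d^tau averages it away
print(plain_dist, tau_dist)
```

In this toy case the two rewards have the same expected reward for every (s, a), so d^\tau treats them as identical (up to floating-point error) while d does not. Whether that difference matters in practice is the question above.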

1 participant