You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I think you're correct, this should mean that you take all the rewards in each episode, sum them together, and then calculate the per-episode average over 100 episodes.
Like https://gym.openai.com/evaluations/eval_aqTWbALwQEKrLIyU9ZzmLw/ this one, is there any list of each environments's evaluation since most of environments' page did show the evaluation results like this :https://gym.openai.com/envs/Reacher-v2/ .
Also, when it said
"Average reward" here means the mean of cumulative rewards(sum of one step reward within one episode) over 100 episodes?
Thanks!.
The text was updated successfully, but these errors were encountered: