-
Notifications
You must be signed in to change notification settings - Fork 109
Open
Description
Hi, I am oscar and I do appreciate those source codes with integrating various algorithms.
I have tried to run the nature DQN with default setting through Pong and BeamRider environment and found that the reward scale is not as large as the one posted in main page.
For Pong Environment,
I just manually set the clip_rewards = False and got the final mean around 27.430 which is far from the max level(around 300) posted.
Is it due to difference hyper-parameters setting or may be due to some plotting techniques?
BTW, I will really appreciate if you can update the plotting code, Thank you!
Metadata
Metadata
Assignees
Labels
No labels