(Solved) No env.reset() at the end of each training epoch.

【**Existing code:**】
Only reset the environment at the beginning of training loop, that is, only call env.reset() at the first epoch.
【**Right(might) training paradigm**】
I checked OpenAI spinning-up's implement of PPO [https://github.com/openai/spinningup/blob/master/spinup/algos/pytorch/ppo/ppo.py](url), they do reset the env at the end of each epoch (same as reset it at the beginning of each epoch).

Correct me if I were wrong:)

P.S.: It;s still nice code!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

(Solved) No env.reset() at the end of each training epoch. #67

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

(Solved) No env.reset() at the end of each training epoch. #67

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions