Skip to content

(Solved) No env.reset() at the end of each training epoch. #67

@slDeng1003

Description

@slDeng1003

Existing code:
Only reset the environment at the beginning of training loop, that is, only call env.reset() at the first epoch.
Right(might) training paradigm
I checked OpenAI spinning-up's implement of PPO https://github.com/openai/spinningup/blob/master/spinup/algos/pytorch/ppo/ppo.py, they do reset the env at the end of each epoch (same as reset it at the beginning of each epoch).

Correct me if I were wrong:)

P.S.: It;s still nice code!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions