[Question] Training scripts and code for the CM and RM models in Safe RLHF-V #202

@Tunanzzz

Questions

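I found the PPO training script for Safe RLHF-V, but I have not been able to find the training scripts for the reward model (RM) and cost model (CM) that it requires. Could you give me some guidance?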

Labels: question (Further information is requested)
