### Required prerequisites - [x] I have read the documentation <https://align-anything.readthedocs.io>. - [x] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/align-anything/issues) and [Discussions](https://github.com/PKU-Alignment/align-anything/discussions) that this hasn't already been reported. (+1 or comment there if it has.) - [x] Consider asking first in a [Discussion](https://github.com/PKU-Alignment/align-anything/discussions/new). ### Questions 我看到了safe rlhf-v的ppo训练脚本,但是他需要的rm和cm部分的训练脚本目前没有看到,请问可以给点指导嘛?