File tree Expand file tree Collapse file tree 2 files changed +12
-8
lines changed Expand file tree Collapse file tree 2 files changed +12
-8
lines changed Original file line number Diff line number Diff line change @@ -96,10 +96,13 @@ class NashMDTrainer(OnlineDPOTrainer):
96
96
preprocess_logits_for_metrics (`Callable[[torch.Tensor, torch.Tensor], torch.Tensor]`):
97
97
The function to use to preprocess the logits before computing the metrics.
98
98
99
- .. deprecated:: 0.22.0
100
- The following parameters are deprecated and will be removed in a future version:
99
+ reward_model:
101
100
102
- * `reward_model`: Use `reward_funcs` instead. For example, change `reward_model=model` to `reward_funcs=model`.
101
+ <Deprecated version="0.22.0">
102
+
103
+ This parameter is deprecated and will be removed in version 0.25.0. Use `reward_funcs` instead.
104
+
105
+ </Deprecated>
103
106
"""
104
107
105
108
_tag_names = ["trl" , "nash-md" ]
Original file line number Diff line number Diff line change @@ -96,12 +96,13 @@ class XPOTrainer(OnlineDPOTrainer):
96
96
preprocess_logits_for_metrics (`Callable[[torch.Tensor, torch.Tensor], torch.Tensor]`):
97
97
The function to use to preprocess the logits before computing the metrics.
98
98
99
- .. deprecated:: 0.22.0
100
- The following parameters are deprecated and will be removed in a future version:
99
+ reward_model:
101
100
102
- * `reward_model`: Use `reward_funcs` instead. For example, change `reward_model=model` to `reward_funcs=model`.
103
- * `reward_processing_class`: Use `reward_processing_classes` instead. For example, change
104
- `reward_processing_class=tokenizer` to `reward_processing_classes=tokenizer`.
101
+ <Deprecated version="0.22.0">
102
+
103
+ This parameter is deprecated and will be removed in version 0.25.0. Use `reward_funcs` instead.
104
+
105
+ </Deprecated>
105
106
"""
106
107
107
108
_tag_names = ["trl" , "xpo" ]
You can’t perform that action at this time.
0 commit comments