Large fluctuations in eval results at step 0

**Issue Description**
When starting training with the current configuration, the evaluation results at step 0 show significant fluctuations. This makes the initial evaluation unreliable and affects the reproducibility of experiments.

**Proposed Solution**
Enable validation before training starts by setting the following in train_XXX.yaml:

``` yaml
trainer:
  val_before_train: True
```

**Experimental Results**
After applying the fix, evaluation results on gsm8k-eval at step 0 became much more stable (see figure below):

<img width="1218" height="696" alt="Image" src="https://github.com/user-attachments/assets/669472a4-c392-472e-9619-ef586907a553" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Large fluctuations in eval results at step 0 #282

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Large fluctuations in eval results at step 0 #282

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions