
fix(reproducibility): add opt-in strict determinism across trainers #61

Merged
SoheylM merged 3 commits into main from codex/soh-14-strict-determinism-hardening, Mar 16, 2026
Conversation

@SoheylM (Contributor) commented Mar 5, 2026

Description

Adds an opt-in strict reproducibility path for all training entrypoints while preserving current default behavior.

  • Introduces engiopt/reproducibility.py with shared helpers:
    • seed_training(seed)
    • enable_strict_determinism(warn_only=True)
    • make_dataloader_generator(seed)
  • Adds strict_determinism: bool = False to all targeted training Args dataclasses.
  • Replaces ad-hoc seeding with seed_training(args.seed) in each training script.
  • Enables strict deterministic controls only when --strict-determinism is passed.
  • Hardens DataLoader reproducibility by supplying seeded generators for shuffle=True loaders.
  • Updates README with optional reproducibility usage (--strict-determinism).
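The helper module itself is not shown in the PR body. A minimal sketch of what `engiopt/reproducibility.py` could look like, based only on the function names above and standard PyTorch practice (the bodies are assumptions, not the merged code):

```python
"""Sketch of engiopt/reproducibility.py.

Function names come from the PR description; the implementations are
assumptions based on common PyTorch reproducibility practice.
"""

import os
import random

import numpy as np
import torch


def seed_training(seed: int) -> None:
    """Seed every RNG a training script typically touches."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)  # seeds CPU and all CUDA devices


def enable_strict_determinism(warn_only: bool = True) -> None:
    """Opt-in strict mode: prefer deterministic kernels everywhere.

    With warn_only=True, ops without a deterministic implementation
    emit a warning instead of raising, matching the PR's validation note.
    """
    # Required by cuBLAS for deterministic matmul on CUDA >= 10.2.
    os.environ.setdefault("CUBLAS_WORKSPACE_CONFIG", ":4096:8")
    torch.use_deterministic_algorithms(True, warn_only=warn_only)
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False


def make_dataloader_generator(seed: int) -> torch.Generator:
    """Seeded generator to pass to shuffle=True DataLoaders."""
    g = torch.Generator()
    g.manual_seed(seed)
    return g
```

A training script would then call `seed_training(args.seed)` unconditionally and `enable_strict_determinism()` only when `--strict-determinism` is passed, keeping the default path unchanged.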

Fixes SOH-14 (Linear)

Type of change

  • Documentation only change (no code changed)
  • Bug fix (non-breaking change which fixes an issue)
  • New algorithm (non-breaking change which adds a new model/algorithm)
  • Improvement to existing algorithm (non-breaking change which improves functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

Screenshots

N/A

Checklist:

Code Quality

  • I have run the pre-commit checks with pre-commit run --all-files
  • I have run ruff check . and ruff format
  • I have run mypy .
  • I have commented my code, particularly in hard-to-understand areas
  • My changes generate no new warnings

CleanRL Philosophy (for new/modified algorithms)

  • The implementation follows the CleanRL single-file philosophy: all training logic is contained in one file
  • The code is reproducible: random seeds are set, PyTorch determinism is enabled
  • Hyperparameters are configurable via command-line arguments using tyro
  • WandB logging is integrated with --track flag support
  • The model can be saved and restored via WandB artifacts (--save-model flag)

Algorithm Completeness (for new algorithms)

  • Both training script (algorithm.py) and evaluation script (evaluate_algorithm.py) are provided
  • The algorithm works with EngiBench's Problem interface
  • The algorithm is added to the README table with correct metadata

Documentation

  • I have made corresponding changes to the documentation
  • New algorithms include docstrings explaining the approach and any paper references

Validation

  • Determinism smoke check (cgan_cnn_2d) run twice on same machine with strict mode and fixed seed; resulting checkpoint tensor hashes matched for both generator and discriminator.
  • Non-strict short run completed successfully (default behavior path unchanged).
  • Strict mode configured with warn_only=True for nondeterministic ops to warn and continue.
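The checkpoint-hash comparison from the smoke check above can be done with nothing beyond `hashlib`. A hypothetical sketch (the helper name `checkpoint_digest` is mine, not from the PR) that hashes parameter names and raw tensor bytes in a fixed order, so two runs match exactly when every tensor matches bit-for-bit:

```python
import hashlib


def checkpoint_digest(state_dict: dict) -> str:
    """SHA-256 over parameter names and raw bytes, in sorted key order.

    For a torch model, pass a pre-serialized dict such as
    {k: v.detach().cpu().numpy().tobytes() for k, v in model.state_dict().items()}.
    Sorting the keys makes the digest independent of dict insertion order.
    """
    h = hashlib.sha256()
    for name in sorted(state_dict):
        h.update(name.encode())
        h.update(bytes(state_dict[name]))
    return h.hexdigest()
```

Comparing `checkpoint_digest(...)` for the generator and discriminator across two strict-mode runs with the same seed is then a single string equality check.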

linear bot commented Mar 5, 2026

@SoheylM SoheylM requested a review from g-braeunlich March 5, 2026 09:45
@g-braeunlich (Collaborator) left a comment:

Just 2 small suggestions

@SoheylM SoheylM requested a review from g-braeunlich March 16, 2026 06:42
@SoheylM (Contributor, Author) commented Mar 16, 2026

@g-braeunlich quick ping when you have a moment: could you confirm yes/no on the latest TF32 guard update in the open thread? If yes, we’ll resolve and merge. Thanks!

@SoheylM SoheylM merged commit 709f886 into main Mar 16, 2026
3 checks passed
@SoheylM SoheylM deleted the codex/soh-14-strict-determinism-hardening branch March 16, 2026 07:36
