Release v0.1.13 · oumi-ai/oumi

What's Changed

Update dev_setup.md by @wizeng23 in #1641
[tiny] Remove vllm install commands by @wizeng23 in #1643
Support for custom processor args: misc improvements by @nikg4 in #1642
Add Countdown dataset and reward function by @wizeng23 in #1645
Adding LoRA train config for Qwen-VL 2.0 by @optas in #1637
[Evaluation] Convenience function for standard config retrieval by @kaisopos in #1644
Add demo script by @oelachqar in #1647
[bug] fix build errors by @oelachqar in #1649
Adding LoRA train config for SmolVLM by @optas in #1639
[tiny] Update cli help shorthand by @oelachqar in #1648
Oelachqar/update hooks by @oelachqar in #1650
Add verl PPO trainer by @wizeng23 in #1646
Fix a missing dependency in the verl trainer. by @taenin in #1651
Integrate verl GRPO trainer into train script by @wizeng23 in #1652
Update e2e tests to run on lambda by @wizeng23 in #1653
Add Qwen3 32B configs by @wizeng23 in #1661
Add Qwen3 30B A3B configs by @wizeng23 in #1665
[verl] Populate verl config from Oumi config by @wizeng23 in #1659
Provide option to configure label_ignore_index in training config by @nikg4 in #1666
[Documentation] Custom Evaluations (PR 1-of-2) by @kaisopos in #1664
InterVL-3.0 SFT with limited training capabilities by @optas in #1663
Add verl GRPO Countdown configs by @wizeng23 in #1668
Set explicit permissions for our test workflows. by @taenin in #1670
Add support for repetition_penalty in GrpoParams by @REDDITARUN in #1654
Fix broken tests due to precommit violations by @taenin in #1671
[Documentation] Custom Evaluations (PR 2-of-2) by @kaisopos in #1669
Migrate to logger.warning usage by @emmanuel-ferdman in #1673
Update the Oumi launcher and e2e tests to support runpod. by @taenin in #1672
Switch back to using GCP for e2e tests. by @taenin in #1675
Mark an e2e test as is_lora by @taenin in #1676
Add Phi4 reasoning plus configs by @wizeng23 in #1674
Fix a test breakage caused by a new Click version (8.2.0) by @taenin in #1679
chore: edited the link to the stars badge by @Radovenchyk in #1681
Update verl GRPO countdown configs by @wizeng23 in #1682
[very nit] center oumi logo in the cli by @oelachqar in #1683
[tiny] Update training environments doc by @wizeng23 in #1686
Add Geometry3K VLM dataset by @nikg4 in #1687
Add torchao version to pyproject.toml by @nikg4 in #1688
[Feature] Save evaluation config as YAML in output_dir #1546 by @asish-kun in #1680
Create a script to calculate memory used during training by @wizeng23 in #1441
Support VLM-s with VERL_GRPO trainer by @nikg4 in #1689
docs: Add GRPO/verl documentation by @wizeng23 in #1690
Update GRPO letter counting reward function and hparams for stability by @jgreer013 in #1692
[GRPO] Update letter counting notebook by @wizeng23 in #1694
Add Lambda Inference Engine by @oelachqar in #1695
Basic shell script for launching jobs on OLCF Frontier HPC cluster by @nikg4 in #1691
Add CoALM dataset class by @oelachqar in #1696
Added exponential backoff and content-type error handling in remote inference engine by @abhiramvad in #1685
Make SFT datasets usable with GRPO_TRL trainer by @nikg4 in #1698
Implement Falcon H1 by @dhiaEddineRhaiem in #1699
[tiny] Remove deprecated use_async_dataset from configs by @wizeng23 in #1701
Add sample inference configs for HuggingFaceTB/SmolVLM-Instruct by @nikg4 in #1703

New Contributors

@REDDITARUN made their first contribution in #1654
@emmanuel-ferdman made their first contribution in #1673
@Radovenchyk made their first contribution in #1681
@asish-kun made their first contribution in #1680
@abhiramvad made their first contribution in #1685
@dhiaEddineRhaiem made their first contribution in #1699

Full Changelog: v0.1.12...v0.1.13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.1.13

What's Changed

New Contributors

Contributors

Uh oh!