v0.1.13
What's Changed
- Update dev_setup.md by @wizeng23 in #1641
- [tiny] Remove vllm install commands by @wizeng23 in #1643
- Support for custom `processor args`: misc improvements by @nikg4 in #1642
- Add Countdown dataset and reward function by @wizeng23 in #1645
- Adding LoRA train config for Qwen-VL 2.0 by @optas in #1637
- [Evaluation] Convenience function for standard config retrieval by @kaisopos in #1644
- Add demo script by @oelachqar in #1647
- [bug] fix build errors by @oelachqar in #1649
- Adding LoRA train config for SmolVLM by @optas in #1639
- [tiny] Update cli help shorthand by @oelachqar in #1648
- Oelachqar/update hooks by @oelachqar in #1650
- Add verl PPO trainer by @wizeng23 in #1646
- Fix a missing dependency in the verl trainer. by @taenin in #1651
- Integrate verl GRPO trainer into train script by @wizeng23 in #1652
- Update e2e tests to run on lambda by @wizeng23 in #1653
- Add Qwen3 32B configs by @wizeng23 in #1661
- Add Qwen3 30B A3B configs by @wizeng23 in #1665
- [verl] Populate verl config from Oumi config by @wizeng23 in #1659
- Provide option to configure `label_ignore_index` in training config by @nikg4 in #1666
- [Documentation] Custom Evaluations (PR 1-of-2) by @kaisopos in #1664
- InternVL-3.0 SFT with limited training capabilities by @optas in #1663
- Add verl GRPO Countdown configs by @wizeng23 in #1668
- Set explicit permissions for our test workflows. by @taenin in #1670
- Add support for repetition_penalty in GrpoParams by @REDDITARUN in #1654
- Fix broken tests due to precommit violations by @taenin in #1671
- [Documentation] Custom Evaluations (PR 2-of-2) by @kaisopos in #1669
- Migrate to `logger.warning` usage by @emmanuel-ferdman in #1673
- Update the Oumi launcher and e2e tests to support runpod. by @taenin in #1672
- Switch back to using GCP for e2e tests. by @taenin in #1675
- Mark an e2e test as is_lora by @taenin in #1676
- Add Phi4 reasoning plus configs by @wizeng23 in #1674
- Fix a test breakage caused by a new Click version (8.2.0) by @taenin in #1679
- chore: edited the link to the stars badge by @Radovenchyk in #1681
- Update verl GRPO countdown configs by @wizeng23 in #1682
- [very nit] center oumi logo in the cli by @oelachqar in #1683
- [tiny] Update training environments doc by @wizeng23 in #1686
- Add Geometry3K VLM dataset by @nikg4 in #1687
- Add `torchao` version to `pyproject.toml` by @nikg4 in #1688
- [Feature] Save evaluation config as YAML in output_dir #1546 by @asish-kun in #1680
- Create a script to calculate memory used during training by @wizeng23 in #1441
- Support VLM-s with VERL_GRPO trainer by @nikg4 in #1689
- docs: Add GRPO/verl documentation by @wizeng23 in #1690
- Update GRPO letter counting reward function and hparams for stability by @jgreer013 in #1692
- [GRPO] Update letter counting notebook by @wizeng23 in #1694
- Add Lambda Inference Engine by @oelachqar in #1695
- Basic shell script for launching jobs on OLCF Frontier HPC cluster by @nikg4 in #1691
- Add CoALM dataset class by @oelachqar in #1696
- Added exponential backoff and content-type error handling in remote inference engine by @abhiramvad in #1685
- Make SFT datasets usable with GRPO_TRL trainer by @nikg4 in #1698
- Implement Falcon H1 by @dhiaEddineRhaiem in #1699
- [tiny] Remove deprecated `use_async_dataset` from configs by @wizeng23 in #1701
- Add sample inference configs for `HuggingFaceTB/SmolVLM-Instruct` by @nikg4 in #1703
New Contributors
- @REDDITARUN made their first contribution in #1654
- @emmanuel-ferdman made their first contribution in #1673
- @Radovenchyk made their first contribution in #1681
- @asish-kun made their first contribution in #1680
- @abhiramvad made their first contribution in #1685
- @dhiaEddineRhaiem made their first contribution in #1699
Full Changelog: v0.1.12...v0.1.13