-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Update Python package info, release 0.1.1 #3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+26
−5
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Closed
arnavsinghvi11
pushed a commit
that referenced
this pull request
Mar 26, 2024
adding the inspect_history
GFarnon
added a commit
to GFarnon/dspy
that referenced
this pull request
Jul 13, 2024
chenmoneygithub
added a commit
that referenced
this pull request
May 14, 2025
* D1 for GRPO * Improve type for arbor * Add temp test script for grpo * Add note about assumption of same inputs to all predictors * Disable LM cache in GRPO * Add support for valset * Add configurable variable module invocation handling strategy * Noahs dspy.LM changes and dspy.ArborProvider implementation * Add latest arbor changes * First working grpo version * Add modules * Add training args in initialize * Fix grpo * Add batches * Update finetuning infra * Revise server interface * Update example script * Move temporary interface to a separate file * Add LM level reinforce interface * Update testing script * Update api_base access for finetune * Style check * Style check all * Add Test script with MATH dataset * Ensure grpo trainer does not crash due to format issues, but temporary fix * Add error log * Fix termination * Delete temp files * Add diff * Add model update endpoint support * Remove experimental flag * Remove extra files * Add GRPO error resiliency to avoid parsing failures lead to crashes * Param Passthrough and Consistent Tutorial Script (#3) * Add param passthrough and default banking77 tutorial * Add more threads * Update banking tutorial --------- Co-authored-by: Noah Ziems <nziems2@[email protected]> * Lower beta param for banking tutorial * Add warning on no training data * Add train logging to GRPIO * Add max_prompt_length and max_completion_length support * fix litellm retries * no jsonadapter * fix errors * fix tests * fix tests * add the retry strategy back * Add working implementation of format errors and negative rewards * Fix bugs in validation * Add validation logic to grpo * Add more supported args * Support max grad norm * Add Train Shuffling logic * Add lora support * Add soft format rewards * Disable proivide_traceback in all grpo invoked evaluates * Remove temporary tutorial script * Revert classification finetuning tutorial * Comment out json adapter test * Fix ruff errors * Add teacher (#8) * Modify teacher preparation logic * Re-add teachers to GRPO * Style fix * Update tutorial script * Housekeeping * Revert number of train steps * Address PR comments * Add wandb support for GRPO training runs * Add completion logging * Add logging steps support * update report_to to be default none * Add max_context_length * Fix num_samples_per_input computation * Checkpointing Endpoints (#10) * Fix typo * Fix checkpoint url * fix merge conflict leftover * shorten the warning message in json adapter * fix the error piping --------- Co-authored-by: Lakshya A Agrawal <[email protected]> Co-authored-by: Dilara Soylu <[email protected]> Co-authored-by: Noah Ziems <nziems2@[email protected]> Co-authored-by: chenmoneygithub <[email protected]>
LakshyAAAgrawal
pushed a commit
to gepa-ai/dspy
that referenced
this pull request
Aug 13, 2025
* Add param passthrough and default banking77 tutorial * Add more threads * Update banking tutorial --------- Co-authored-by: Noah Ziems <nziems2@[email protected]>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.