-
Notifications
You must be signed in to change notification settings - Fork 106
Add over saturation constraint #438
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add over saturation constraint #438
Conversation
85cf65e to
f996254
Compare
## Summary E2E tests which check basic GuideLLM functionality, using vLLM simulator. ## Details - [x] Max requests test - [x] Max duration test - [ ] Over-saturation stopping test - skipped for now, will be enabled when #438 lands ## Test Plan - [x] Local testing - [x] GitHub action --- - [x] "I certify that all code in this PR is my own, except as noted below." ## Use of AI - [x] Includes AI-assisted code completion - [ ] Includes code generated by an AI application - [ ] Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes `## WRITTEN BY AI ##`)
ce85b1c to
46bc491
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
prompt-python_module_docs.md
Overall looks good. I have a few comments/discussion points to run through around names, types, package layout, etc. I haven't gone through the full logic yet to validate, will do that soon, but adding a writeup of the algorithm would help at the top.
Also, can you ensure all public methods have docs? I'll attach the prompt I use to generate docs that is generally fairly self sufficient and accurate in case you'd like to use that so it doesn't take too much time.
59ffa16 to
3ebdcaa
Compare
jaredoconnell
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I started the process of reviewing this PR. It appears to work, though I need to fix some problems with my VLLM instance to be sure about that.
I think this feature needs some more user-facing documentation. It may make sense to add a markdown file in the docs section explaining what the typical end users need to know about this.
e8a0bb6 to
c1e97fe
Compare
Done |
|
This PR is now ready again after applying all review changes |
sjmonson
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Few more nits. Also some of Mark's comments were missed about import style.
ccab11e to
4ef1f98
Compare
4ef1f98 to
44a668b
Compare
jaredoconnell
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is close to ready. The documentation needs to be updated, the other reviewers' latest comments need to be addressed, and one E2E test is still failing.
4b1774f to
265a146
Compare
Add over-saturation detection and stopping capability to GuideLLM CLI. - Implement OverSaturationDetector with statistical slope detection - Add OverSaturationConstraint for scheduler integration - Add CLI flags --stop-over-saturated and --stop-osd - Integrate with benchmark entrypoints and main CLI Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Alon Kellner <[email protected]>
Signed-off-by: Samuel Monson <[email protected]>
Signed-off-by: Samuel Monson <[email protected]>
Signed-off-by: Samuel Monson <[email protected]>
Signed-off-by: Samuel Monson <[email protected]>
Signed-off-by: Samuel Monson <[email protected]>
Signed-off-by: Samuel Monson <[email protected]>
Signed-off-by: Samuel Monson <[email protected]>
265a146 to
89b145d
Compare
jaredoconnell
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good, with the exception of one last out of date documentation part. I'll approve since once that's fixed I don't need to check again.
markurtz
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM pending @jaredoconnell's docs comment
Signed-off-by: Samuel Monson <[email protected]>
Summary
This PR adds over-saturation stopping to the GuideLLM CLI.
It's based on the OSD (Over-Saturation Detection) algorithm we developed and evaluated at Jounce.
Use
--stop-over-saturatedor--stop-osdto enable.Details
This PR adds:
--stop-over-saturated)Test Plan
Use of AI
## WRITTEN BY AI ##)