-
Notifications
You must be signed in to change notification settings - Fork 676
fix: improved leader clean up #4201
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: krishung5 <[email protected]>
Signed-off-by: krishung5 <[email protected]>
Signed-off-by: krishung5 <[email protected]>
… the leader Signed-off-by: Ryan Olson <[email protected]>
|
@coderabbitai full review |
✅ Actions performedFull review triggered. |
WalkthroughAdded a new trait method Changes
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~25 minutes
Poem
Pre-merge checks❌ Failed checks (2 warnings, 1 inconclusive)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 0
🧹 Nitpick comments (1)
lib/kvbm/src/block_manager/vllm/connector/leader/recorder.rs (1)
323-325: Recordupdate_connector_outputevents for replay parity.Every other Leader method recorded here emits an
Actionso the recorder/replayer path stays faithful. Whenupdate_connector_outputgains real behavior, having no corresponding action will make recordings incomplete. Please consider adding a matchingActionvariant (or equivalent instrumentation) and enqueueing it here so we keep the trace consistent.
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (3)
.gitignore(1 hunks)lib/kvbm/src/block_manager/vllm/connector/leader.rs(3 hunks)lib/kvbm/src/block_manager/vllm/connector/leader/recorder.rs(1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (7)
- GitHub Check: clippy (launch/dynamo-run)
- GitHub Check: tests (.)
- GitHub Check: tests (lib/bindings/python)
- GitHub Check: clippy (.)
- GitHub Check: tests (lib/runtime/examples)
- GitHub Check: tests (launch/dynamo-run)
- GitHub Check: Build and Test - dynamo
🔇 Additional comments (1)
lib/kvbm/src/block_manager/vllm/connector/leader.rs (1)
477-512: Nice cleanup ofFinishingslots.Re-entering the lock, advancing the state, and pruning both the slot and
inflight_requestswhen we finally hitFinishedcloses the leak path we saw in Finishing-only scenarios. Looks solid.
Under the condition
request_finishedmarks the slot as Finishing, but not Finished; there is no future re-evaluation of the slot to transition it to Finished and no calls to remove that slot from the slot manager, resulting in minor host memory leak.Summary by CodeRabbit