Skip to content

Conversation

lizhenyun01
Copy link
Collaborator

@lizhenyun01 lizhenyun01 commented Sep 22, 2025

优化deepep buffer clean方式 只在第0层做clear

Copy link

paddle-bot bot commented Sep 22, 2025

Thanks for your contribution!

"""
if layer.ep_size > 1:
if layer.fd_config.parallel_config.moe_phase.phase == "prefill":
if layer.fd_config.parallel_config.splitwise_role == "mixed":
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个条件也还保留着

self.ep_prefill_runner.clean_low_latency_buffer()
return self.apply_ep_prefill(layer, x, gate)
else:
if layer.fd_config.parallel_config.splitwise_role == "mixed":
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

同上

@lizhenyun01
Copy link
Collaborator Author

mark下 PR暂不合入

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants