Add global param to refined_recompute
#2899
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
PR types
New features
PR changes
APIs
Description
Add
globalparam torefined_recompute不同设备的显存大小可能不同,添加
global参数可以精细调节重计算百分比,方便在不同设备上达到显存占用和性能的平衡。refined_recompute: "global:x"的含义:n层ErnieDecoderLayer中有x层不重计算,n-x层重计算。默认值为0。n由模型结构确定,以ERNIE VL 28B为例,此时
n=14。global:3,以关闭3/14的重计算,换取性能提升。