Skip to content

Commit db11867

Browse files
Becker-ZHGoogle-ML-Automation
authored andcommitted
XProf GPU - Expose # of Chips to Profile
PiperOrigin-RevId: 829570660
1 parent 6b847ef commit db11867

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

docs/profiling.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -363,6 +363,9 @@ The following options are available for GPU profiling:
363363
* `gpu_pm_sample_interval_us`: Sets the sampling interval in microseconds
364364
for CUPTI PM sampling. Defaults to `500`.
365365
* `gpu_pm_sample_buffer_size_per_gpu_mb`: Sets the system memory buffer size per device in MB for CUPTI PM sampling. Defaults to 64MB. The maximum supported value is 4GB.
366+
* `gpu_num_chips_to_profile_per_task`: Specifies the number of GPU devices to
367+
profile per task. If set to 0, all available GPUs will be profiled. This can
368+
be used to decrease the trace collection size.
366369
* `gpu_dump_graph_node_mapping`: If enabled, dumps CUDA graph node
367370
mapping information into the trace. Defaults to `False`.
368371

0 commit comments

Comments
 (0)