Commit aafd5d3

[Benchmark] Drop memory bound kernels and replace them with gemms (#887)
1 parent 8994ec5 commit aafd5d3

2 files changed: +3 -3 lines changed

.github/workflows/benchmark.yml

Lines changed: 1 addition & 1 deletion

@@ -120,7 +120,7 @@ jobs:
 
 source .venv/bin/activate
 
-KERNELS=("softmax" "geglu" "swiglu" "jsd" "welford" "kl_div" "int4_gemm" "layer_norm" "layer_norm-bwd" "rms_norm" "rms_norm-bwd" "cross_entropy" "flash_attention")
+KERNELS=("softmax" "jsd" "welford" "kl_div" "layer_norm" "layer_norm-bwd" "rms_norm" "rms_norm-bwd" "cross_entropy" "flash_attention" "gemm" "grouped_gemm")
 NUMSHARDS=${{ inputs.num-shards }}
 SHARD=${{ inputs.shard }}
 
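
The hunk above defines `KERNELS`, `NUMSHARDS`, and `SHARD`, but the code that picks the kernels for a given shard lies outside the diff context. As a rough illustration only, the sketch below shows one common way such a split could work: round-robin by array index. The function name `kernels_for_shard` is hypothetical, and a 0-based `SHARD` is an assumption; the real workflow may number or distribute shards differently.

```bash
#!/usr/bin/env bash
# Hypothetical sketch: round-robin assignment of kernels to shards.
# The actual selection logic in benchmark.yml is not visible in this diff.

# Kernel list as it stands after this commit.
KERNELS=("softmax" "jsd" "welford" "kl_div" "layer_norm" "layer_norm-bwd" "rms_norm" "rms_norm-bwd" "cross_entropy" "flash_attention" "gemm" "grouped_gemm")

# kernels_for_shard SHARD NUMSHARDS
# Prints, one per line, every kernel whose index i satisfies
# i % NUMSHARDS == SHARD. SHARD is assumed to be 0-based.
kernels_for_shard() {
  local shard=$1 numshards=$2 i
  for i in "${!KERNELS[@]}"; do
    if (( i % numshards == shard )); then
      printf '%s\n' "${KERNELS[$i]}"
    fi
  done
}

# With 12 shards, each shard receives exactly one of the 12 kernels.
kernels_for_shard 10 12   # prints: gemm
```

With fewer shards than kernels, the same function hands each shard several kernels; for example, `kernels_for_shard 0 6` prints `softmax` and `rms_norm`.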

.github/workflows/benchmark_dispatch.yml

Lines changed: 2 additions & 2 deletions

@@ -26,7 +26,7 @@ jobs:
 if: ${{ github.event.inputs.run_h100 == 'true' || github.event_name == 'schedule' }}
 uses: ./.github/workflows/compute-benchmark-matrix.yml
 with:
-max-runners: 13
+max-runners: 12

 run-h100:
 needs: gen-matrix-h100
@@ -51,7 +51,7 @@ jobs:
 uses: ./.github/workflows/compute-benchmark-matrix.yml
 if: ${{ github.event.inputs.run_b200 == 'true' || github.event_name == 'schedule' }}
 with:
-max-runners: 13
+max-runners: 12

 run-b200:
 needs: gen-matrix-b200
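
The old `KERNELS` list has 13 entries and the new one has 12, which lines up with `max-runners` dropping from 13 to 12 for both the H100 and B200 matrices. That the two values are meant to stay equal is an inference from this diff, not something the commit states. Under that assumption, a small guard like the sketch below could catch a future mismatch; `check_runner_count` and `MAX_RUNNERS` are hypothetical names, not part of the repository.

```bash
#!/usr/bin/env bash
# Hypothetical sketch: verify that max-runners matches the kernel count.
# Assumes one runner per kernel, which is inferred from this diff only.

# Kernel list as it stands after this commit.
KERNELS=("softmax" "jsd" "welford" "kl_div" "layer_norm" "layer_norm-bwd" "rms_norm" "rms_norm-bwd" "cross_entropy" "flash_attention" "gemm" "grouped_gemm")

# Value set in benchmark_dispatch.yml by this commit.
MAX_RUNNERS=12

# check_runner_count MAX_RUNNERS NUM_KERNELS
# Returns 0 and prints "ok" when the two numbers agree; otherwise
# prints a message to stderr and returns 1.
check_runner_count() {
  local max_runners=$1 num_kernels=$2
  if (( max_runners == num_kernels )); then
    echo "ok: ${num_kernels} kernels, ${max_runners} runners"
  else
    echo "mismatch: ${num_kernels} kernels, ${max_runners} runners" >&2
    return 1
  fi
}

check_runner_count "$MAX_RUNNERS" "${#KERNELS[@]}"
```

Running the check with the pre-commit value, `check_runner_count 13 "${#KERNELS[@]}"`, returns a non-zero status because the updated list holds only 12 kernels.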
