Commit df50eb4

do not specialize M
Parent: 3f992aa

1 file changed: +1 -1 lines changed

lmdeploy/pytorch/kernels/cuda/blocked_gemm_fp8.py

Lines changed: 1 addition & 1 deletion
@@ -11,7 +11,7 @@
 logger = get_logger('lmdeploy')
 
 
-@triton.jit
+@triton.jit(do_not_specialize=['M', 'M_out'])
 def _quant_fp8_kernel(
     a_ptr,
     out_ptr,
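
Note on the change: the sketch below is a toy model in plain Python, not Triton's implementation, of why excluding `M` and `M_out` from specialization can reduce recompilation. It assumes that a JIT specializes integer arguments on two properties (value equal to 1, value divisible by 16) and compiles one kernel variant per distinct combination. The helpers `specialization_key` and `count_variants` and the sample `m_values` are hypothetical names and data chosen for illustration.

```python
# Toy model only: this is NOT Triton's implementation. It illustrates why
# marking an argument as "do not specialize" can reduce recompiles.


def specialization_key(args, do_not_specialize=()):
    """Return a hashable key for a hypothetical JIT compile cache.

    Assumption: integer arguments are specialized on two properties,
    "equals 1" and "divisible by 16", unless their name is listed in
    do_not_specialize.
    """
    key = []
    for name, value in args.items():
        if name in do_not_specialize:
            # One generic variant covers every value of this argument.
            key.append((name, "int"))
        else:
            key.append((name, value == 1, value % 16 == 0))
    return tuple(key)


def count_variants(m_values, do_not_specialize=()):
    """Count the distinct cache keys produced by a stream of M values."""
    keys = {
        specialization_key({"M": m, "M_out": m}, do_not_specialize)
        for m in m_values
    }
    return len(keys)


# Hypothetical row counts; M is assumed to change from call to call.
m_values = [1, 7, 16, 33, 64, 128]
specialized = count_variants(m_values)
generic = count_variants(m_values, do_not_specialize=("M", "M_out"))
print(specialized, generic)
```

Under these assumptions the specialized path needs several variants for the same kernel, while the non-specialized path needs only one, at the cost of giving up optimizations that depend on the value of `M`.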
