Move packing from inside MMA kernels #11

shalinib-ibm · 2025-06-09T10:36:54Z

This patch moves calls from packing routines from inside MMA kernel to one step behind.
Current call stack :
matmul->mnpack->gemm->kernel->PackTanspose+MMA instructions Changed call stack:
matmul->mnpack->gemm->PackTranspose->kernel->MMA instrutcions

Not seeing much perf difference with this change

Make sure to read the contributing guidelines before submitting a PR

This patch moves calls from packing routines from inside MMA kernel to one step behind. Current call stack : matmul->mnpack->gemm->kernel->PackTanspose+MMA instructions Changed call stack: matmul->mnpack->gemm->PackTranspose->kernel->MMA instrutcions Not seeing much perf difference with this change Signed-off-by: Shalini Salomi Bodapati <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Move packing from inside MMA kernels #11

Move packing from inside MMA kernels #11

Uh oh!

shalinib-ibm commented Jun 9, 2025

Uh oh!

Uh oh!

Move packing from inside MMA kernels #11

Are you sure you want to change the base?

Move packing from inside MMA kernels #11

Uh oh!

Conversation

shalinib-ibm commented Jun 9, 2025

Uh oh!

Uh oh!