When oneDNN is enabled,  an unoptimized matmul is called by Tensorflow on aarch64

- [The code for calling the matmul of oneDNN in tensorflow.](https://github.com/tensorflow/tensorflow/blob/a6866df48e1aba0996bd99565fa76e79e1875915/tensorflow/core/kernels/mkl/mkl_matmul_op.cc#L146-L177)

- [The code of dnnl_sgemm in oneDNN](https://github.com/oneapi-src/oneDNN/blob/49720e78e6593d4683a68a2deb6eaf5fb08e352f/src/cpu/gemm/gemm.cpp#L103-L146)

From the above code, we can see that if cblas is not enabled, an unoptimized matmul will be called by Tensorflow on aarch64, which will cause performance degradation. So, I think a fully optimized matmul of acl should be added to dnnl_sgemm to make full use of aarch64‘s isa and improve performance of mkl_matmul(a tf op) on aarch64.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

When oneDNN is enabled, an unoptimized matmul is called by Tensorflow on aarch64 #168

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

When oneDNN is enabled, an unoptimized matmul is called by Tensorflow on aarch64 #168

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions