shalinib-ibm
Implement the dot product of two vectors using MMA.
Tested this change with different batch sizes.
batch_size = 4, 8, 16, 32 -> llama-batched-bench gives results similar to base. After batch_size = 64, performance degrades.
llama-bench also gives results similar to base. No perf gain.
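
For readers unfamiliar with the idea, here is a minimal, portable sketch of how a dot product can be expressed through 4x4 outer-product accumulation, which is the kind of operation MMA provides. This is not the code from this PR. The names `rank1_update` and `dot_via_outer_products` are hypothetical, and the helper is a plain C stand-in for a hardware rank-1 update rather than a real MMA builtin. It assumes float inputs whose length is a multiple of 4.

```c
#include <assert.h>
#include <stddef.h>

#define TILE 4

/* Hypothetical portable stand-in for one MMA-style rank-1 update:
 * acc += a (outer product) b, where a and b each hold TILE floats. */
static void rank1_update(float acc[TILE][TILE], const float *a, const float *b) {
    for (int i = 0; i < TILE; i++) {
        for (int j = 0; j < TILE; j++) {
            acc[i][j] += a[i] * b[j];
        }
    }
}

/* Dot product of x and y (n must be a multiple of TILE).
 * Each chunk of TILE elements contributes one outer product to the
 * accumulator; the diagonal entry acc[i][i] collects x[k+i] * y[k+i],
 * so the trace of the accumulator equals the dot product. */
float dot_via_outer_products(const float *x, const float *y, size_t n) {
    assert(n % TILE == 0);

    float acc[TILE][TILE] = {{0}};
    for (size_t k = 0; k < n; k += TILE) {
        rank1_update(acc, x + k, y + k);
    }

    float sum = 0.0f;
    for (int i = 0; i < TILE; i++) {
        sum += acc[i][i];
    }
    return sum;
}
```

In this sketch only the 4 diagonal entries of the 16-entry accumulator contribute to the result, so most of the multiply work is discarded. That may be one reason a formulation like this shows little gain over a plain vectorized dot product, though the PR itself does not state the cause.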



Signed-off-by: Shalini Salomi Bodapati <[email protected]>