[Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention #3690

rohan-tan-bhowmik · 2024-09-06T20:54:04Z

Enabled the mask and is_causal parameters for torch.aten.scaled_dot_product_attention, and added the relevant comments and tests.

The tests added highlight the new capabilities introduced in this PR, including:

Attention with F16 mask
Attention with Boolean mask
Causal attention with same Q K V shapes
Causal attention with different Q K V shapes

Also added a check so that mask and is_causal cannot both be supplied.
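For reviewers who want the intended semantics in one place, here is a minimal pure-Python sketch of single-head attention with the three mask paths. It is not the TMTensor lowering from this PR. It assumes the conventions documented for PyTorch's `scaled_dot_product_attention`: a boolean mask keeps positions marked `True`, a float mask is added to the scaled scores, and `is_causal` uses a top-left aligned lower-triangular mask, which is what makes the case with different Q and K lengths well defined. The function name `sdpa_reference` and the nested-list layout are hypothetical, chosen only for illustration.

```python
import math


def sdpa_reference(q, k, v, mask=None, is_causal=False):
    """Single-head attention on nested lists (illustrative sketch only).

    q is L x E, k is S x E, v is S x Ev; mask, if given, is L x S and holds
    either bools (True = attend) or floats (added to the scaled scores).
    """
    # Assumed behavior: mask and is_causal are mutually exclusive.
    if mask is not None and is_causal:
        raise ValueError("pass either mask or is_causal, not both")

    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for i, q_row in enumerate(q):
        scores = []
        for j, k_row in enumerate(k):
            s = scale * sum(a * b for a, b in zip(q_row, k_row))
            if is_causal:
                # Top-left aligned causal mask: query i sees keys 0..i.
                if j > i:
                    s = -math.inf
            elif mask is not None:
                m = mask[i][j]
                if isinstance(m, bool):
                    s = s if m else -math.inf  # boolean mask
                else:
                    s += m  # additive float mask
            scores.append(s)

        # Numerically stable softmax over the key axis.
        # Note: a row that masks out every key yields NaN here.
        peak = max(scores)
        weights = [math.exp(s - peak) for s in scores]
        total = sum(weights)
        out.append(
            [
                sum(w * v[j][c] for j, w in enumerate(weights)) / total
                for c in range(len(v[0]))
            ]
        )
    return out
```

With this sketch, a boolean lower-triangular mask, the equivalent float mask built from `0.0` and `-math.inf`, and `is_causal=True` all produce the same output, which mirrors the four test cases listed above.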

rsuderman · 2024-09-09T20:06:47Z

You still need to add the passing SDPA ops to the StableHLO tests.

projects/pt1/e2e_testing/xfail_sets.py

rohan-tan-bhowmik and others added 8 commits September 5, 2024 14:36

[WIP] Initial implementation of attention masking in torch.ops.aten.scaled_dot_product

07de1d0

(WIP) Added causal and boolean masking

05b1065

(WIP) Added causal and boolean masking

dd290bf

(WIP) Added checks and tests

a29fce4

(WIP) Added checks and tests

263b380

Clean up and robustness work

8983210

Formatting

2a112ec

Merge branch 'llvm:main' into sdpa_mask

f6e873e

rohan-tan-bhowmik requested review from raikonenfnu, rsuderman and vivekkhandelwal1 September 6, 2024 20:54

raikonenfnu mentioned this pull request Sep 6, 2024

[LinalgExt] Masked Attention Implementation iree-org/iree#18461

Closed

rohan-tan-bhowmik added 5 commits September 6, 2024 15:22

Restored llvm-project

9d6f70d

Merge branch 'sdpa_mask' of https://github.com/rohan-tan-bhowmik/torch-mlir into sdpa_mask

0e41ca3

Restored llvm-project

f067dd0

XFAIL on stable version, PASS on nightly version

f70166d

XFAIL on stable version, PASS on nightly version

84cc5cb

raikonenfnu reviewed Sep 6, 2024

projects/pt1/e2e_testing/xfail_sets.py (outdated, resolved)

rohan-tan-bhowmik and others added 7 commits September 6, 2024 16:49

XFAIL on stable version, PASS on nightly version

1c0f138

XFAIL on stable version, PASS on nightly version

fd981c4

XFAIL on stable version, PASS on nightly version

ebd4c0c

Merge branch 'llvm:main' into sdpa_mask

daeff4a

XFAIL on stable version, PASS on nightly version

3c269d4

Merge branch 'sdpa_mask' of https://github.com/rohan-tan-bhowmik/torch-mlir into sdpa_mask

0ea7cba

Merge branch 'llvm:main' into sdpa_mask

67b318f

rsuderman requested changes Sep 9, 2024

projects/pt1/e2e_testing/xfail_sets.py (outdated, resolved)

lib/Dialect/TMTensor/IR/TMTensorOps.cpp (resolved)

lib/Dialect/TMTensor/IR/TMTensorOps.cpp (outdated, resolved)

lib/Dialect/TMTensor/IR/TMTensorOps.cpp (outdated, resolved)

rohan-tan-bhowmik added 4 commits September 9, 2024 11:17

XFAIL on stable version, PASS on nightly version

a743b2f

Merge branch 'sdpa_mask' of https://github.com/rohan-tan-bhowmik/torch-mlir into sdpa_mask

2cab3c0

Review changes

fd94911

Review changes

75f1240

Review changes

d01f743

rohan-tan-bhowmik added 6 commits September 9, 2024 13:22

XFAIL Changes

f78f878

XFAIL Changes

fc41aa3

XFAIL Changes

04f9e4d

XFAIL Changes

bf88ec5

XFAIL Changes

1a9466e

XFAIL Changes

421b74e

rsuderman requested changes Sep 9, 2024

projects/pt1/e2e_testing/xfail_sets.py (outdated, resolved)

rsuderman approved these changes Sep 9, 2024

rsuderman merged commit e86f56b into llvm:main Sep 9, 2024
3 checks passed

rohan-tan-bhowmik deleted the sdpa_mask branch September 21, 2024 09:22
