在scaled_dot_product_attention函数中，改为如果输入是3D，输出也是3D #601

Qin-sx · 2025-07-03T09:49:04Z

PR Docs

PR APIs

scaled_dot_product_attention

新增参数

函数	PaConvert	Docs
KaimingNormal	#594	PaddlePaddle/docs#7324
SmoothL1Loss	#593	PaddlePaddle/docs#7323
Upsample	#567	PaddlePaddle/docs#7205
Module.eval	#597	PaddlePaddle/docs#7330

优化功能

函数	PaConvert	Docs
scaled_dot_product_attention 3D输入	#601	PaddlePaddle/docs#7353
scaled_dot_product_attention bool mask	#586	PaddlePaddle/docs#7318
signal.stft	#598	PaddlePaddle/docs#7329
unique	#569	PaddlePaddle/docs#7206
atleast	-	-
to_tensor	-	-

解决bug

LayerList.insert, bernoulli, Tensor.data, LSTMCell, fused_rms_norm, softmax

atleast和to_tensor和解决bug中的函数不需要修改PaConvert和Docs

modified: paconvert/api_matcher.py modified: tests/test_scaled_dot_product_attention.py

zhwesky2010

LGTM

modified: tests/test_scaled_dot_product_attention.py

zhwesky2010 · 2025-07-28T09:02:04Z

tests/test_scaled_dot_product_attention.py

+        import torch
+        np.random.seed(100)
+        x = np.random.rand(8, 128, 64)
+        query = torch.tensor(x, dtype=torch.float16)


这个在CPU下目前还不支持bfloat16。所以跑不了。

zhwesky2010

这一块由于 scaled_dot_product_attention 目前后端选择比较混乱。

可能很容易存在选择了不合理的后端导致无法通过。想要通过可能需要仔细设计下case，更合理的方式是优化scaled_dot_product_attention后端选择逻辑，避免总是选择不合理的后端。

Qin-sx · 2025-07-28T15:35:18Z

这一块由于 scaled_dot_product_attention 目前后端选择比较混乱。

可能很容易存在选择了不合理的后端导致无法通过。想要通过可能需要仔细设计下case，更合理的方式是优化scaled_dot_product_attention后端选择逻辑，避免总是选择不合理的后端。

嗯，收到

zhwesky2010 · 2025-07-29T04:00:40Z

这一块由于 scaled_dot_product_attention 目前后端选择比较混乱。
可能很容易存在选择了不合理的后端导致无法通过。想要通过可能需要仔细设计下case，更合理的方式是优化scaled_dot_product_attention后端选择逻辑，避免总是选择不合理的后端。

嗯，收到

先修改下case吧，看看怎么能测到这个3D的功能，同时又能避开后端选择错误的问题。先把这个PR合入进去。

modified: tests/test_scaled_dot_product_attention.py

Qin-sx · 2025-07-30T12:39:21Z

在case10中暂时考虑了mask
PaddlePaddle/Paddle#74319

modified for 3D input and outpupt

00171ae

modified: paconvert/api_matcher.py modified: tests/test_scaled_dot_product_attention.py

paddle-bot bot added the contributor External developers label Jul 3, 2025

zhwesky2010 approved these changes Jul 23, 2025

View reviewed changes

PaddlePaddle locked and limited conversation to collaborators Jul 23, 2025

PaddlePaddle unlocked this conversation Jul 23, 2025

changed float32 to float16

5faf671

modified: tests/test_scaled_dot_product_attention.py

zhwesky2010 reviewed Jul 28, 2025

View reviewed changes

added mask in case10 temporarily

3ed655c

modified: tests/test_scaled_dot_product_attention.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

在scaled_dot_product_attention函数中，改为如果输入是3D，输出也是3D #601

在scaled_dot_product_attention函数中，改为如果输入是3D，输出也是3D #601

Uh oh!

Qin-sx commented Jul 3, 2025 •

edited

Loading

Uh oh!

zhwesky2010 left a comment

Uh oh!

zhwesky2010 Jul 28, 2025

Uh oh!

zhwesky2010 left a comment •

edited

Loading

Uh oh!

Qin-sx commented Jul 28, 2025

Uh oh!

zhwesky2010 commented Jul 29, 2025

Uh oh!

Qin-sx commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

在scaled_dot_product_attention函数中，改为如果输入是3D，输出也是3D #601

Are you sure you want to change the base?

在scaled_dot_product_attention函数中，改为如果输入是3D，输出也是3D #601

Uh oh!

Conversation

Qin-sx commented Jul 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Docs

PR APIs

Uh oh!

zhwesky2010 left a comment

Choose a reason for hiding this comment

Uh oh!

zhwesky2010 Jul 28, 2025

Choose a reason for hiding this comment

Uh oh!

zhwesky2010 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Qin-sx commented Jul 28, 2025

Uh oh!

zhwesky2010 commented Jul 29, 2025

Uh oh!

Qin-sx commented Jul 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Qin-sx commented Jul 3, 2025 •

edited

Loading

zhwesky2010 left a comment •

edited

Loading