reverting attn exp calculations to before 3n by ncylich · Pull Request #511 · cactus-compute/cactus

ncylich · 2026-03-09T04:43:38Z

With this change there are no remaining diffs between this branch's kernel_attention.cpp and the version from the commit just before 3n:
d5cd5c0...d0ad24a

Signed-off-by: Noah Cylich <noahcylich@gmail.com>

Copilot

Pull request overview

This PR reverts the vectorized exp/exp2 approximation used inside the FP16 attention softmax path back to the pre-“3n” polynomial form, aligning kernel_attention.cpp with the earlier implementation referenced in the PR description.

Changes:

Replaced the higher-order fused-multiply-add polynomial for the fractional exponent with a 2nd-order Taylor approximation using ln(2) and ln(2)^2/2.
Applied the same change in both the Apple Accelerate-based attention path and the general NEON vectorized attention path.
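For reference, the 2nd-order form described above can be sketched in scalar C++. This is a minimal illustration, not the code from kernel_attention.cpp: the function name `exp_taylor2`, the clamp bounds, and the round-to-nearest range reduction are all assumptions, and the real kernel operates on NEON vectors rather than scalars. The idea is to write exp(x) = 2^n * 2^f and approximate 2^f with 1 + f*ln(2) + f^2*ln(2)^2/2.

```cpp
#include <cmath>

// Hypothetical scalar model of a range-reduced exp approximation.
// Names, clamp bounds, and the rounding mode are illustrative assumptions,
// not taken from the cactus source.
inline float exp_taylor2(float x) {
    constexpr float kLog2e     = 1.44269504f;  // 1 / ln(2)
    constexpr float kLn2       = 0.69314718f;  // ln(2)
    constexpr float kHalfLn2Sq = 0.24022651f;  // ln(2)^2 / 2

    // Clamp so that 2^n stays within float range (assumed bounds).
    x = std::fmax(-87.0f, std::fmin(87.0f, x));

    // exp(x) = 2^t with t = x * log2(e); split t into integer n and fraction f.
    const float t = x * kLog2e;
    const float n = std::round(t);   // nearest integer, so f lies in [-0.5, 0.5]
    const float f = t - n;

    // 2nd-order Taylor expansion of 2^f around f = 0, in Horner form:
    // 2^f ~= 1 + f*ln(2) + f^2 * ln(2)^2 / 2
    const float p = 1.0f + f * (kLn2 + f * kHalfLn2Sq);

    // Scale by 2^n.
    return std::ldexp(p, static_cast<int>(n));
}
```

Under these assumptions the worst-case relative error is on the order of 1%, reached near f = ±0.5. A higher-order polynomial would tighten that at the cost of a few more fused multiply-adds per lane.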


reverting attn exp calculations to before 3n

d0ad24a

Copilot AI review requested due to automatic review settings March 9, 2026 04:43

Copilot started reviewing on behalf of ncylich March 9, 2026 04:44

Copilot AI reviewed Mar 9, 2026

ncylich wants to merge 1 commit into main from revert-attn
