Skip to content

Conversation

danbev
Copy link
Member

@danbev danbev commented Oct 1, 2025

This commit disables flash-attention for the Java binding test so that the testFullTranscribe test passes.

Without this change the test was failing because the expected output mismatches after the flash-attention change:

<And so my fellow Americans ask not what your country can do for you ask what you can do for your country.>
but was:
<and so my fellow Americans ask not what your country can do for you ask what you can do for your country>

An alternative would also be to update the expected output but it felt better to keep the same expected output and disable flash-attention and not just change the expected output to match the new behavior.

This commit disables flash-attention for the Java binding test so that
the testFullTranscribe test passes.

Without this change the test was failing because the expected output
mismatches after the flash-attention change:
```console
<And so my fellow Americans ask not what your country can do for you ask what you can do for your country.>
but was:
<and so my fellow Americans ask not what your country can do for you ask what you can do for your country>
```

An alternative would also be to update the expected output but it felt
better to keep the same expected output and disable flash-attention and
not just change the expected output to match the new behavior.
@danbev
Copy link
Member Author

danbev commented Oct 1, 2025

The Ruby CI failure should be addressed by #3447.

@danbev danbev requested a review from ggerganov October 1, 2025 06:29
@danbev danbev merged commit 2a56869 into ggml-org:master Oct 1, 2025
65 of 66 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants