-
Notifications
You must be signed in to change notification settings - Fork 19
Set BUILDKITE_CANCEL_SIGNAL to SIGQUIT for build and test #366
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
5131f92 to
8f01134
Compare
|
Observe here that I intentionally canceled this build, so that it would print SIGQUIT (which it successfully did). Though looks like no upload task then was able to get the core files (the timeout for uploading the core files after cancellation occurs is set to 10 seconds by default, unless changed in the worker configuration) |
f0e4bc8 to
94cdb9d
Compare
This comment was marked as outdated.
This comment was marked as outdated.
e6713e5 to
67d5efc
Compare
|
Looks to me like it's still not working. |
|
Does it need an updated copy of the sandbox binary with JuliaContainerization/Sandbox.jl@269e91f? |
This seems to indicate the running version is quite old? |
e671766 to
3f8451c
Compare
Indeed, there were more layers of sandboxing than I remembered :P |
|
well that is entertaining on musl: |
|
Aha, my extra debugging information coming in handy! Here's a fix for that minor issue: JuliaCI/rootfs-images#252 |
|
I tried canceling one of the test jobs, and it looks to me like the new sandbox executable segfaulted: https://buildkite.com/julialang/julia-buildkite/builds/1577#01904c26-78c2-4120-bdcc-d01a439b5bfa/1373-1378 |
|
It seems more likely to me that the child segfaulted and then the sandbox killed itself with the same exit code. Seems like the SIGQUIT got propagated though, which looked pretty good. Not sure why it then segfaulted and did not coredump, but maybe that is a quirk of rr? Or is the ulimit -c potentially disabled? |
julia-buildkite/utilities/test_julia.sh Line 141 in dea1b33
|
|
But this is https://github.com/JuliaCI/julia-buildkite/blob/dea1b33fba992d460ffae69bce54f598330fb0d3/utilities/build_julia.sh, so is that yes? |
|
The segfault I linked was a test job. |
|
Right, but it was an rr job too |
|
Right, so for |
|
Would it hurt to just upload core dumps unconditionally (for both |
|
|
It's just wasted compute time and storage space. There's really no information in a coredump that an |
|
@staticfloat can you change DROPME: Use sandbox#main to use |
53b4920 to
ee38857
Compare
|
Looks like sandbox needs that branch pushed to correspond to the release tags? And a rebase here |
b2a61b6 to
8423537
Compare
8423537 to
8e93d18
Compare
8e93d18 to
e78acba
Compare
|
Okay, looks very close, but didn't get enough time to upload the coredump reports for the exact failing test we hope this PR will help with. But it did generate them when canceling (though didn't generate them when killing due to timeout). This will likely require JuliaCI/coreupload-buildkite-plugin#14 and fixing the JuliaCI/coreupload branch name in this PR |
4c09738 to
a5c6431
Compare
a5c6431 to
2948146
Compare
|
Needs resigning, and merge? |
2948146 to
b1e8881
Compare
This should get us `zstd`, for core uploading
da63e14 to
d9e925b
Compare
|
Assuming I updated correctly the SHA hash for the new rootfs versions, this may be still ready to re-sign and merge? |
No description provided.