-
Notifications
You must be signed in to change notification settings - Fork 4.3k
Open
Labels
P2We'll consider working on this in future. (Assignee optional)We'll consider working on this in future. (Assignee optional)team-Remote-ExecIssues and PRs for the Execution (Remote) teamIssues and PRs for the Execution (Remote) teamtype: bug
Description
Description of the bug:
Very rarely, Bazel crashes with error messages like these:
14:39:02 211 FATAL: bazel crashed due to an internal error. Printing stack trace:
14:39:02 212 java.lang.RuntimeException: Unrecoverable error while evaluating node 'ActionLookupData0{actionLookupKey=ConfiguredTargetKey{label=//atlassian-embed-repo/atlassian-embed/src/main/kotlin/com/lucidchart/atlassian/model/lucid/documentembed:documentembed, config=BuildConfigurationKey[9e0af580f2bac6c86c12e854872b9f6763fe92a00d331709dbc4890bd7b17228]}, actionIndex=0}' (requested by nodes 'ActionLookupData1{actionLookupKey=ConfiguredTargetKey{label=//atlassian-embed-repo/atlassian-embed/src/main/kotlin/com/lucidchart/atlassian/model/lucid/documentembed:documentembed, config=BuildConfigurationKey[9e0af580f2bac6c86c12e854872b9f6763fe92a00d331709dbc4890bd7b17228]}, actionIndex=1}', 'ActionLookupData3{actionLookupKey=ConfiguredTargetKey{label=//atlassian-embed-repo/atlassian-embed/src/main/kotlin/com/lucidchart/atlassian/model/lucid/documentembed:documentembed, config=BuildConfigurationKey[9e0af580f2bac6c86c12e854872b9f6763fe92a00d331709dbc4890bd7b17228]}, actionIndex=3}')
14:39:02 213 at com.google.devtools.build.skyframe.AbstractParallelEvaluator$Evaluate.run(AbstractParallelEvaluator.java:547)
14:39:02 214 at com.google.devtools.build.lib.concurrent.AbstractQueueVisitor$WrappedRunnable.run(AbstractQueueVisitor.java:435)
14:39:02 215 at java.base/java.util.concurrent.ForkJoinTask$RunnableExecuteAction.compute(Unknown Source)
14:39:02 216 at java.base/java.util.concurrent.ForkJoinTask$RunnableExecuteAction.compute(Unknown Source)
14:39:02 217 at java.base/java.util.concurrent.ForkJoinTask$InterruptibleTask.exec(Unknown Source)
14:39:02 218 at java.base/java.util.concurrent.ForkJoinTask.doExec(Unknown Source)
14:39:02 219 at java.base/java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(Unknown Source)
14:39:02 220 at java.base/java.util.concurrent.ForkJoinPool.runWorker(Unknown Source)
14:39:02 221 at java.base/java.util.concurrent.ForkJoinWorkerThread.run(Unknown Source)
14:39:02 222 Caused by: java.lang.NullPointerException: ActionInput not found for filename bazel-out/k8-fastbuild/bin/atlassian-embed-repo/atlassian-embed/src/main/kotlin/com/lucidchart/atlassian/core/lucidproduct/lucidproduct.jar in CacheNotFoundException
14:39:02 223 at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:1008)
14:39:02 224 at com.google.devtools.build.lib.remote.common.BulkTransferException.getLostInputs(BulkTransferException.java:92)
14:39:02 225 at com.google.devtools.build.lib.remote.RemoteSpawnRunner.execLocallyAndUploadOrFail(RemoteSpawnRunner.java:591)
14:39:02 226 at com.google.devtools.build.lib.remote.RemoteSpawnRunner.exec(RemoteSpawnRunner.java:346)
14:39:02 227 at com.google.devtools.build.lib.exec.AbstractSpawnStrategy.exec(AbstractSpawnStrategy.java:161)
14:39:02 228 at com.google.devtools.build.lib.dynamic.RemoteBranch.runRemotely(RemoteBranch.java:89)
14:39:02 229 at com.google.devtools.build.lib.dynamic.RemoteBranch.callImpl(RemoteBranch.java:153)
14:39:02 230 at com.google.devtools.build.lib.dynamic.Branch.call(Branch.java:188)
14:39:02 231 at com.google.devtools.build.lib.dynamic.Branch.call(Branch.java:43)
14:39:02 232 at com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListenableFutureTask.java:128)
14:39:02 233 at com.google.common.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:74)
14:39:02 234 at com.google.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:80)
14:39:02 235 at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
14:39:02 236 at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
14:39:02 237 at java.base/java.lang.Thread.run(Unknown Source)
Here are a few more instances of the error occurring:
error2.txt
error3.txt
error4.txt
These errors are happening across different mnemonics, so we suspect it's a bug in Bazel or our remote execution software (Buildfarm), but we're not sure which.
We have dynamic execution enabled, in case that's relevant.
Which category does this issue belong to?
Remote Execution
What's the simplest, easiest way to reproduce this bug? Please provide a minimal example if possible.
Because the error occurs so rarely, it's very difficult to create a minimum reproducability case.
Which operating system are you running Bazel on?
Ubuntu 24.04.3
What is the output of bazel info release?
release 8.4.0
If bazel info release returns development version or (@non-git), tell us how you built Bazel.
No response
What's the output of git remote get-url origin; git rev-parse HEAD ?
If this is a regression, please try to identify the Bazel commit where the bug was introduced with bazelisk --bisect.
No response
Have you found anything relevant by searching the web?
We've started a discussion in the Bazel Slack workspace:
https://bazelbuild.slack.com/archives/CD4MDG09Z/p1760125221906699
Any other information, logs, or outputs that you want to share?
No response
Metadata
Metadata
Assignees
Labels
P2We'll consider working on this in future. (Assignee optional)We'll consider working on this in future. (Assignee optional)team-Remote-ExecIssues and PRs for the Execution (Remote) teamIssues and PRs for the Execution (Remote) teamtype: bug