
--experimental_remote_cache_compression causes 3-5x higher Bazel server heap usage #18997

@jfirebaugh


Description of the bug:

Certain actions, possibly those that trigger a high number of remote cache CAS hits, lead to excessively high Bazel server memory use when --experimental_remote_cache_compression is enabled. A build with that flag can use 3-5x as much JVM heap as the same build without it.

What's the simplest, easiest way to reproduce this bug? Please provide a minimal example if possible.

I've minimized a repro here: https://github.com/jfirebaugh/bazel_remote_cache_compression

Check out that repository, add a .bazelrc with appropriate remote cache configuration, and then compare the output of:

  1. bazel clean && bazel shutdown && bazel build --memory_profile=memprof :binary && grep 'Build artifacts:heap:used' memprof
  2. bazel clean && bazel shutdown && bazel build --experimental_remote_cache_compression --memory_profile=memprof :binary && grep 'Build artifacts:heap:used' memprof
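For reference, the kind of .bazelrc the repro expects might look like the sketch below; the cache endpoint is a placeholder, so substitute whatever remote cache you actually use:

```
# Hypothetical remote cache configuration; replace the endpoint with your own.
build --remote_cache=grpcs://your-cache.example.com
build --remote_upload_local_results
```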

Which operating system are you running Bazel on?

macOS

What is the output of bazel info release?

release 6.2.1

If bazel info release returns development version or (@non-git), tell us how you built Bazel.

No response

What's the output of git remote get-url origin; git rev-parse master; git rev-parse HEAD ?

[email protected]:jfirebaugh/bazel_remote_cache_compression.git
master
fatal: ambiguous argument 'master': unknown revision or path not in the working tree.
Use '--' to separate paths from revisions, like this:
'git <command> [<revision>...] -- [<file>...]'
fb76727c785f013541bb150120f2fecf14e79c58

Is this a regression? If yes, please try to identify the Bazel commit where the bug was introduced.

No response

Have you found anything relevant by searching the web?

High memory use with --experimental_remote_cache_compression was reported by another user on Bazel slack: https://bazelbuild.slack.com/archives/CA31HN1T3/p1646921767090359?thread_ts=1646911448.469939&cid=CA31HN1T3

Any other information, logs, or outputs that you want to share?

I have done some initial investigation using JFR memory profiling, and it looks like one possible cause is the following:

Stack Trace	Count	Percentage
void java.nio.HeapByteBuffer.<init>(int, int, MemorySegmentProxy)	504	34.4 %
  ByteBuffer java.nio.ByteBuffer.allocate(int)	504	34.4 %
    ByteBuffer com.github.luben.zstd.NoPool.get(int)	471	32.2 %
      void com.github.luben.zstd.ZstdInputStreamNoFinalizer.<init>(InputStream, BufferPool)	471	32.2 %
        void com.github.luben.zstd.ZstdInputStreamNoFinalizer.<init>(InputStream)	471	32.2 %
          void com.google.devtools.build.lib.remote.zstd.ZstdDecompressingOutputStream.<init>(OutputStream)	471	32.2 %
            ListenableFuture com.google.devtools.build.lib.remote.GrpcCacheClient.requestRead(RemoteActionExecutionContext, RemoteRetrier$ProgressiveBackoff, Digest, CountingOutputStream, Supplier, Channel)	471	32.2 %

requestRead allocates ZstdDecompressingOutputStream, which allocates ZstdInputStreamNoFinalizer, which uses NoPool to allocate via ByteBuffer.allocate. It appears the size allocated here is:

https://github.com/luben/zstd-jni/blob/e07f8970be0c72ce02dcdf1877daa034208915d0/src/main/native/decompress/zstd_decompress.c#L1668

If I've calculated correctly, that works out to 131 kB, and I assume one such buffer is allocated for every in-flight cache read.
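As a sanity check on the 131 kB figure: zstd's recommended streaming input buffer size is ZSTD_BLOCKSIZE_MAX (128 KiB) plus a 3-byte block header, i.e. 131075 bytes, which matches. The sketch below does that arithmetic and a back-of-envelope aggregate; the in-flight read count is a made-up illustration, not a measured value:

```java
public class ZstdBufferEstimate {
    // zstd's recommended streaming input buffer: ZSTD_BLOCKSIZE_MAX + block header.
    static final int ZSTD_BLOCKSIZE_MAX = 128 * 1024; // 131072 bytes
    static final int ZSTD_BLOCK_HEADER_SIZE = 3;

    // Heap allocated per decompressing stream (per in-flight cache read).
    static long perStreamBytes() {
        return ZSTD_BLOCKSIZE_MAX + ZSTD_BLOCK_HEADER_SIZE; // 131075 ≈ 131 kB
    }

    public static void main(String[] args) {
        long inFlightReads = 500; // hypothetical concurrency, for illustration only
        System.out.println(perStreamBytes());                 // 131075
        System.out.println(perStreamBytes() * inFlightReads); // 65537500 ≈ 65.5 MB
    }
}
```

Note also that the ZstdInputStreamNoFinalizer(InputStream, BufferPool) constructor visible in the stack trace suggests zstd-jni can be handed a pooling BufferPool (e.g. its RecyclingBufferPool) instead of NoPool, though I have not tested whether that would help here.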

Metadata

Assignees

No one assigned

Labels

  - P2: We'll consider working on this in future. (Assignee optional)
  - help wanted: Someone outside the Bazel team could own this
  - team-Remote-Exec: Issues and PRs for the Execution (Remote) team
  - type: bug
