Temporarily disable arm build #296

yakutovicha · 2025-05-05T16:23:16Z

No description provided.

edoardob90 · 2025-05-05T20:59:35Z

@yakutovicha This seems to work on Renku. Pausing/resuming the session as well. It seems that there're problems with multi-arch images, maybe something related to how K8s is set up?

I added the annotations according to this page: https://docs.docker.com/build/exporters/image-registry/

rokroskar · 2025-05-06T08:00:40Z

Hi, I'm from the Renku team 👋

Great that you're able to get your image to work! However, I noticed it's rather large, it looks like over 11GB which means that sessions take a really long time to start. Is there any way you can reduce the image size? Are you adding data to the image or is it purely python packages?

rokroskar · 2025-05-06T08:09:13Z

btw I believe to make multiarch images work you need to disable the provenance flag, e.g. like we do here

yakutovicha · 2025-05-06T08:11:57Z

Hey @rokroskar, thanks for reaching out!

Great that you're able to get your image to work!

The image doesn't work on my account, unfortunately. It remains like this and never proceeds further (not sure why 🤷 ) :

However, I noticed it's rather large, it looks like over 11GB which means that sessions take a really long time to start.

On my PC the image is 3.47 GB, see below:

I feel a bit stuck here, any help would be greatly appreciated 🙏

rokroskar · 2025-05-06T08:19:09Z

I see this in our logs

Successfully pulled image "ghcr.io/empa-scientific-it/python-tutorial:35139ae5b38f" in 560ms (560ms including waiting). Image size: 11587224791 bytes.

Your docker client might be reporting the compressed size? not sure.

Unfortunately very large images can take a really long time to download and to start. I see torch installed - this can easily lead to very large images, especially if it is installed potentially multiple times. Could this be part of what is causing the large size?

edoardob90 · 2025-05-06T08:21:54Z

Hi, I'm from the Renku team 👋

Great that you're able to get your image to work! However, I noticed it's rather large, it looks like over 11GB which means that sessions take a really long time to start. Is there any way you can reduce the image size? Are you adding data to the image or is it purely python packages?

Hi @rokroskar! We're not even adding the content of the repository in the image. The base image is quay.io/jupyter/minimal-notebook:latest (which is about 1.6 GB locally), and then we're simply adding some dependencies via apt, plus updating the base environment. I suspect it's that step that bloats the image somehow, but I don't know it could get that big.

Is there any way to know about the image's actual size Renku is going to pull?

edoardob90 · 2025-05-06T08:24:25Z

I see this in our logs
Successfully pulled image "ghcr.io/empa-scientific-it/python-tutorial:35139ae5b38f" in 560ms (560ms including waiting). Image size: 11587224791 bytes.
Your docker client might be reporting the compressed size? not sure.

Unfortunately very large images can take a really long time to download and to start. I see torch installed - this can easily lead to very large images, especially if it is installed potentially multiple times. Could this be part of what is causing the large size?

Ah, well, PyTorch might easily be the responsible here. Honestly, I don't know a workaround other than trying to use another base image (or a combination?), where PT is already installed and maybe optimized

rokroskar · 2025-05-06T08:25:40Z

I was more wondering if the different torch-enabled packages are maybe adding their own versions of cuda? That is what contributes to torch bloat.

yakutovicha · 2025-05-06T08:36:04Z

I see this in our logs
Successfully pulled image "ghcr.io/empa-scientific-it/python-tutorial:35139ae5b38f" in 560ms (560ms including waiting). Image size: 11587224791 bytes.
Your docker client might be reporting the compressed size? not sure.

This image was built with repo2docker, we changed the approach recently. The image from this PR should be much smaller (ghcr.io/empa-scientific-it/python-tutorial:pr-296), but it somehow still fails to start.

olevski · 2025-05-06T08:53:54Z

dive is a tool that can let you analyze docker images - specifically it can tell you how large is each layer in your image. Then you can use that information to optimize. It also has some tests/way to determine how "efficient" the image is, although I am not sure what metrics/heuristics it uses to determine that.

https://github.com/wagoodman/dive

olevski · 2025-05-06T08:59:06Z

@yakutovicha I am one of the renku developers. Can you share the project where you are trying to run the image that is failing? Is that possible? Or if not can you share the session launcher configuration you are using to launch that image?

yakutovicha · 2025-05-06T09:06:18Z

@olevski, thanks, yes sure. The project is https://renkulab.io/v2/projects/empa-scientific-it/empa-it-python-tutorial

Just some heads up, I was wrong about failing. It took about 20 minutes to start for the first time, but then it was working.

Regarding the image size, @edoardob90 will explore the options in a separate PR. Thanks a lot for the suggestions 🙏

rokroskar · 2025-05-06T09:20:00Z

Maybe another data point - I made a fork of this repo and moved the environment.yml file to the root directory - then I used renku to build an image for it automatically. You can see it in the python tutorial launcher here. This makes a slightly smaller image (7GB) and launches fine - the limitation (for now) is that it uses vscode, but this will be changed soon to also support jupyter. Still takes ~5 minutes to launch, but might be a workable option? Feel free to try it out to see if all the notebooks work as expected (I ran a few and they seem ok).

olevski · 2025-05-06T09:42:14Z

@olevski, thanks, yes sure. The project is https://renkulab.io/v2/projects/empa-scientific-it/empa-it-python-tutorial

Ok if the session eventually starts then that is ok. I thought it was misconfigured and it was truly failing. What you describe is definitely the case of the image being really big.

What @rokroskar suggested is a really viable option though. And it would save you some time in building and publishing the image. When you let Renku build the image we also store it in our own image repository which is much faster to access. Also I think ghcr and dockerhub start to throttle you after they see you keep pulling images. This cannot happen in the case where we build and host the image.

Currently when you let Renku build the image for you we only support VSCodium. But in the new release (that is coming out in about a week from now) we will support also Jupyterlab.

edoardob90 · 2025-05-06T09:43:44Z

Maybe another data point - I made a fork of this repo and moved the environment.yml file to the root directory - then I used renku to build an image for it automatically. You can see it in the python tutorial launcher here. This makes a slightly smaller image (7GB) and launches fine - the limitation (for now) is that it uses vscode, but this will be changed soon to also support jupyter. Still takes ~5 minutes to launch, but might be a workable option? Feel free to try it out to see if all the notebooks work as expected (I ran a few and they seem ok).

Only problem with using VS Code: some features for the interactive exercises rely on Jupyter+pytest, and that doesn't work properly when the notebooks are opened directly in VS Code (or Codium).

yakutovicha · 2025-05-06T12:15:39Z

superseded by #298

Temporarily disable arm build.

ee65750

yakutovicha marked this pull request as draft May 5, 2025 16:23

edoardob90 changed the title ~~Temporarily disable arm build.~~ Temporarily disable arm build May 5, 2025

Add annotations via "build-and-push.outputs"

7ce142e

Fix bug with quotes

01605f3

yakutovicha closed this May 6, 2025

yakutovicha deleted the workaround/disable-arm-build branch May 6, 2025 12:15

Temporarily disable arm build #296

Temporarily disable arm build #296

Uh oh!

Conversation

yakutovicha commented May 5, 2025

Uh oh!

edoardob90 commented May 5, 2025

Uh oh!

rokroskar commented May 6, 2025

Uh oh!

rokroskar commented May 6, 2025

Uh oh!

yakutovicha commented May 6, 2025

Uh oh!

rokroskar commented May 6, 2025

Uh oh!

edoardob90 commented May 6, 2025

Uh oh!

edoardob90 commented May 6, 2025

Uh oh!

rokroskar commented May 6, 2025

Uh oh!

yakutovicha commented May 6, 2025

Uh oh!

olevski commented May 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

olevski commented May 6, 2025

Uh oh!

yakutovicha commented May 6, 2025

Uh oh!

rokroskar commented May 6, 2025

Uh oh!

olevski commented May 6, 2025

Uh oh!

edoardob90 commented May 6, 2025

Uh oh!

yakutovicha commented May 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

olevski commented May 6, 2025 •

edited

Loading