feat: Added --build_variant flag for cpu only build. #8329

Sunidhi-Gaonkar1 · 2025-08-05T09:39:22Z

What does the PR do?

Checklist

Commit Type:

Check the conventional commit type
box here and add the label to the github PR.

Related PRs:

Where should the reviewer start?

Test plan:

Used below commands to create triton server docker instance and starting up the triton server:
1.sudo docker run --privileged -it -p8000:8000 --volume /model_repository:/var/models --name triton_server_test tritonserver
2../bin/tritonserver --model-repository=/var/models

Tested the simple identity model using below commands in a different session:
1.curl -X POST localhost:8000/v2/repository/index | jq

curl -X POST localhost:8000/v2/repository/index | jq
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100    58  100    58    0     0  29000      0 --:--:-- --:--:-- --:--:-- 29000
[
  {
    "name": "simple_identity",
    "version": "1",
    "state": "READY"
  }
]

2.curl localhost:8000/v2/models/simple_identity | jq

curl localhost:8000/v2/models/simple_identity | jq
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100   193  100   193    0     0   188k      0 --:--:-- --:--:-- --:--:--  188k
{
  "name": "simple_identity",
  "versions": [
    "1"
  ],
  "platform": "python",
  "inputs": [
    {
      "name": "INPUT0",
      "datatype": "BYTES",
      "shape": [
        -1,
        -1
      ]
    }
  ],
  "outputs": [
    {
      "name": "OUTPUT0",
      "datatype": "BYTES",
      "shape": [
        -1,
        -1
      ]
    }
  ]
}

Caveats:

Background

Hi team,
This PR includes changes made to add Power support to triton-inference-server for python backend. Can you please review/merge this?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: #xxx

mkumatag · 2025-09-18T16:02:32Z

build.py

-    find /opt/tritonserver/python -maxdepth 1 -type f -name \\
-    "tritonfrontend-*.whl" | xargs -I {} pip install --upgrade {}[all]
+RUN if [ "$(uname -m)" = "ppc64le" ]; then \
+        VARIANT="cpu"; \


I feel that the cpu variant could be applicable for other use cases as well, instead of limiting it to the ppc64le architecture. Can we just parameterize this variant and set all as a default value (this way we can safely remove this uname check here)?

Sure, we are working on these changes.

I have made the changes to remove uname check , added a --build_variant flag which is set to all by default.

mkumatag · 2025-09-18T16:03:20Z

build.py

+        && apt-get install -y --no-install-recommends \
+        clang-15 \
+        && ln -s /usr/bin/clang-15 /usr/bin/clang -f \
+        && ln -s /usr/bin/clang++-15 /usr/bin/clang++ -f \
+        && pip3 install cython ninja; \


why are we installing explicitly on ppc64le architecture and not on other architectures?

We were facing linker issue for numpy as it was getting built from source. Hence, need to install clang-15 for ppc64le architecture. For other architectures numpy wheels are available.

I have removed these changes as we are no longer facing the linker issue.

mkumatag

LGTM,

squash the commits and change the PR heading accordingly

alhad-deshpande · 2025-10-14T04:31:59Z

@mc-nv @dmitry-tokarev-nv @whoisj
Can you please review/approve this PR?

whoisj · 2025-11-05T22:27:02Z

These changes make sense to me. @mc-nv to review and determine if the changes are risky or compatible w/ out internal build process or not (I think they should be).

mc-nv · 2025-11-05T22:51:05Z

To me looks like the change is about to select the required wheel package dependencies during installation aka all vs cpu.

I'm struggling to find in any of our pyproject.toml or setup.py CPU requirements that are using optional dependency flag cpu. It's also not a part of this change.
On other side we have flag --enable-gpu which is probably can be reused for same purpose in current context, there why introduction of new flag is questionable.

whoisj · 2025-11-06T19:21:58Z

@Sunidhi-Gaonkar1 please address @mc-nv's concerns. Thank you.

Sunidhi-Gaonkar1 · 2025-11-07T04:22:57Z

Hi @whoisj @mc-nv Sorry for the delayed response.

We tried using enable_gpu flag to build cpu only wheel but it ran into issues due to following reasons:

When we set enable_gpu flag in the build command then all gpu related dependencies like cuda-keyring gets installed using apt command.
When we don't set the enable_gpu flag in the build command then wheels are built for "all" variant but cupy_cuda library gets installed via pip command.

Hence regardless of enable_gpu flag is set or unset it tries loading up some or the other cuda related library.

If any architecture does not have gpu support then this new build_variant flag can be used to enable cpu only build. When build_variant is passed as "cpu" it will not load any cuda related library regardless of whether enable_gpu is set or unset. This creates separate responsibilities for both the flags enable_gpu and build_variant. The default value of the build_variant has been kept as "all".

Please let us know your thoughts.

Sunidhi-Gaonkar1 · 2025-11-07T07:13:48Z

We are also currently facing build failure due to this cuda related library , libnvshmem3-cuda-13, we are working to resolve this issue.

Sunidhi-Gaonkar1 · 2025-11-10T09:40:41Z

Hi @whoisj @mc-nv, we have fixed the libnvshmem3-cuda-13 library issue by adding build_variant check. Please review the PR and let us know if any changes are required.

alhad-deshpande mentioned this pull request Aug 25, 2025

Umbrella Issue: Porting Kubeflow to IBM Power (ppc64le) kubeflow/community#781

Open

54 tasks

mkumatag suggested changes Sep 18, 2025

View reviewed changes

Sunidhi-Gaonkar1 marked this pull request as draft September 23, 2025 04:38

Sunidhi-Gaonkar1 marked this pull request as ready for review October 6, 2025 11:04

mkumatag approved these changes Oct 6, 2025

View reviewed changes

Sunidhi-Gaonkar1 force-pushed the ubuntu-power branch from adabaa2 to ce46097 Compare October 7, 2025 05:31

Sunidhi-Gaonkar1 changed the title ~~feat: Added power support for python backend on ubuntu.~~ feat: Added --build_variant flag for cpu only build. Oct 7, 2025

whoisj requested review from mc-nv and whoisj November 5, 2025 22:27

whoisj added the PR: build Changes that affect the build system or external dependencies label Nov 5, 2025

Sunidhi-Gaonkar1 force-pushed the ubuntu-power branch 2 times, most recently from f6dee1d to 552a0bb Compare November 10, 2025 08:22

Updated build.py to resolve cuda issue.

cb7c462

Sunidhi-Gaonkar1 force-pushed the ubuntu-power branch from 552a0bb to cb7c462 Compare November 11, 2025 06:35

feat: Added --build_variant flag for cpu only build. #8329

Are you sure you want to change the base?

feat: Added --build_variant flag for cpu only build. #8329

Conversation

Sunidhi-Gaonkar1 commented Aug 5, 2025

What does the PR do?

Checklist

Commit Type:

Related PRs:

Where should the reviewer start?

Test plan:

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Uh oh!

mkumatag Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

Sunidhi-Gaonkar1 Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

Sunidhi-Gaonkar1 Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

mkumatag Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

Sunidhi-Gaonkar1 Sep 19, 2025

Choose a reason for hiding this comment

Uh oh!

Sunidhi-Gaonkar1 Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

mkumatag left a comment

Choose a reason for hiding this comment

Uh oh!

alhad-deshpande commented Oct 14, 2025

Uh oh!

whoisj commented Nov 5, 2025

Uh oh!

mc-nv commented Nov 5, 2025

Uh oh!

whoisj commented Nov 6, 2025

Uh oh!

Sunidhi-Gaonkar1 commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Sunidhi-Gaonkar1 commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Sunidhi-Gaonkar1 commented Nov 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

5 participants

Sunidhi-Gaonkar1 commented Nov 7, 2025 •

edited

Loading

Sunidhi-Gaonkar1 commented Nov 7, 2025 •

edited

Loading