[Inference Providers] fix inference with URL endpoints #3041
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Looks good 👍
Let's not forget a test or two to make sure this is definitely fixed (maybe one with `chat_completion` and one with `text_to_image`? Those are hopefully the two main use cases: local TGI and text-to-image on an Inference Endpoint.)
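A minimal sketch of what such a test could look like, using placeholder URLs (the actual tests added in the PR may be structured differently):

```python
import pytest

from huggingface_hub import InferenceClient


@pytest.mark.parametrize(
    "url",
    ["https://my-custom-endpoint.com", "http://localhost:8080"],
)
def test_client_accepts_url_as_model(url):
    # Before this fix, passing a URL as `model` failed during provider
    # resolution, before any request was even sent.
    client = InferenceClient(model=url)
    assert client.model == url
```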
Counterpart of huggingface/huggingface_hub#3041

TL;DR: Specifying `endpointUrl` is currently failing because of this call: https://github.com/huggingface/huggingface.js/blob/d95c30026fb1775c47c5d9cd6a7890d945799a97/packages/inference/src/lib/getInferenceProviderMapping.ts#L122
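In rough Python terms, the fix amounts to short-circuiting provider resolution when the model is already a URL. The helper names below are hypothetical, for illustration only:

```python
def resolve_endpoint(model: str) -> str:
    # Hypothetical helper illustrating the fix: a raw URL must bypass the
    # provider-mapping lookup, which only understands Hub model ids.
    if model.startswith(("http://", "https://")):
        return model  # already an endpoint URL: use it as-is
    return get_inference_provider_mapping(model)


def get_inference_provider_mapping(model_id: str) -> str:
    # Stand-in for the mapping call linked above; not a real API.
    raise NotImplementedError
```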
Thanks! Approving, though I still have a question about always adding the /v1/chat/completions path. OK to merge "as is" since it's aligned with what we do / what we want to do.
```python
pytest.param(
    "https://my-custom-endpoint.com/custom_path",
    "model",
    "https://my-custom-endpoint.com/custom_path/v1/chat/completions",
    "dummy",
    id="client_model_is_url",
),
```
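For illustration, the resolution rule this parametrized case asserts could be sketched as follows (a sketch under assumed naming, not necessarily the exact implementation in the PR):

```python
def build_chat_completion_url(model_url: str) -> str:
    # Append the OpenAI-compatible route to a bare base URL, avoiding
    # duplication when part of the path is already present.
    model_url = model_url.rstrip("/")
    if model_url.endswith("/chat/completions"):
        return model_url
    if model_url.endswith("/v1"):
        return model_url + "/chat/completions"
    return model_url + "/v1/chat/completions"


assert (
    build_chat_completion_url("https://my-custom-endpoint.com/custom_path")
    == "https://my-custom-endpoint.com/custom_path/v1/chat/completions"
)
```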
Are we sure this is the expected behavior? In the past I've drawn a distinction between InferenceClient(model="https://...").chat_completion and InferenceClient(base_url="https://...").chat_completion, but I haven't tested it in a very long time. My reasoning is that if we always add /v1/chat/completions to the URL, we can't support a URL that doesn't expect it (what if "https://..." is already the full URL of the endpoint handling chat completion?).
That said, OK to remove this distinction if it helps with clarity / aligns better with the JS client.
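Concretely, the two constructions being compared (the endpoint URL is a placeholder):

```python
from huggingface_hub import InferenceClient

url = "https://my-custom-endpoint.com/custom_path"  # placeholder endpoint

# Historically these two forms could resolve differently; with this PR both
# are expected to target {url}/v1/chat/completions for chat completion calls.
client_via_model = InferenceClient(model=url)
client_via_base_url = InferenceClient(base_url=url)
```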
I might be mistaken, but I'm not sure we had this distinction in the pre-provider implementation of InferenceClient.
```python
def _resolve_chat_completion_url(self, model: Optional[str] = None) -> str:
```
Whichever you pass (model or base_url), when you call chat_completion you're hitting the /chat/completions route by default (same behavior as the OpenAI client). I agree that we cannot customize this route, which is a bit annoying, but I'm not sure it's worth doing.
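For comparison, the OpenAI Python client behaves the same way: the route suffix is fixed and only the base URL is configurable:

```python
from openai import OpenAI

# Chat requests always go to {base_url}/chat/completions; the route itself
# cannot be customized, only the base URL can.
client = OpenAI(base_url="https://my-custom-endpoint.com/v1", api_key="dummy")
```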
Thanks for checking back. I took a look at #2540, and always adding (/v1)/chat/completions does indeed seem like the way to go (more consistent). Looks like we merged both behaviors even before the "pre-provider" implementation ^^
Co-authored-by: Lucain <[email protected]>
Based on #3041 (comment), all good to merge! 🎉
Let's merge! The failing tests are unrelated (need to check where they come from and fix that in another PR).