Skip to content

Conversation

@oOraph
Copy link

@oOraph oOraph commented Nov 5, 2025

Add the possibility to customize both the scaling metric and threshold when creating or updating an endpoint.

@oOraph oOraph force-pushed the dev/hf_endpoints_autoscaling branch from 45212e7 to 425ad7a Compare November 5, 2025 16:28
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@oOraph oOraph force-pushed the dev/hf_endpoints_autoscaling branch from 425ad7a to 11b044b Compare November 6, 2025 08:36
@Wauplin
Copy link
Contributor

Wauplin commented Nov 6, 2025

Thanks for the PR @oOraph :) @hanouticelina very recently added a CLI for Inference Endpoints, could you add these two parameters to it as well? It's implementated here: https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/cli/inference_endpoints.py

Add the possibility to customize both the scaling metric and threshold when creating or updating an endpoint.

Signed-off-by: Raphael Glon <[email protected]>
@oOraph oOraph force-pushed the dev/hf_endpoints_autoscaling branch from 11b044b to 7c54803 Compare November 10, 2025 13:30
@oOraph
Copy link
Author

oOraph commented Nov 10, 2025

Thanks for the PR @oOraph :) @hanouticelina very recently added a CLI for Inference Endpoints, could you add these two parameters to it as well? It's implementated here: https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/cli/inference_endpoints.py

done :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants