Skip to content

Conversation

@Kanchan-Microsoft
Copy link
Contributor

Purpose

This pull request adds a new documentation file to help users deploy the Content Processing Solution Accelerator when their Azure OpenAI quota is limited. The guide explains how to adjust or bypass quota validation in the deployment process, ensuring users with lower token-per-minute (TPM) quotas can still deploy successfully.

Documentation additions and deployment guidance:

  • Added docs/DeployWithLimitedQuota.md with step-by-step instructions for deploying with reduced Azure OpenAI quota, including options to remove quota validation or modify quota thresholds in infra/main.bicep.
  • Provided configuration steps and example commands for setting deployment capacity and running the deployment with azd.
  • Included performance considerations and recommendations for optimal TPM values to maintain application performance.
  • Linked to related resources for further deployment, customization, and quota management guidance.
  • Explained the rationale behind quota validation and why these steps are necessary for successful deployment with limited quota.

Does this introduce a breaking change?

  • Yes
  • No

Golden Path Validation

  • I have tested the primary workflows (the "golden path") to ensure they function correctly without errors.

Deployment Validation

  • I have validated the deployment process successfully and all services are running as expected with this change.

What to Check

Verify that the following are valid

  • ...

Other Information

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants