Skip to content
This repository was archived by the owner on May 27, 2025. It is now read-only.

Commit 946b95c

Browse files
author
Gabriel Nieves
committed
Updated deployment documentation
1 parent 89d6a47 commit 946b95c

File tree

1 file changed

+8
-2
lines changed

1 file changed

+8
-2
lines changed

docs/DEPLOYMENT-GUIDE.md

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -97,20 +97,26 @@ In the `deploy.parameters.json` file, provide values for the following required
9797
`GRAPHRAG_LLM_MODEL` | gpt-4 | No | Name of the Azure OpenAI LLM model to use (or deploy).
9898
`GRAPHRAG_LLM_MODEL_VERSION` | turbo-2024-04-09 | No | Model version of the LLM model to use (or deploy). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
9999
`GRAPHRAG_LLM_DEPLOYMENT_NAME` | gpt-4 | No | Deployment name of the LLM model to use (or deploy).
100-
`GRAPHRAG_LLM_MODEL_QUOTA` | 80 | No | TPM quota of the LLM model in units of 1000 (i.e. 10 = 10,000 TPM). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
100+
`GRAPHRAG_LLM_MODEL_CONCURRENT_REQUEST` | 15 | No | The max number of simultaneous chat completions LLM requests allowed.
101+
`GRAPHRAG_LLM_MODEL_QUOTA` | 80 | No | TPM quota of the LLM model in units of 1000 (i.e. 10 = 10,000 TPM). Only required if deploying a new AOAI instance (i.e.
102+
`GRAPHRAG_API_BASE` is left undefined).
101103
`GRAPHRAG_EMBEDDING_MODEL` | text-embedding-ada-002 | No | Name of the Azure OpenAI embedding model.
102104
`GRAPHRAG_EMBEDDING_MODEL_VERSION` | 2 | No | Model version of the embedding model to use (or deploy). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
103105
`GRAPHRAG_EMBEDDING_DEPLOYMENT_NAME` | text-embedding-ada-002 | No | Deployment name of the embedding model to use (or deploy).
106+
`GRAPHRAG_EMBEDDING_MODEL_CONCURRENT_REQUEST` | 15 | No | The max number of simultaneous embedding requests allowed.
104107
`GRAPHRAG_EMBEDDING_MODEL_QUOTA` | 300 | No | TPM quota of the embedding model in units of 1000 (i.e. 10 = 10,000 TPM). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
105108
`GRAPHRAG_IMAGE` | graphrag:backend | No | The name and tag of the graphrag docker image in the container registry. Will default to `graphrag:backend` and be hosted at `my_container_registry_name>.azurecr.io/graphrag:backend`.
106109
`CONTAINER_REGISTRY_LOGIN_SERVER` | <container_registry_name>.azurecr.io | No | Endpoint of an existing Azure Container Registry where the `GRAPHRAG_IMAGE` docker image is hosted. If not provided, a unique name will be generated (recommended).
107110
`COGNITIVE_SERVICES_AUDIENCE` | `https://cognitiveservices.azure.com/.default` | No | Endpoint for cognitive services identity authorization. Should be explicitly set for deployments in other Azure clouds.
108111
`APIM_NAME` | <auto_generated_unique_name> | No | Hostname of the graphrag API. Must be a globally unique name. The API will be available at `https://<APIM_NAME>.azure-api.net`.
109-
`APIM_TIER` | Developer | No | The [APIM tier](https://azure.microsoft.com/en-us/pricing/details/api-management) to use. Can be either `Developer` or `StandardV2`. `StandardV2` costs more but will deploy faster.
112+
`APIM_TIER` | Developer | No | The [APIM tier](https://azure.microsoft.com/en-us/pricing/details/api-management) to use. Can be either `Developer` or
113+
`StandardV2`. `StandardV2` costs more but will deploy faster.
114+
`AI_SEARCH_TIER` | standard | No | The [AI Search tier](https://learn.microsoft.com/en-us/azure/search/search-sku-tier) to use. Can be either `free`, `basic`, `standard`, `standard2`, `standard3`, `storage_optimized_l1`, or `storage_optimized_l2`
110115
`RESOURCE_BASE_NAME` | | No | Suffix to apply to all azure resource names. If not provided a unique suffix will be generated.
111116
`AISEARCH_ENDPOINT_SUFFIX` | `search.windows.net` | No | Suffix to apply to AI search endpoint. Should be overridden for deployments in other Azure clouds.
112117
`AISEARCH_AUDIENCE` | `https://search.azure.com/` | No | AAD audience for AI Search. Should be overridden for deployments in other Azure clouds.
113118

119+
114120
### 5. Deploy solution accelerator to the resource group
115121
```shell
116122
cd infra

0 commit comments

Comments
 (0)