This repository was archived by the owner on May 27, 2025. It is now read-only.
docs/DEPLOYMENT-GUIDE.md (8 additions, 2 deletions)
@@ -97,20 +97,26 @@ In the `deploy.parameters.json` file, provide values for the following required
 `GRAPHRAG_LLM_MODEL` | gpt-4 | No | Name of the Azure OpenAI LLM model to use (or deploy).
 `GRAPHRAG_LLM_MODEL_VERSION` | turbo-2024-04-09 | No | Model version of the LLM model to use (or deploy). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
 `GRAPHRAG_LLM_DEPLOYMENT_NAME` | gpt-4 | No | Deployment name of the LLM model to use (or deploy).
-`GRAPHRAG_LLM_MODEL_QUOTA` | 80 | No | TPM quota of the LLM model in units of 1000 (i.e. 10 = 10,000 TPM). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
+`GRAPHRAG_LLM_MODEL_CONCURRENT_REQUEST` | 15 | No | The max number of simultaneous chat completions LLM requests allowed.
+`GRAPHRAG_LLM_MODEL_QUOTA` | 80 | No | TPM quota of the LLM model in units of 1000 (i.e. 10 = 10,000 TPM). Only required if deploying a new AOAI instance (i.e.
+`GRAPHRAG_API_BASE` is left undefined).
 `GRAPHRAG_EMBEDDING_MODEL` | text-embedding-ada-002 | No | Name of the Azure OpenAI embedding model.
 `GRAPHRAG_EMBEDDING_MODEL_VERSION` | 2 | No | Model version of the embedding model to use (or deploy). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
 `GRAPHRAG_EMBEDDING_DEPLOYMENT_NAME` | text-embedding-ada-002 | No | Deployment name of the embedding model to use (or deploy).
+`GRAPHRAG_EMBEDDING_MODEL_CONCURRENT_REQUEST` | 15 | No | The max number of simultaneous embedding requests allowed.
 `GRAPHRAG_EMBEDDING_MODEL_QUOTA` | 300 | No | TPM quota of the embedding model in units of 1000 (i.e. 10 = 10,000 TPM). Only required if deploying a new AOAI instance (i.e. `GRAPHRAG_API_BASE` is left undefined).
 `GRAPHRAG_IMAGE` | graphrag:backend | No | The name and tag of the graphrag docker image in the container registry. Will default to `graphrag:backend` and be hosted at `my_container_registry_name>.azurecr.io/graphrag:backend`.
 `CONTAINER_REGISTRY_LOGIN_SERVER` | <container_registry_name>.azurecr.io | No | Endpoint of an existing Azure Container Registry where the `GRAPHRAG_IMAGE` docker image is hosted. If not provided, a unique name will be generated (recommended).
 `COGNITIVE_SERVICES_AUDIENCE` | `https://cognitiveservices.azure.com/.default` | No | Endpoint for cognitive services identity authorization. Should be explicitly set for deployments in other Azure clouds.
 `APIM_NAME` | <auto_generated_unique_name> | No | Hostname of the graphrag API. Must be a globally unique name. The API will be available at `https://<APIM_NAME>.azure-api.net`.
-`APIM_TIER` | Developer | No | The [APIM tier](https://azure.microsoft.com/en-us/pricing/details/api-management) to use. Can be either `Developer` or `StandardV2`. `StandardV2` costs more but will deploy faster.
+`APIM_TIER` | Developer | No | The [APIM tier](https://azure.microsoft.com/en-us/pricing/details/api-management) to use. Can be either `Developer` or
+`StandardV2`. `StandardV2` costs more but will deploy faster.
+`AI_SEARCH_TIER` | standard | No | The [AI Search tier](https://learn.microsoft.com/en-us/azure/search/search-sku-tier) to use. Can be either `free`, `basic`, `standard`, `standard2`, `standard3`, `storage_optimized_l1`, or `storage_optimized_l2`
 `RESOURCE_BASE_NAME` | | No | Suffix to apply to all azure resource names. If not provided a unique suffix will be generated.
 `AISEARCH_ENDPOINT_SUFFIX` | `search.windows.net` | No | Suffix to apply to AI search endpoint. Should be overridden for deployments in other Azure clouds.
 `AISEARCH_AUDIENCE` | `https://search.azure.com/` | No | AAD audience for AI Search. Should be overridden for deployments in other Azure clouds.
 
+
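The three parameters this change adds to the table (`GRAPHRAG_LLM_MODEL_CONCURRENT_REQUEST`, `GRAPHRAG_EMBEDDING_MODEL_CONCURRENT_REQUEST`, and `AI_SEARCH_TIER`) could be sanity-checked before deploying. The following is a minimal sketch, not part of the accelerator: it assumes `deploy.parameters.json` is a flat JSON object whose numeric values are JSON numbers, and the helper names `check_parameters` and `quota_to_tpm` are hypothetical. The allowed tier values and the defaults are copied from the table rows.

```python
import json

# Allowed values as listed in the parameter table.
APIM_TIERS = {"Developer", "StandardV2"}
AI_SEARCH_TIERS = {
    "free", "basic", "standard", "standard2", "standard3",
    "storage_optimized_l1", "storage_optimized_l2",
}
CONCURRENCY_KEYS = (
    "GRAPHRAG_LLM_MODEL_CONCURRENT_REQUEST",
    "GRAPHRAG_EMBEDDING_MODEL_CONCURRENT_REQUEST",
)


def check_parameters(params: dict) -> list[str]:
    """Hypothetical helper: return a list of problems (empty if none found)."""
    problems = []

    # Missing keys fall back to the defaults documented in the table.
    apim_tier = params.get("APIM_TIER", "Developer")
    if apim_tier not in APIM_TIERS:
        problems.append(f"APIM_TIER: unexpected value {apim_tier!r}")

    search_tier = params.get("AI_SEARCH_TIER", "standard")
    if search_tier not in AI_SEARCH_TIERS:
        problems.append(f"AI_SEARCH_TIER: unexpected value {search_tier!r}")

    for key in CONCURRENCY_KEYS:
        value = params.get(key, 15)
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int) or value < 1:
            problems.append(f"{key}: expected a positive integer, got {value!r}")

    return problems


def quota_to_tpm(quota: int) -> int:
    """Convert a *_MODEL_QUOTA value (units of 1000) to tokens per minute."""
    return quota * 1000


# Hypothetical excerpt of deploy.parameters.json using the documented defaults.
example = json.loads("""
{
  "GRAPHRAG_LLM_MODEL_CONCURRENT_REQUEST": 15,
  "GRAPHRAG_LLM_MODEL_QUOTA": 80,
  "GRAPHRAG_EMBEDDING_MODEL_CONCURRENT_REQUEST": 15,
  "GRAPHRAG_EMBEDDING_MODEL_QUOTA": 300,
  "APIM_TIER": "Developer",
  "AI_SEARCH_TIER": "standard"
}
""")

print(check_parameters(example))                           # prints []
print(quota_to_tpm(example["GRAPHRAG_LLM_MODEL_QUOTA"]))   # prints 80000
```

If the real deployment script reads these values as strings rather than numbers, the integer check would need to be relaxed accordingly.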
### 5. Deploy solution accelerator to the resource group