-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support OpenAI Dynamic Quota (DynamicThrottling) in azure-native.cognitiveservices.Deployment #3564
Comments
Hi @onordberg, the Azure spec defines |
Ah! Thanks for spotting that, @thomas11. I think it is defined both as properties of accounts and deployments. I will test what happens if we define it at the account level right away. Maybe it defaults all deployments accordingly. That would strictly speaking suffice for us. |
I get the following error which leads me to believe that this setting needs to be configured at the
|
That's unfortunate. The There are some hints on the web that HTTP PATCH needs to be used to update an existing deployment. If that's the case, this provider cannot support it out of the box but we could add it with a manual addition. I filed an upstream issue. |
Hello!
Issue details
Dynamic quota is an Azure OpenAI feature that enables a standard (pay-as-you-go) deployment to opportunistically take advantage of more quota when extra capacity is available. In the GUI it is default set to
true
as there is little downside to enabling it. More about the feature: https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/dynamic-quotaBased on a similar feature request in the Terraform project (hashicorp/terraform-provider-azurerm#23988) and the Azure Rest API implementation (https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/resource-manager/Microsoft.CognitiveServices/preview/2024-06-01-preview/cognitiveservices.json) it seems like this can be set with the
dynamicThrottlingEnabled
key.Example of a TypeScript constructor with this configuration exposed:
Affected area/feature
azure-native.cognitiveservices.Deployment
The text was updated successfully, but these errors were encountered: