Commit 354469e

Reduces embedding and chat model capacity
Lowers the default capacity of the embedding and chat model deployments from 150 to 100, which may reduce resource allocation and cost.
1 parent d388f44 commit 354469e

1 file changed, +2 -2 lines changed

infra/app/ai/cognitive-services.bicep

Lines changed: 2 additions & 2 deletions
@@ -17,7 +17,7 @@ param embeddingModelFormat string = 'OpenAI'
 param embeddingModelSkuName string = 'Standard'
 
 @description('The embedding model capacity')
-param embeddingModelCapacity int = 150
+param embeddingModelCapacity int = 100
 
 @description('The chat model name to deploy')
 param chatModelName string = 'gpt-4o'
@@ -29,7 +29,7 @@ param chatModelFormat string = 'OpenAI'
 param chatModelSkuName string = 'Standard'
 
 @description('The chat model capacity')
-param chatModelCapacity int = 150
+param chatModelCapacity int = 100
 
 resource aiServices 'Microsoft.CognitiveServices/accounts@2025-04-01-preview' = {
   name: aiServicesName
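
For context, the diff only touches the parameter defaults; the part of the template that consumes them is not shown. A minimal sketch of how a capacity parameter like this is commonly wired into a model deployment follows. The resource symbol `chatDeployment`, the use of `existing`, and the deployments API version are assumptions for illustration and are not taken from this commit. For Azure OpenAI `Standard` deployments, capacity is generally expressed in units of 1,000 tokens per minute, so under that assumption the new default of 100 would correspond to roughly 100K TPM.

```bicep
// Hypothetical sketch, not the actual contents of cognitive-services.bicep.
param aiServicesName string
param chatModelName string = 'gpt-4o'
param chatModelFormat string = 'OpenAI'
param chatModelSkuName string = 'Standard'

@description('The chat model capacity')
param chatModelCapacity int = 100

// Reference the account by name (assumed to exist already for this sketch).
resource aiServices 'Microsoft.CognitiveServices/accounts@2025-04-01-preview' existing = {
  name: aiServicesName
}

// Assumed deployment resource: the capacity parameter feeds sku.capacity.
resource chatDeployment 'Microsoft.CognitiveServices/accounts/deployments@2025-04-01-preview' = {
  parent: aiServices
  name: chatModelName
  sku: {
    name: chatModelSkuName
    capacity: chatModelCapacity
  }
  properties: {
    model: {
      format: chatModelFormat
      name: chatModelName
    }
  }
}
```

Because the values are parameter defaults, an environment that still needs the previous capacity could pass `chatModelCapacity` and `embeddingModelCapacity` explicitly at deployment time instead of editing the template.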
