Add section about productionizing #577
Conversation
I LOVE THIS!!!
README.md
Outdated
* **OpenAI Capacity**: The default TPM (tokens per minute) is set to 30K. That is equivalent
to approximately 30 conversations per minute (assuming 1K per user message/response).
You can increase the capacity by changing the `chatGptDeploymentCapacity` and `embeddingDeploymentCapacity` parameters in `infra/main.bicep` to your account's maximum capacity.
You can also view the Quotas tab from Azure OpenAI studio to understand how much capacity you have.
"from" > "in"
README.md
Outdated
You can also view the Quotas tab from Azure OpenAI studio to understand how much capacity you have.
* **Azure Storage**: The default storage account uses the `Standard_LRS` SKU.
We recommend using `Standard_ZRS` for production deployments,
which you can specify using the `sku` property in `infra/main.bicep`.
...`sku` property under module `storage` in ...
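To make that pointer concrete, a hypothetical sketch of how the `sku` property might sit under a `storage` module in `infra/main.bicep` (the module path and surrounding parameters are assumptions, not the repo's actual contents):

```bicep
// Hypothetical shape of the storage module in infra/main.bicep;
// only the SKU name needs to change for zone-redundant storage.
module storage 'core/storage/storage-account.bicep' = {
  name: 'storage'
  params: {
    sku: {
      name: 'Standard_ZRS' // default is Standard_LRS; ZRS tolerates a zone outage
    }
  }
}
```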
README.md
Outdated
You can increase the capacity by changing the `chatGptDeploymentCapacity` and `embeddingDeploymentCapacity` parameters in `infra/main.bicep` to your account's maximum capacity.
You can also view the Quotas tab from Azure OpenAI studio to understand how much capacity you have.
* **Azure Storage**: The default storage account uses the `Standard_LRS` SKU.
We recommend using `Standard_ZRS` for production deployments,
To improve your resiliency we recommend...
README.md
Outdated
which you can specify using the `sku` property in `infra/main.bicep`.
* **Azure Cognitive Search**: The default search service uses the `Standard` SKU
with the free semantic search option. You should either change `semanticSearch` to "standard"
or disable semantic search entirely in the approaches files. If you see errors about search service capacity being exceeded, you may find it helpful to increase the number of replicas by changing `replicaCount` in `infra/core/search/search-services.bicep`.
in the `/app/backend/approaches` files.
I think it's in `infra/core/search/search-services.bicep`, or manually scaling it from the Azure Portal.
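Putting the two knobs from this thread together, a hedged sketch of the relevant parameters (names follow the review comments; the defaults shown are illustrative):

```bicep
// Hypothetical excerpt from infra/core/search/search-services.bicep;
// parameter names follow the review thread, values are illustrative.
param semanticSearch string = 'standard' // was the free option, which has monthly limits
param replicaCount int = 3               // raise this if you see capacity-exceeded errors
```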
Left a couple of nits, otherwise looking good!
README.md
Outdated
* **OpenAI Capacity**: The default TPM (tokens per minute) is set to 30K. That is equivalent
to approximately 30 conversations per minute (assuming 1K per user message/response).
You can increase the capacity by changing the `chatGptDeploymentCapacity` and `embeddingDeploymentCapacity` parameters in `infra/main.bicep` to your account's maximum capacity.
You can also view the Quotas tab from Azure OpenAI studio to understand how much capacity you have.
Would be nice to use a link for Azure OpenAI studio
README.md
Outdated
which you can specify using the `sku` property in `infra/main.bicep`.
* **Azure Cognitive Search**: The default search service uses the `Standard` SKU
with the free semantic search option. You should either change `semanticSearch` to "standard"
or disable semantic search entirely in the approaches files. If you see errors about search service capacity being exceeded, you may find it helpful to increase the number of replicas by changing `replicaCount` in `infra/core/search/search-services.bicep`.
I think it's in `infra/core/search/search-services.bicep`, or manually scaling it from the Azure Portal.
* Remove defaults for getenv
* Remove print
* missing output
* readme section
* Update README with productionizing tips
* Add networking section
* Review feedback from comments
Purpose
This PR adds a README section with tips for putting this template into production, based on customer experiences.
Does this introduce a breaking change?
Pull Request Type
What kind of change does this Pull Request introduce?