Add retry logic for OpenAI API requests #1

lemillermicrosoft · 2023-03-03T18:55:04Z

Context and Motivation

This PR adds a retry mechanism for OpenAI API requests that encounter throttling or service errors, using the retry-after header if available. This improves the reliability and performance of the OpenAI client, especially when the request volume is high or unpredictable.

Description

The main changes in this PR are:

Add a new class RetryOnceWithDelay that implements IRetryMechanism and retries once after a delay, using the retry-after header if available.
Add a new class PassThroughWithoutRetry that implements IRetryMechanism and does not retry, for testing purposes.
Add unit tests for the new classes and the retry logic.
Modify the OpenAIClientAbstract class to check for the retry-after header in the response and throw an AIException with the retry-after value as an additional property.
Modify the OpenAIClientRetryHandler class to use the retry-after value as the delay before retrying, if available. Otherwise, it falls back to the exponential backoff strategy.

Contribution Checklist

The code builds clean without any errors or warnings
The PR follows SK Contribution Guidelines (https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md)
The code follows the .NET coding conventions (https://learn.microsoft.com/dotnet/csharp/fundamentals/coding-style/coding-conventions) verified with dotnet format
All unit tests pass, and I have added new tests where possible
I didn't break anyone 😄

dotnet/src/SemanticKernel/AI/AIException.cs

dotnet/src/SemanticKernel/AI/OpenAI/Clients/OpenAIClientAbstract.cs

dotnet/src/SemanticKernel/Configuration/KernelConfig.cs

dotnet/src/SemanticKernel/Reliability/RetryWithDelay.cs

dotnet/src/SemanticKernel/Reliability/NullRetryPolicy.cs

samples/dotnet/kernel-syntax-examples/Reliability/RetryThreeTimesWithBackoff.cs

dotnet/src/SemanticKernel/AI/AIException.cs

dotnet/src/SemanticKernel/Reliability/DefaultHttpRetryPolicy.cs

dotnet/src/SemanticKernel/AI/OpenAI/Clients/OpenAIClientAbstract.cs

samples/dotnet/kernel-syntax-examples/Reliability/RetryThreeTimesWithRetryAfterBackoff.cs

samples/dotnet/kernel-syntax-examples/Example12_Planning.cs

samples/dotnet/kernel-syntax-examples/Example08_RetryPolicy.cs

SergeyMenshykh · 2023-03-08T10:41:38Z

dotnet/src/SemanticKernel/AI/OpenAI/Clients/OpenAIClientAbstract.cs

-            HttpResponseMessage response = await this.HTTPClient.PostAsync(url, content);
+
+            HttpResponseMessage response =
+                await this._retryPolicy.ExecuteWithRetryAsync(async () => await this.HTTPClient.PostAsync(url, content, cancellationToken), this.Log,


nit: Retries through a DelegatingHandler would be more elegant solution for the problem because would decouple the code from the retry logic which would make this class a little bit simpler.

Yeah, agreed. I think I'll go ahead and take a stab at doing it that way instead for compare/contrast purposes.

I've taken a stab at this here: #2

The DelegatingHandler cannot be shared, so this also uses a Factory interface to aid with creation of DelegatingHandler instances.

… that implement it, allowing the caller to customize the retry logic for HTTP requests to AI services. The commit also modifies the OpenAI and AzureAI clients and services to accept and use the retry policy when making requests. Additionally, the commit adds a cancellation token parameter to the CompleteAsync method of the text completion clients and services, and passes it to the retry policy. The commit also updates the KernelConfig class, the kernel syntax examples, and the samples to use the new interface and classes. Finally, the commit adds unit tests and integration tests for the new functionality, and removes some unused or deprecated code.

lemillermicrosoft · 2023-03-10T20:24:25Z

Closing PR. Will be publishing #2 against the main repo.

Add Memory to the list and update diagrams with more recent information.

…soft#3415) ### Motivation and Context  ### Description  ### Contribution Checklist  - [ ] The code builds clean without any errors or warnings - [ ] The PR follows the [SK Contribution Guidelines](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md) and the [pre-submission formatting script](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md#development-scripts) raises no violations - [ ] All unit tests pass, and I have added new tests where possible - [ ] I didn't break anyone 😄

dluc suggested changes Mar 3, 2023

View reviewed changes

lemillermicrosoft commented Mar 4, 2023

View reviewed changes

dotnet/src/SemanticKernel/Reliability/RetryWithDelay.cs Outdated Show resolved Hide resolved

jansenbe reviewed Mar 4, 2023

View reviewed changes

dotnet/src/SemanticKernel/Reliability/RetryWithDelay.cs Outdated Show resolved Hide resolved

lemillermicrosoft commented Mar 6, 2023

View reviewed changes

dotnet/src/SemanticKernel/Reliability/NullRetryPolicy.cs Show resolved Hide resolved

lemillermicrosoft commented Mar 7, 2023

View reviewed changes

samples/dotnet/kernel-syntax-examples/Reliability/RetryThreeTimesWithBackoff.cs Show resolved Hide resolved

lemillermicrosoft force-pushed the u/lemiller/retryafter branch from e8f71ac to 99db183 Compare March 7, 2023 06:00

lemillermicrosoft requested a review from dluc March 7, 2023 06:05

lemillermicrosoft force-pushed the u/lemiller/retryafter branch from 4d7ee16 to ca50027 Compare March 7, 2023 20:53

lemillermicrosoft commented Mar 8, 2023

View reviewed changes

dotnet/src/SemanticKernel/AI/AIException.cs Outdated Show resolved Hide resolved

lemillermicrosoft commented Mar 8, 2023

View reviewed changes

dotnet/src/SemanticKernel/Reliability/DefaultHttpRetryPolicy.cs Outdated Show resolved Hide resolved

lemillermicrosoft commented Mar 8, 2023

View reviewed changes

dotnet/src/SemanticKernel/Reliability/DefaultHttpRetryPolicy.cs Show resolved Hide resolved

lemillermicrosoft force-pushed the u/lemiller/retryafter branch from c52796c to 807f7d4 Compare March 8, 2023 00:42