
Adding 4MB limit for Azure Storage Table batch insert #191

Merged

Conversation

@gled4er (Contributor) commented Jun 14, 2018

Hello @cgillum,

I am opening this PR to give you early visibility into the first changes for #339.

Let me know if you spot something that needs to be fixed immediately.

I plan to double-check the code and add tests for the change next.

Thank you!

@gled4er (Contributor Author) commented Jun 14, 2018

Hello @cgillum,

After thinking about it a bit, I added support for two more cases that we check for each case in the switch statement:

  • If the data payload itself is above 4 MB, we throw a StorageException
  • If adding the current payload would push the batch size above 4 MB, we first insert the batch without the current payload and then start a new batch with the current payload (see the sketch below)
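A minimal sketch of that second case, using hypothetical names (BatchSizeSketch, AddAsync, FlushAsync) rather than the PR's actual TableBatchOperation/UploadHistoryBatch code:

using System.Collections.Generic;
using System.Linq;
using System.Text;
using System.Threading.Tasks;

// Hypothetical illustration only; the PR itself works with TableBatchOperation
// and UploadHistoryBatch rather than the placeholder types used here.
class BatchSizeSketch
{
    const int MaxBatchBytes = 4 * 1024 * 1024; // Azure Table batch limit

    readonly List<string> pendingBatch = new List<string>();

    public async Task AddAsync(string payload)
    {
        int payloadBytes = Encoding.Unicode.GetByteCount(payload);
        int batchBytes = this.pendingBatch.Sum(p => Encoding.Unicode.GetByteCount(p));

        // Case 2: adding this payload would push the batch past 4 MB, so flush
        // the current batch first and start a new one with this payload.
        if (batchBytes + payloadBytes > MaxBatchBytes)
        {
            await this.FlushAsync();
        }

        this.pendingBatch.Add(payload);
    }

    Task FlushAsync()
    {
        // Placeholder for the real upload (UploadHistoryBatch in the PR).
        this.pendingBatch.Clear();
        return Task.CompletedTask;
    }
}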

I think adding these cases gives us better coverage.

I am interested in hearing your opinion on these suggestions. I like the simplicity of your initial design, but I am worried that there may be edge cases we could miss. Let me know if these are valid considerations.

Thank you!

@cgillum (Member) commented Jun 14, 2018

Thanks @gled4er, I will review this.

I don't understand the case where we throw an exception if the payload is above 4 MB. With your fix, I would expect this would never happen because that's exactly what we're intending to prevent. I'll look at the code changes to see if I can understand better what you're suggesting.

@@ -21,6 +21,10 @@
    <PackageReference Include="WindowsAzure.Storage" version="8.5.0" />
  </ItemGroup>

  <ItemGroup>
    <PackageReference Include="System.ValueTuple" Version="4.5.0" />
Member

Is this change necessary? If not, I suggest we remove it. I'm not super-comfortable adding these newer semi-framework packages since I don't fully understand the implications.

Contributor Author

Yes, you are right. I will remove it. The value provided is not worth extending the list of dependencies. Thank you!

int byteCount = Encoding.Unicode.GetByteCount(data);
if (byteCount > 4 * 1024 * 1024)
{
    throw new StorageException($"Data with payload of {byteCount} bytes can not be added to a storage table.");
Member

This will cause an infinite failure loop, and will result in the exact same problem that this PR is intended to fix. We need to ensure that the payload never exceeds 4 MB. Ling previously did the Large Message Support work to ensure this is not possible for any individual row. The purpose of this PR is simply to measure the aggregate of multiple rows.

Contributor Author

Thank you for clarifying that for me. I had the same feeling. I will remove this logic. Thank you!

@@ -502,6 +520,43 @@ public override Task StartAsync()
orchestrationInstanceUpdateStopwatch.ElapsedMilliseconds);
}

private async Task<(int, string)> GetDataByteCountAndETag(string data, int estimatedBytes, string instanceId, string executionId, TableBatchOperation historyEventBatch, StringBuilder newEventListBuffer, IList<HistoryEvent> newEvents, string eTagValue)
Member

FWIW, I think a ref parameter is a more appropriate way to update estimatedBytes from within this method.

Contributor Author

Yes, this was my first idea as well, but async methods can't have out or ref parameters.
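For illustration, a minimal sketch of the workaround, with hypothetical names (the PR's real method is GetDataByteCountAndETag and takes several more parameters): since async methods cannot declare ref or out parameters, the updated values are returned in a tuple and reassigned by the caller.

using System.Text;
using System.Threading.Tasks;

// Hypothetical sketch, not the PR's actual implementation.
class TupleReturnSketch
{
    async Task<(int estimatedBytes, string eTag)> MeasureAsync(
        string data, int estimatedBytes, string eTag)
    {
        estimatedBytes += Encoding.Unicode.GetByteCount(data);
        // ... upload the batch here if the running total is too large, updating eTag ...
        await Task.CompletedTask;
        return (estimatedBytes, eTag);
    }

    // Caller reassigns from the returned tuple instead of passing by ref:
    //   (estimatedBytes, eTagValue) = await MeasureAsync(data, estimatedBytes, eTagValue);
}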

{
    if (byteCount + estimatedBytes > 4 * 1024 * 1024)
    {
        updatedETagValue = await this.UploadHistoryBatch(instanceId, executionId, historyEventBatch, newEventListBuffer, newEvents.Count, eTagValue);
Member

Why would we upload the history batch here instead of doing it outside the switch statement in the calling code?

Contributor Author

You are right, Chris. Now looking at the code with a fresh eye, I think these changes complicate things and do not provide value. I will revert them. Thank you for providing very valuable feedback on that!

@gled4er (Contributor Author) commented Jun 14, 2018

Hello @cgillum,

Thank you for the great feedback.

I will fix the issues.

Thank you!

@cgillum (Member) commented Jun 15, 2018

Thanks. Generally speaking, I think the first iteration you had was sufficient, but let me know if you are confident there are other gaps. I want to include this fix in the 1.5 release, so I will wait for this to get merged before doing the release.

@gled4er (Contributor Author) commented Jun 15, 2018

Hello @cgillum,

I reverted the changes from the last commit.

Thank you for taking the time to point out the issues it introduced.

Thank you!

Kanio

@gled4er (Contributor Author) commented Jun 15, 2018

Hello @cgillum,

I wanted to ask you for clarification. When we check the value of estimatedBytes, we assume we are working with the actual data, but when this data is more than 60 KB we actually send it to a blob and add a new property holding the name of the blob. The original property (for example Input, Result, and so on) is null. So in this case, shouldn't we use the byte size of the blob name (a GUID) coming from, for example, the ResultBlobName property, instead of the original data size, when deciding whether to push the batch to the table? If this is true, it gives us a nice optimization and we will be able to reduce the calls to the storage account.

If this idea is useful, we can change the compress method slightly to return the byte count of the blob name (always a GUID), or to return a boolean marker indicating whether the compression happened, such as:

async Task<int> CompressLargeMessageAsync(DynamicTableEntity entity)
{
    int byteSize = 0;
    string propertyKey = this.GetLargeTableEntity(entity);
    if (propertyKey != null)
    {
        string blobName = Guid.NewGuid().ToString();
        byteSize = Encoding.Unicode.GetByteCount(blobName);

        // e.g. InputBlobName, OutputBlobName, ResultBlobName
        string blobNameKey = $"{propertyKey}{BlobNamePropertySuffix}";
        byte[] messageBytes = this.GetPropertyMessageAsBytes(entity);
        await this.messageManager.CompressAndUploadAsBytesAsync(messageBytes, blobName);
        entity.Properties.Add(blobNameKey, new EntityProperty(blobName));
        this.SetPropertyMessageToEmptyString(entity);
    }

    return byteSize;
}

And then in UpdateStateAsync we have:

for (int i = 0; i < newEvents.Count; i++)
{
    HistoryEvent historyEvent = newEvents[i];
    DynamicTableEntity entity = this.tableEntityConverter.ConvertToTableEntity(historyEvent);

    int compressedLargeMessageNameByteSize = await this.CompressLargeMessageAsync(entity);

    // Monitor for orchestration instance events
    switch (historyEvent.EventType)
    {
        case EventType.ExecutionStarted:
            estimatedBytes += compressedLargeMessageNameByteSize > 0 ? compressedLargeMessageNameByteSize : Encoding.Unicode.GetByteCount(executionStartedEvent.Input);
            break;
        case EventType.ExecutionCompleted:
            estimatedBytes += compressedLargeMessageNameByteSize > 0 ? compressedLargeMessageNameByteSize : Encoding.Unicode.GetByteCount(executionCompletedEvent.Result);
            break;
        case EventType.ExecutionTerminated:
            estimatedBytes += compressedLargeMessageNameByteSize > 0 ? compressedLargeMessageNameByteSize : Encoding.Unicode.GetByteCount(executionTerminatedEvent.Input);
            break;
        case EventType.ContinueAsNew:
            estimatedBytes += compressedLargeMessageNameByteSize > 0 ? compressedLargeMessageNameByteSize : Encoding.Unicode.GetByteCount(executionCompletedEvent.Result);
            break;
    }

    // Table storage only supports inserts of up to 100 entities at a time or 4 MB at a time.
    if (historyEventBatch.Count == 99 || estimatedBytes > 3 * 1024 * 1024 /* 3 MB */)
    {
        // ...
    }
}

So, for example, if I have an input that is 5 MB, the existing logic will push the batch to storage immediately. But we could wait, since the actual entity holds only a 72-byte replacement for the original 5 MB payload due to the blob compression.
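For context, the 72-byte figure comes from the UTF-16 encoding of the 36-character GUID string used as the blob name:

using System;
using System.Text;

// A GUID in its default "D" format is 36 characters; Encoding.Unicode (UTF-16)
// uses 2 bytes per character, so the blob-name replacement is 36 * 2 = 72 bytes.
string blobName = Guid.NewGuid().ToString();            // "xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"
int byteSize = Encoding.Unicode.GetByteCount(blobName); // 72
Console.WriteLine(byteSize);                            // prints 72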

I am interested to hear your opinion on this suggestion.

Thank you!

@cgillum (Member) commented Jun 15, 2018

It's true that in some cases these properties might be null. That could be the case for large message payloads like you mentioned, or it could be true because a function was given null as an input or output. The code we write needs to work either way (i.e. it shouldn't throw any exceptions).

I didn't understand the part about the optimization. How is it we can reduce calls to storage?

I'm not sure how valuable it is to return the number of bytes for the blob name. Won't it always be a small fixed size? If the name is 72 bytes and the maximum batch size is 100, isn't that at most ~7 KB? Do you think it's important for us to account for this when we've already given ourselves a 1 MB buffer (we upload early only if the total size reaches 3 MB)?
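Spelling out the arithmetic behind this point (the 100-entity cap and the 3 MB flush threshold are the limits used in the PR's batching check; the variable names are illustrative):

// Worst case for the blob-name overhead: every one of the (at most) 100 rows in
// a batch carries a 72-byte GUID name, which is still tiny compared to the 1 MB
// of headroom left by flushing at 3 MB instead of the 4 MB batch limit.
const int GuidByteSize = 72;
const int MaxBatchEntities = 100;
int worstCaseBlobNameBytes = GuidByteSize * MaxBatchEntities; // 7,200 bytes ≈ 7 KB
int headroomBytes = 4 * 1024 * 1024 - 3 * 1024 * 1024;        // 1,048,576 bytes = 1 MB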

@gled4er (Contributor Author) commented Jun 15, 2018

Hello @cgillum,

I am sorry for not explaining it in a more understandable way. The use case I was thinking of is as follows:

  1. We have an input bigger than 60 KB (let's say 5 MB)
  2. This input is offloaded to a blob and we add reference information to the entity
  3. However, we are adding the original 5 MB payload size to estimatedBytes in order to decide when to process the batch. In my opinion, we should not use 5 MB in this case anymore, since it is not part of the entity but is persisted in a blob; instead we could have something like the following code:
switch (historyEvent.EventType)
{
    case EventType.ExecutionStarted:
        estimatedBytes += Encoding.Unicode.GetByteCount(executionStartedEvent.Input) > MaxStorageQueuePayloadSizeInBytes ? 72 : Encoding.Unicode.GetByteCount(executionStartedEvent.Input);
        break;
    case EventType.ExecutionCompleted:
        estimatedBytes += Encoding.Unicode.GetByteCount(executionCompletedEvent.Result) > MaxStorageQueuePayloadSizeInBytes ? 72 : Encoding.Unicode.GetByteCount(executionCompletedEvent.Result);
        break;
    case EventType.ExecutionTerminated:
        estimatedBytes += Encoding.Unicode.GetByteCount(executionTerminatedEvent.Input) > MaxStorageQueuePayloadSizeInBytes ? 72 : Encoding.Unicode.GetByteCount(executionTerminatedEvent.Input);
        break;
    case EventType.ContinueAsNew:
        estimatedBytes += Encoding.Unicode.GetByteCount(executionCompletedEvent.Result) > MaxStorageQueuePayloadSizeInBytes ? 72 : Encoding.Unicode.GetByteCount(executionCompletedEvent.Result);
        break;
}

You are right that we don't need to change the CompressLargeMessageAsync method, and we can have the 72-byte value defined as a const.

Let me know if you find value in this approach, or whether we should leave the code in its current state.

Thank you!

@cgillum (Member) commented Jun 15, 2018

Ah, I understand now. You're saying we will overcount in the case of large messages. Yes, in that case I think it makes sense to check for this condition and return an appropriate byte size like you suggested so that we don't upload to the history table unnecessarily.

@gled4er (Contributor Author) commented Jun 15, 2018

Hello @cgillum,

Thank you for the quick reply!

I will add the changes.

In terms of integration tests, do you think we need to add additional ones?

Lin added great tests for both large text and binary messages, so I think we don't need new tests for this PR. Moreover, I think it will be difficult to detect the batch size from the integration tests.

Thank you!

@gled4er (Contributor Author) commented Jun 15, 2018

Hello @cgillum,

I added the optimization for the bytes calculation.

When you have time, may I ask you to review once more?

It would be great if you could enable CI for this pull request so I can validate that all tests pass on the server as well.

Thank you!

@cgillum (Member) commented Jun 15, 2018

I'll take a look. Unfortunately there is no CI on this branch, so we have to validate manually.

@cgillum (Member) left a comment

I really like the simplicity of the new iteration - thanks!

I have one piece of performance-related feedback that I would like you to consider. However, for now I'm going to accept this change so we can move forward with the release; we can have a follow-up PR to make it more efficient.

void UpdateEstimatedBytes(ref int estimatedBytes, string payload)
{
    int payloadBytes = Encoding.Unicode.GetByteCount(payload);
    estimatedBytes += payloadBytes > MaxStorageQueuePayloadSizeInBytes ? GuidByteSize : payloadBytes;
Member

I think it would be more efficient to examine the entity to know if there is a large message payload first before counting bytes. These bytes were already counted earlier when deciding if we need to upload to blob storage, so we're basically double-counting.
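A possible shape of that follow-up, as a hedged sketch (the "*BlobName" suffix check mirrors the BlobNamePropertySuffix convention shown earlier; the helper name and class here are hypothetical, not the PR's code):

using System.Linq;
using System.Text;
using Microsoft.WindowsAzure.Storage.Table;

class EntitySizeSketch
{
    const int GuidByteSize = 72; // UTF-16 size of a GUID blob name

    // Hypothetical sketch of the suggested optimization: look for an existing
    // "...BlobName" property written by CompressLargeMessageAsync instead of
    // re-measuring the original payload with GetByteCount.
    static int EstimateEntityPayloadBytes(DynamicTableEntity entity, string payload)
    {
        bool hasLargeMessageBlob = entity.Properties.Keys.Any(key => key.EndsWith("BlobName"));
        if (hasLargeMessageBlob)
        {
            return GuidByteSize;
        }

        return payload == null ? 0 : Encoding.Unicode.GetByteCount(payload);
    }
}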

@cgillum (Member) commented Jun 15, 2018

Unfortunately I found another issue, which is that this PR does not count activity function return values (i.e. the TaskCompleted event type). I've submitted a separate PR to make these corrections.
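For illustration, counting that event type would mean one more case in the switch over historyEvent.EventType shown earlier, along these lines (a hedged sketch; the separate PR mentioned above is the authoritative fix):

// Hypothetical addition to the existing switch over historyEvent.EventType.
case EventType.TaskCompleted:
    TaskCompletedEvent taskCompletedEvent = (TaskCompletedEvent)historyEvent;
    estimatedBytes += taskCompletedEvent.Result != null
        ? Encoding.Unicode.GetByteCount(taskCompletedEvent.Result)
        : 0;
    break;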

simonporter pushed a commit that referenced this pull request Jun 19, 2018
* DurableTask.AzureStorage API to enumerate instances (#187)

* DurableTask.AzureStorage ETW trace improvements (#192)

* Adding 4MB limit check for Azure Storage Table batch insert (#191)

* DurableTask.AzureStorage: Alternate fix for the 4 MB max entity size which covers more scenarios. (#194)

* Updated Newtonsoft.Json to v11.0.2, WindowsAzure.Storage to v8.6.0. (#193)

* Fixed issues with the ETW event source and added test reliability improvements.