DynamoDB Streams lag monitoring #5

ivanarkhipov · 2016-10-14T08:55:42Z

Hello!
We're using DynamoDB Streams + Kinesis Client Library (KCL).
How can we measure latency between event was created in a stream and it was processed on KCL side?

As I know, KCL's MillisBehindLatest metric is specific to Kinesis Streams.
approximateCreationDateTime record attribute has a minute-level approximation, which is not acceptable for monitoring in sub-second latency systems.
Could you please help with some useful metrics for monitoringDynamoDB Streams latency?

Thank you!

Ivan

The text was updated successfully, but these errors were encountered:

pfifer · 2016-10-26T17:49:52Z

This feature is currently on DynamoDB's road map, but they don't currently have an ETA.

amcp · 2017-04-24T00:58:58Z

Put the System.timeInMillis() in an item attribute on your own when you put and update items. As long as your stream view type is NEW_IMAGES or OLD_AND_NEW_IMAGES and your item updates contain this timestamp, you can get a better approximation.

joelittlejohn · 2017-04-24T09:06:54Z

@amcp I'm afraid adding an item attribute does not solve this issue. I think this one should be reopened.

The requirement here is for lag a metric. This means the time (in millis) between the current item, and the latest item that was added to the stream. From the docs for MillisBehindLatest:

The number of milliseconds the GetRecords response is from the tip of the stream, indicating how far behind current time the consumer is. A value of zero indicates record processing is caught up, and there are no new records to process at this moment.

If no new items are added, the client is not lagging (even if the time attribute on the item is old). This is very different to checking a time attribute on the item.

amcp · 2017-04-24T15:05:57Z

Seems to me you are interested in the age of stream records relative to the tip of each shard. Each processor works on a shard forward in time. Each time you do a GetRecords call on your usual shard iterator, you could also get a shard iterator for that shard of type LATEST and compute the lag you seek in that manner. Note that shards can roll over for size and age or split for throughput reasons so you might have to do a few calls to get to the latest child shard. By sampling the tip of each shard lineage, you could keep a pretty good estimate of how much you lag.

amcp · 2017-04-24T15:12:51Z

Here is some good related reading (also includes links to prior articles).
https://noise.getoto.net/2016/08/19/monitor-your-application-for-processing-dynamodb-streams/

joelittlejohn · 2017-04-25T12:35:15Z

Another measure of lag is the number of records between the current set of records and the tip. It would be good if this library implemented some help with either kind of lag monitoring.

amcp · 2017-04-25T16:45:44Z

Together with the lag estimates above you could also use 1 minute CloudWatch ConsumedCapacity metrics on the table to estimate the number of writes accepted per second, allowing you to backtrack the number of records between your Stream Worker and the heads of offspring shard lineages.

amcp · 2017-04-25T16:53:55Z

Another thing you could do is feed the DynamoDB Stream into a Kinesis stream with a Lambda, and use the MillisBehindLatest metric from Kinesis records. Seems a bit over the top though.

Mentis · 2017-12-21T22:57:30Z

Any updates on that? Is there any other way to identify how long particular event sits in the stream?

aggarwal · 2020-08-14T06:12:07Z

The value of ApproximateCreationDateTime is precise to the second as of January 2019.

We're currently working on emitting a MillisBehindLatest metric from the adapter package that will emit the difference between ApproximateCreationDateTime from the GetRecords result and System.currentTimeMillis() on the client. Emitting this metric will allow a large majority of customers to get some basic monitoring out of the box. This will allow you to track how far behind you are in processing your stream. We expect to release this change in the next few weeks.

The DynamoDB Streams GetRecords API does not currently expose any data about the amount or age of records that were written after the records returned in a batch. Making this data available is a large project that requires architectural changes in the service. We'll consider this in our 6-12 month roadmap.

pietropra · 2020-12-07T13:45:56Z

@aggarwal this is great to know I was just looking at this information.
Perhaps is worth making it explicit in the documentation?

https://docs.aws.amazon.com/amazondynamodb/latest/APIReference/API_streams_StreamRecord.html

aggarwal · 2021-05-11T02:47:46Z

Version 1.5.3 now includes the implementation of MillisBehindLatest as described above. Due to limitations of how the metric object is scoped in KCL, this metric is emitted at the stream-shard level, and not at the application-level.

https://github.com/awslabs/dynamodb-streams-kinesis-adapter/releases/tag/1.5.3

jeet23 · 2022-08-18T11:17:09Z

Hi @aggarwal,
Is the MillisBehindLatest metric available for DynamoDB Streams Kinesis adapter as well?

Or is it only for the Kinesis streams?

pfifer mentioned this issue Oct 26, 2016

DynamoDB Streams lag monitoring awslabs/amazon-kinesis-client#113

Closed

amcp added the enhancement label Apr 22, 2017

amcp self-assigned this Apr 22, 2017

amcp closed this as completed Apr 24, 2017

amcp reopened this Apr 24, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DynamoDB Streams lag monitoring #5

DynamoDB Streams lag monitoring #5

ivanarkhipov commented Oct 14, 2016

pfifer commented Oct 26, 2016

amcp commented Apr 24, 2017

joelittlejohn commented Apr 24, 2017

amcp commented Apr 24, 2017 •

edited

Loading

amcp commented Apr 24, 2017

joelittlejohn commented Apr 25, 2017

amcp commented Apr 25, 2017

amcp commented Apr 25, 2017 •

edited

Loading

Mentis commented Dec 21, 2017

aggarwal commented Aug 14, 2020

pietropra commented Dec 7, 2020 •

edited

Loading

aggarwal commented May 11, 2021

jeet23 commented Aug 18, 2022

DynamoDB Streams lag monitoring #5

DynamoDB Streams lag monitoring #5

Comments

ivanarkhipov commented Oct 14, 2016

pfifer commented Oct 26, 2016

amcp commented Apr 24, 2017

joelittlejohn commented Apr 24, 2017

amcp commented Apr 24, 2017 • edited Loading

amcp commented Apr 24, 2017

joelittlejohn commented Apr 25, 2017

amcp commented Apr 25, 2017

amcp commented Apr 25, 2017 • edited Loading

Mentis commented Dec 21, 2017

aggarwal commented Aug 14, 2020

pietropra commented Dec 7, 2020 • edited Loading

aggarwal commented May 11, 2021

jeet23 commented Aug 18, 2022

amcp commented Apr 24, 2017 •

edited

Loading

amcp commented Apr 25, 2017 •

edited

Loading

pietropra commented Dec 7, 2020 •

edited

Loading