This repository has been archived by the owner on Oct 23, 2023. It is now read-only.

Scaling Kinesis Shards Yields Errors #37

Closed
etspaceman opened this issue Nov 20, 2017 · 4 comments · Fixed by #79

Comments

@etspaceman
Contributor

See this thread on the AWS Developer forums:

https://forums.aws.amazon.com/thread.jspa?threadID=245127

To recreate this, I bump up the shard count on the Kinesis stream. With a single node, I see the error from the thread above and the consumer dies. We bounce our application and scale to 2 nodes, but one of the nodes tries to connect to the initial shard (which is now in a "CLOSED" state) and throws the same error. We've also tried removing the checkpoint data and then restarting our applications, with the same result. The only way we're able to consume all shards is to run N+1 consumer instances against the stream's shards.

@etspaceman
Contributor Author

Any thoughts on this one @markglh? It effectively forced us to drop our consumer from our application. :/

@markglh
Contributor

markglh commented Dec 21, 2017

I need to recreate this in an integration test @etspaceman - it looks like a KCL bug, but it would be interesting to see if we can work around it.

@etspaceman
Contributor Author

I talked to @markglh about this today. I think we've found the source of the issue. See:

awslabs/amazon-kinesis-client#211

According to this, the exception can be thrown if we are checkpointing exclusively with sequence numbers. The exception is also specific to shard-end events, which are indicated by the TERMINATE shutdown reason (the input to `shutdownRequested` will have this value).

https://github.com/WW-Digital/reactive-kinesis/blob/master/src/main/scala/com/weightwatchers/reactive/kinesis/consumer/ConsumerProcessingManager.scala#L143

If we change the above to include a call to `checkpointer.checkpoint()` on shard-end events, we should be able to avoid the error.
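The proposed fix can be sketched roughly as follows. This is a minimal, self-contained model: the `ShutdownReason` enum and `Checkpointer` class here are hypothetical stubs standing in for the KCL's `ShutdownReason` and `IRecordProcessorCheckpointer`, not the real library classes.

```java
// Stub for the KCL's shutdown reasons: TERMINATE means the shard was closed
// by a reshard; ZOMBIE means this worker lost its lease.
enum ShutdownReason { TERMINATE, ZOMBIE }

// Stub for the KCL's checkpointer.
class Checkpointer {
    boolean checkpointed = false;

    void checkpoint() {
        checkpointed = true;
    }
}

public class ShardEndHandler {
    // Returns true if a shard-end checkpoint was written.
    static boolean onShutdown(ShutdownReason reason, Checkpointer checkpointer) {
        if (reason == ShutdownReason.TERMINATE) {
            // The shard has been closed by a reshard: checkpoint here so the
            // parent shard is marked fully consumed and its children can be
            // picked up, instead of failing on the CLOSED parent.
            checkpointer.checkpoint();
            return true;
        }
        // Lease lost (ZOMBIE): another worker owns the shard now, so we
        // deliberately do not checkpoint.
        return false;
    }

    public static void main(String[] args) {
        Checkpointer cp = new Checkpointer();
        System.out.println(onShutdown(ShutdownReason.TERMINATE, cp)); // true
        System.out.println(onShutdown(ShutdownReason.ZOMBIE, new Checkpointer())); // false
    }
}
```

The point of the branch is that checkpointing is only safe (and only required) on TERMINATE; checkpointing after losing the lease would itself throw in the real KCL.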

@agaro1121
Contributor

That makes sense. Scaling Kinesis involves closing a shard and replacing it with two others. If we don't checkpoint at the end of the old shard, Kinesis flags it as being in an inconsistent state.
