`Scheduler::shutdown` (or `Worker` for older revisions) can be executed after `Scheduler::run` has been called but before the `Scheduler::initialize` phase has completed. This causes `LeaseCoordinator::stop` (where the `LeaseTaker` is canceled and shut down) to be called before `LeaseCoordinator::start`, where the `LeaseTaker` is actually initialized. As a result, the `Scheduler` and `LeaseCoordinator` end up in a shutdown state while the `LeaseCoordinator`'s `leaseCoordinatorThreadPool` and `takerFuture` keep executing, taking leases from other `Scheduler`s that are actually running.
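The ordering problem can be illustrated with a small self-contained model. This is only a sketch and not KCL code: `ToyCoordinator`, `attemptsAfterStopThenStart`, and the `guarded` flag are hypothetical names, and the model assumes that the thread pool is created inside `start()` and that `stop()` can only shut down a pool that already exists. The guarded variant shows one possible way to close the window, not necessarily the fix the library uses.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

public class StopBeforeStartSketch {

    /** Hypothetical stand-in for a lease coordinator; this is NOT the KCL class. */
    static class ToyCoordinator {
        private final boolean guarded;
        private final AtomicInteger leaseAttempts = new AtomicInteger();
        private ScheduledExecutorService pool; // assumption: created only in start()
        private boolean stopRequested;

        ToyCoordinator(boolean guarded) {
            this.guarded = guarded;
        }

        synchronized void start() {
            // Guarded variant: refuse to start once a stop has been requested.
            if (guarded && stopRequested) {
                return;
            }
            pool = Executors.newSingleThreadScheduledExecutor();
            // Stands in for the periodic lease-taking task.
            pool.scheduleWithFixedDelay(leaseAttempts::incrementAndGet, 0, 5, TimeUnit.MILLISECONDS);
        }

        synchronized void stop() {
            stopRequested = true;
            // If start() has not run yet there is nothing to shut down,
            // so an early stop() has no effect on a later start().
            if (pool != null) {
                pool.shutdownNow();
            }
        }

        int leaseAttempts() {
            return leaseAttempts.get();
        }
    }

    /** Calls stop() before start() and reports how often the toy taker ran afterwards. */
    static int attemptsAfterStopThenStart(boolean guarded) {
        ToyCoordinator coordinator = new ToyCoordinator(guarded);
        coordinator.stop();   // shutdown arrives before initialization finished
        coordinator.start();  // initialization completes afterwards
        try {
            Thread.sleep(100);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        int attempts = coordinator.leaseAttempts();
        coordinator.stop();   // cleanup so the JVM can exit
        return attempts;
    }

    public static void main(String[] args) {
        System.out.println("unguarded taker kept running: " + (attemptsAfterStopThenStart(false) > 0));
        System.out.println("guarded taker attempts: " + attemptsAfterStopThenStart(true));
    }
}
```

In the unguarded case the "stopped" coordinator still runs its taker task after `start()`, which mirrors the behaviour described above. In the guarded case `start()` becomes a no-op once a stop has been requested.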
Simply running the following test can reproduce the problem:
@pfifer I am seeing the issue below with client v1.8.1, and it looks close to this one.
2019-02-09 21:10:01.875 INFO 26971 --- [ Thread-29] c.a.s.k.clientlibrary.lib.worker.Worker : Worker shutdown requested.
2019-02-09 21:10:01.876 INFO 26971 --- [ Thread-29] c.a.s.k.leases.impl.LeaseCoordinator : Worker ip-1234. has successfully stopped lease-tracking threads
2019-02-09 21:10:01.877 INFO 26971 --- [dProcessor-0000] c.c.d.v.s.p.KinesisRecordProcessor : Checkpointing shard shardId-000000000000
2019-02-09 21:10:01.878 INFO 26971 --- [dProcessor-0000] k.c.l.w.KinesisClientLibLeaseCoordinator : Worker ip-1234. could not update checkpoint for shard shardId-000000000000 because it does not hold the lease
2019-02-09 21:10:01.878 INFO 26971 --- [dProcessor-0000] c.c.d.v.s.p.KinesisRecordProcessor : Caught shutdown exception, skipping checkpoint.
com.amazonaws.services.kinesis.clientlibrary.exceptions.ShutdownException: Can't update checkpoint - instance doesn't hold the lease for this shard
at com.amazonaws.services.kinesis.clientlibrary.lib.worker.KinesisClientLibLeaseCoordinator.setCheckpoint(KinesisClientLibLeaseCoordinator.java:174) ~[amazon-kinesis-client-1.8.1.jar!/:na]
Any clue whether this is the same issue? Sometimes the checkpoint gets updated, and sometimes it throws the error above and those messages are delivered to the consumer again. Is this fixed in 2.0.4? Upgrading would be a very major change for me, so I am looking to see whether 1.x has a fix for this issue.