-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rkb->rkb_fetching > 0 error #84
Comments
Is this reproducible? 2014-02-21 10:48 GMT+07:00 winbatch notifications@github.com:
|
only happened once, we'll see if I can reproduce it. What does On Thu, Feb 20, 2014 at 11:08 PM, Magnus Edenhill
|
Each broker thread must only have one outstanding Fetch request (containing How many topics and partitions were you consuming from? 2014-02-21 11:13 GMT+07:00 winbatch notifications@github.com:
|
I believe it was a 10 partition topic. Note that I am starting 10 'loops' BTW, if you think it's more likely to happen with more partitions - I happen to have a topic with 500 partitions if you want me to try that. On Thu, Feb 20, 2014 at 11:22 PM, Magnus Edenhill
|
500! now we're talking, yes, please try it on the fortune500 topic with the debug enabled (there will be a lot of output...) By start and stop I mean close (in time) to the rd_kafka_consume_start|stop calls, but it seems this wasnt the case here. |
certainly not near the stop, since I don't call stop - this is to run I'll see if I can reproduce it and get some debug logs. On Thu, Feb 20, 2014 at 11:31 PM, Magnus Edenhill
|
You know, it occurred to me. If you're taking the bold step of aborting the program with the assert, couldn't you print out your debug information (that 'rk_dump' function) right before the abort? I jknow it wouldn't have the entire history, but might give you what you need? |
Thats a very good idea. |
(please don't tell me you're going to post the above in a loop once for each partition? ;) ) |
This assert version calls rd_kafka_dump() (if an rk is available) thus providing some more information before abort():ing. This is winbatch's idea from issue #84
Did you have a chance to repro this on your 500 topic cluster? |
I also find this problem and it is reproducible. Kafka server has 4 topics and the topics from which our program read messages has 3 partitions. Here is the stack information: |
Thanks! The crash was caused by retrying timed out Fetch requests, reproducible by bringing down the broker and starting it again. Now fixed in master. |
Not sure what this means, but it doesn't look good ;)
Got this while doing a simple consumer.
rdkafka_broker.c:2800: rd_kafka_broker_fetch_reply: Assertion `rkb->rkb_fetching > 0' failed.
Aborted
The text was updated successfully, but these errors were encountered: