-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rdkafka_partition.c:165:rd_kafka_toppar_remove: assert: rktp->rktp_removed==0 #558
Comments
From the backtrace it seems that one of your topics were suddenly missing from the metadata returned from the broker, or that the partition count changed for a topic. Does that sound familiar? librdkafka still shouldnt fail because of this and I'll try to reproduce it. |
I have the exact same issue, the coredump stacks is the same. |
Is this still happening on latest master? |
I downloaded the source on 9 Apr. Alan From: Magnus Edenhill <notifications@github.commailto:notifications@github.com> Is this still happening on latest master? You are receiving this because you commented. |
Yes please, try latest master and if problem persists outline exact steps to reproduce. Thanks |
This seems to be caused by connecting to two clusters simultaneously where the topic state differs. |
This error is still happening with the latest version: My setup: bin/kafka-topics.sh --zookeeper server1,server2,server3,server4,server5,server6 --describe --topic myTopic My application is producing on partition 1 and consuming from partition 0 and 2. Scenario:
Application log: INFO - DeliveryReportCallback: Ack of message delivery for (124 bytes): Success GDB back trace: #0 0x0000003c24432625 in raise () from /lib64/libc.so.6 Note that I am using the KafkaConsumer, not the legacy Consumer. I also do not have a custom rebalance callback. Thanks. |
It looks like this happens when different brokers report different partition counts for the same topic. |
Just saw that this bug was part of 0.9.2 milestone. Do you have any date in mind for the release? Thanks. |
Ive reproduced it and will fix it soon. |
This is fixed on the partition_changes branch |
This issue is now fixed on master, please try to verify the fix in your environment. |
We are using librdkafka-0.9.0.99, and it crash when keeping on consuming message for hours.
The logs message:
Consume failed: Local: Operation in progress
Consume failed: Local: Operation in progress
Consume failed: Local: Operation in progress
Consume failed: Local: Operation in progress
Consume failed: Local: Operation in progress
*** rdkafka_partition.c:165:rd_kafka_toppar_remove: assert: rktp->rktp_removed==0 ***
Aborted (core dumped)
The gdb info is like this:
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
warning: no loadable sections found in added symbol-file system-supplied DSO at 0x7fff40195000
Core was generated by `./etld etl.properties'.
Program terminated with signal 6, Aborted.
#0 0x00007ff1e34630d5 in raise () from /lib/x86_64-linux-gnu/libc.so.6
(gdb) bt
#0 0x00007ff1e34630d5 in raise () from /lib/x86_64-linux-gnu/libc.so.6
#1 0x00007ff1e346683b in abort () from /lib/x86_64-linux-gnu/libc.so.6
#2 0x00007ff1e28402aa in rd_kafka_crash (file=, line=, function=, rk=0x0,
#3 0x00007ff1e2870ad5 in rd_kafka_toppar_remove (rktp=0x7ff1bc003020) at rdkafka_partition.c:165
#4 0x00007ff1e2870af2 in rd_kafka_toppar_destroy_final (rktp=0x7ff1bc003020) at rdkafka_partition.c:183
#5 0x00007ff1e28514d8 in rd_kafka_topic_leader_update (rkb=0x7ff1d8002890, partition=1, rkt=0x7ff1bc0011d0,
#6 rd_kafka_topic_metadata_update (rkb=0x7ff1b8003f40, mdt=0x7ff1b8001547) at rdkafka_topic.c:735
#7 0x00007ff1e2860d31 in rd_kafka_parse_Metadata (rkb=0x7ff1b8003f40, rkt=0x7ff1bc0011d0, rkbuf=)
#8 0x00007ff1e28692b8 in rd_kafka_op_handle_Metadata (rkb=0x7ff1b8003f40, err=RD_KAFKA_RESP_ERR_NO_ERROR, rkbuf=0x2325640,
#9 0x00007ff1e2859959 in rd_kafka_buf_callback (rkb=0x7ff1b8003f40, err=RD_KAFKA_RESP_ERR_NO_ERROR, response=0x2325640,
#10 0x00007ff1e2838609 in rd_kafka_toppar_q_cb (rk=, rko=, cb_type=,
#11 0x00007ff1e285bd0f in rd_kafka_q_serve (rkq=0x7ff1d8001670, timeout_ms=, max_cnt=, cb_type=0,
#12 0x00007ff1e2841dbd in rd_kafka_toppars_q_serve (timeout_ms=, rkq=0x7ff1d8001670) at rdkafka.c:892
#13 rd_kafka_thread_main (arg=0x7ff1d80015e0) at rdkafka.c:929
#14 0x00007ff1e28781ff in _thrd_wrapper_function (aArg=) at tinycthread.c:599
#15 0x00007ff1e3d08e9a in start_thread () from /lib/x86_64-linux-gnu/libpthread.so.0
#16 0x00007ff1e352038d in clone () from /lib/x86_64-linux-gnu/libc.so.6
#17 0x0000000000000000 in ?? ()
Do you know why it crash like this? Thanks.
The text was updated successfully, but these errors were encountered: