Topic data access slow on sort NEWEST #192
Hello, can you provide me some logs after running this curl command?
curl -i -X POST -H "Content-Type: application/json" \
-d '{ "configuredLevel": "TRACE" }' \
http://localhost:8080/loggers/org.kafkahq
It will help me understand the response time. One question: what is your sort preference on this page, and is this topic compacted? |
here is the log, sort options are default so i guess 'Newest'
|
The topic is not compacted no.
|
OK, thanks for the information! It seems that you use the So to summarize, the
On the OLDEST If the topic is compacted, it can be worse (still with NEWEST), because it can lead to multiple queries on Kafka to be able to fetch some data. That is the situation, but as far as I know I don't have any solution to speed up the NEWEST part (except adding a config option to allow more parallel fetches). Just for information, can you send me the log with sort OLDEST? |
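To illustrate why NEWEST tends to cost more than OLDEST, here is a minimal Python sketch. It is not KafkaHQ's actual code. It assumes each partition only exposes its begin and end offsets, so reading the latest records means seeking back from the end of every partition. The names `newest_start_offsets` and `page_size` are hypothetical.

```python
from typing import Dict, Tuple


def newest_start_offsets(
    offsets: Dict[int, Tuple[int, int]], page_size: int
) -> Dict[int, int]:
    """Return, per partition, the offset to seek to so that at most
    `page_size` of the most recent records are read.

    `offsets` maps partition id -> (begin_offset, end_offset), where
    end_offset is the offset the next written record would get.
    """
    starts = {}
    for partition, (begin, end) in offsets.items():
        # Never seek before the first offset still retained in the log.
        starts[partition] = max(begin, end - page_size)
    return starts
```

On a compacted topic, some offsets in the computed range may no longer exist, so a seek can return fewer records than `page_size` and the reader has to seek further back and query again, which is consistent with the "multiple queries" remark above.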
This is the debug with OLDEST, indeed it's really fast. Not sure that this is the default though, don't remember changing this ever in the UI. Or maybe i did very long time ago and it's stored in a cookie.
|
OK, just as I thought. The configuration can be changed in the configuration files only. |
Thanks, indeed it was set to NEWEST. Browsing OLDEST rarely makes sense in our case. |
So like I said, I can't change this behavior, since it is a limitation of the Kafka API. The only option is to raise parallelism. |
This is the CPU spec: Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz, 3601 Mhz, 4 Core(s), 8 Logical Processor(s) For comparison, i have created a gif of loading the same topic data in the other tool. This just to exclude connectivity or infrastructure issues with the cluster. It does seem possible to get back some data rather quickly, so wondering where the main difference could be with kafkahq. I hope you don't mind me posting this, if not let me know and i'll remove it. |
No problem with the screenshot. I have added some logging to the dev docker image. Is it possible to have the log output (will log line like |
@tchiotludo sorry, but I cannot easily test docker here. Do you have a snapshot of the jar available somewhere? |
Yes, look at this: You can grab the jar on your local desktop |
Using latest dev build i get NPE when wanting to browse topic data. Note I built it myself locally.
|
Argh, I just pushed a new commit to the dev branch, can you retry please? |
Here is the new output. Note that at the end, the page says 'no data available' which i find a bit strange. I have seen this happening before with NEWEST, never with OLDEST. There is definitely data in the topic as you can see from below output. Sometimes after a few refreshes the data reappears.
|
The "no data available" message can happen because the Kafka API can't tell me whether there is data, so I reach the poll timeout (which you can change in the config). The latest log output is showing "no data available", right? |
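The timeout behavior described here can be sketched as follows. This is a simplified Python illustration, not KafkaHQ's implementation: `poll_until`, its `poll` callback, and the injectable `clock` are all hypothetical names. The point is that an empty result after the deadline looks the same to the caller whether the topic is empty or the fetch was merely slow.

```python
import time
from typing import Callable, List


def poll_until(
    poll: Callable[[float], List[str]],
    timeout_s: float,
    clock: Callable[[], float] = time.monotonic,
) -> List[str]:
    """Call `poll` repeatedly until it returns records or `timeout_s` elapses.

    `poll(t)` is expected to block for at most `t` seconds and return a
    (possibly empty) list of records.
    """
    deadline = clock() + timeout_s
    while True:
        remaining = deadline - clock()
        if remaining <= 0:
            # Deadline reached: the caller cannot tell "no data" from "too slow".
            return []
        records = poll(min(remaining, 0.1))
        if records:
            return records
```

With a short timeout, a fetch that needs a few polls comes back empty and the page would show "no data available"; with a longer timeout the same fetch returns records.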
I increased poll-timeout to 10000 but still getting "no data available". There are 550k records in the topic, size is about 110MB. EDIT: increasing poll-timeout to 20000 makes results appear again.
|
The amount of volume in the topic is not related to this bug it seems. For a topic of 5k records i had to wait 35 seconds to get back only one record, that's a bit surprising. |
* fixed bug
* erased console log and fixed formatDateTime function
* topic data filters saved in session storage
* schema column responsive
Seems that the reason can be found here |
- Adding stop condition in newest sort to prevent extra polling
- Fixing wrong number of records with pagination
close #192
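One way such a stop condition could look is sketched below in Python. This is an assumption for illustration only, not the code from the commit: it supposes the reader records each partition's end offset before reading and stops polling once every partition's read position has reached it. `all_partitions_drained` is a hypothetical name.

```python
from typing import Dict


def all_partitions_drained(
    positions: Dict[int, int], end_offsets: Dict[int, int]
) -> bool:
    """Return True once every partition's read position has reached the end
    offset captured before reading started, so another poll cannot return
    any record from the requested range."""
    # A partition with no recorded position is treated as being at offset 0.
    return all(positions.get(p, 0) >= end for p, end in end_offsets.items())
```

Checking this after each poll lets the reader return immediately instead of waiting for the poll timeout when nothing more can arrive.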
Hi,
When in the topic list overview, clicking on a topic that has a lot of data takes a long time before the record list shows up. This is a topic with only about 500k records. Checking perf in another browsing tool (kafkatool.com), there the records display in 4-5 seconds for the same topic. Is there anything that could potentially slow down fetching the records ?