-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Zilla is unresponsive sometimes & the app logs Stopped but accepts the CRUD requests #1312
Comments
Container definition used:
The cpu and memory are given 4 vcpu & 16GB. |
My suspicion is that something is wrong with Zilla's Kafka connection. Can you validate that you are able to reach Kafka from zilla's container?
This opens you a terminal on the ecs container.
Check if you can reach the bootstrap server: |
I've observed the same response/behaviour while hitting the zilla endpoint with the new version(0.9.101). The zilla container is able to connect to the kafka cluster(from inside the container, verified the connectivity to kafka through netcat command while the curl to the zilla endpoint is tried from other terminal). Shared the diagnostic log with new version (0.9.101) to review for any traces. |
Hi Team,
I'm running zilla with kafka crud use case, where i primarily focused on POST. Initially the app was running fine with 0.9.89 , able to get 204 response when we hit it with curl. We see an abnormal behaviour when using curl on CRUD operations. the curl is stuck and awaiting for the response back from the server. I see a log entry but actually the request is not hitting the zilla app.
October 24, 2024 at 22:00 (UTC+5:30)
Strange observation is, if the existing curl is forced to stop and re-run the curl POST couple of times, it works and gives 204 response, but soon after it'll be back in the stuck mode. I don't see much in the logging on what's really happening. I tried deploying a new image version 0.9.98 just to isolate the issue. But it's the same behaviour. I want to know where exactly the problem lies in. please help me on how can i enable better logging. The additional change to the existing zilla config is added telemetry at each section. I don't think that's a problem!
Observations: when we try CRUD operations,
stuck
in getting the response forever.204
responseEmpty reply from server.
Stopped
abruptly, but still accepts the request.Here is some additional info. this is something which popped in the log when a load run was triggered from jmeter( 8MPBS network 800 TPS load with 10kb of payload size)
Attached a Diagnostic zip zilla_20241028121032.tar.gz
Env: Zilla is deployed on AWS ECS fargate. The Kafka implementation is Confluent Cloud.
zilla.yaml config:
Attached.
zilla.yaml.zip
Slack ref: https://aklivitycommunity.slack.com/archives/C027SEH2KRC/p1730119696631149?thread_ts=1729788282.245839&cid=C027SEH2KRC
Let me know if you need anything more!
The text was updated successfully, but these errors were encountered: