-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I am experiencing difficulties connecting to Kafka in a distributed environment. #109
Comments
Hi @5win, thanks for reaching out. I've just tried Kafka ML with an empty cluster (K3s v1.28.3-rc2+k3s2). Unfortunately we cannot reproduce your issue. It looks like a name resolution error, which might be one of two things:
On either case, verify your * Maybe your cluster is not using CoreDNS, but is the most used out there. PS: The steps I used to deploy Kafka-ML local are: # Make sure no Kafka-ML is running
kubectl delete namespace kafkaml
# Clone the repo
git clone https://github.com/ertis-research/kafka-ml
# Deploy Kafka-ML
kubectl apply -k kafka-ml/kustomize/local |
We've just noticed this error comes as a result of the bootstrap process in Kafka (see https://www.confluent.io/blog/kafka-client-cannot-connect-to-broker-on-aws-on-docker-etc/). This is likely caused because you are using our local deployment on a remote cluster and thus localhost do not match up. There are three ways of resolving your issue: Port forward Kafka to your local machineSomething among these lines should work. Keep the following command running on a background shell: kubectl port-forward '--namespace=kafkaml' service/kafka 9094:9094 And set the kafka address to Modify KAFKA_CFG_ADVERTISED_LISTENERSYou can add Deploy the normal Kafka-ML versionThis would unlock the usage of GPU if your cluster supports it and you deploy the |
Dear authors, Thanks |
Hello,
I am trying to run kafka-ml on a k8s cluster and execute the mnist example code. However, I am facing the following error and having difficulty connecting to Kafka.
This is the result after cloning the repository, creating pods using 'apply kustomize/local', and modifying the bootstrap_servers in mnist_dataset_training_example.py to the IP and NodePort of the node where the Kafka pod is running.
I appreciate your assistance, even though it might be inconvenient. Thank you.
The text was updated successfully, but these errors were encountered: