
Can't access Kubeflow dashboard after using "kubectl port-forward svc/istio-ingressgateway -n istio-system --address 0.0.0.0 8085:80" #332

Closed
TranThanh96 opened this issue Aug 23, 2022 · 27 comments
Labels
bug Something isn't working

Comments

@TranThanh96

I tried to install Kubeflow on AWS with S3 storage by following the tutorial at https://awslabs.github.io/kubeflow-manifests/docs/deployment/
Everything works well except the last step, accessing the Kubeflow dashboard: kubectl port-forward svc/istio-ingressgateway -n istio-system --address 0.0.0.0 8085:80

After port-forwarding, I can't access http://localhost:8080/
The page gives me a 403 error: You don't have authorization to view this page!
How can I fix this?

TranThanh96 added the bug label on Aug 23, 2022
@AlexandreBrown
Contributor

@TranThanh96 Can you make sure your command is the same as in the doc https://awslabs.github.io/kubeflow-manifests/docs/deployment/vanilla/guide/#port-forward ?

kubectl port-forward svc/istio-ingressgateway -n istio-system 8080:80
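For reference, a short sketch of how the local port in the port-forward command maps to the dashboard URL (the 8085 variant is the one from the issue title):

# forwards local port 8080 to port 80 of the ingress gateway;
# the dashboard is then reachable at http://localhost:8080
kubectl port-forward svc/istio-ingressgateway -n istio-system 8080:80

# with --address 0.0.0.0 and 8085:80, the dashboard would instead be at
# http://localhost:8085 (or http://<host-ip>:8085 from another machine)
kubectl port-forward svc/istio-ingressgateway -n istio-system --address 0.0.0.0 8085:80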

@ryansteakley
Contributor

Hey @TranThanh96, I responded to you on Slack. Can you additionally specify which deployment option you ran? Was it rds-s3?

@TranThanh96
Author

@TranThanh96 Can you make sure your command is the same as in the doc https://awslabs.github.io/kubeflow-manifests/docs/deployment/vanilla/guide/#port-forward ?

kubectl port-forward svc/istio-ingressgateway -n istio-system 8080:80

Yes, I tried both with and without --address 0.0.0.0, but I still can't access it from the browser.

@TranThanh96
Author

Hey @TranThanh96, I responded to you on Slack. Can you additionally specify which deployment option you ran? Was it rds-s3?

I used S3 only.

@TranThanh96
Author

After re-installing everything, I can now reach the login page.

@ryansteakley
Contributor

Sounds good. Verify that you are able to log in and run any samples you wish.

@TranThanh96
Author

TranThanh96 commented Aug 24, 2022

Sounds good. Verify that you are able to log in and run any samples you wish.

@ryansteakley I can't see any example pipelines in the dashboard.
(screenshot)

I also can't create a new notebook server; the error is: 0/1 nodes are available: 1 Too many pods.
log.txt

@ryansteakley
Contributor

It looks like you have several pods in CrashLoopBackOff. Is your instance the same size as, or similar to, the one described in https://awslabs.github.io/kubeflow-manifests/docs/deployment/prerequisites/ ? Did you follow the auto-setup Python script?

@ryansteakley
Contributor

Run kubectl describe pod <pod-name> -n kubeflow and similarly kubectl logs <pod-name> -n kubeflow on the pods in a failed state, and share anything you find there as well.
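A short sketch of those debugging commands, with placeholder pod names (the actual names come from kubectl get pods):

# list all pods in the kubeflow namespace and note the ones not Running
kubectl get pods -n kubeflow

# inspect events and recent logs of a failing pod (placeholder name)
kubectl describe pod <failing-pod-name> -n kubeflow
kubectl logs <failing-pod-name> -n kubeflow
# for a pod in CrashLoopBackOff, --previous shows the logs of the last crashed container
kubectl logs <failing-pod-name> -n kubeflow --previous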

@TranThanh96
Author

@ryansteakley
Contributor

In the ml-pipeline logs I see: Warning Failed 34m (x5 over 34m) kubelet Error: secret "mlpipeline-minio-artifact" not found. Can you check whether this secret exists? Run kubectl get secrets -n kubeflow.
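To check for that specific secret rather than scanning the whole list, something like this should work:

# exits non-zero if the secret is missing
kubectl get secret mlpipeline-minio-artifact -n kubeflow
# inspect which keys it contains (values are base64-encoded)
kubectl get secret mlpipeline-minio-artifact -n kubeflow -o yaml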

@TranThanh96
Author

secrets_kf_log.txt
It seems like it exists.
(screenshot)

@ryansteakley
Contributor

Can you verify that you are using v3.2.0 of kustomize? Run kubectl delete pods -n kubeflow --all and see if the pods come up normally.
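A quick sketch of both checks (the exact output format of kustomize version varies by build):

# the output should show version 3.2.0
kustomize version

# recreate all pods in the kubeflow namespace and watch them come back up
kubectl delete pods -n kubeflow --all
kubectl get pods -n kubeflow -w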

@TranThanh96
Author

Yes, I am using kustomize v3.2.0.
(screenshot)

I tried kubectl delete pods -n kubeflow --all, but the pod metadata-grpc-deployment-f8d68f687-mqs82 keeps going into CrashLoopBackOff.

(screenshot)

@ryansteakley
Contributor

What do you see when you log in? Are any other pods still failing?

@TranThanh96
Author

Everything is good except that those 3 pods keep going into CrashLoopBackOff.
(screenshot)

I also get some errors on Pipelines and Runs; any suggestions, please?

(screenshot)
(screenshot)

Errors on Runs:
(screenshot)
(screenshot)

And these pods:
(screenshot)

@ryansteakley
Contributor

Can you verify that the s3-secret you created follows this requirement: configure a Secret (e.g. s3-secret) with your AWS credentials. These need to be long-term credentials from an IAM user, not temporary ones.
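A minimal sketch of creating such a secret from long-term IAM user credentials; the key names (accesskey/secretkey) and the kubeflow namespace are assumptions here, so check the deployment guide for the exact names it expects:

# placeholders: <AWS_ACCESS_KEY_ID> and <AWS_SECRET_ACCESS_KEY> are the IAM user's
# long-term credentials, not temporary STS credentials
kubectl create secret generic s3-secret -n kubeflow \
  --from-literal=accesskey=<AWS_ACCESS_KEY_ID> \
  --from-literal=secretkey=<AWS_SECRET_ACCESS_KEY>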

@TranThanh96
Author

TranThanh96 commented Aug 24, 2022

Yes, I can confirm that. How can I give you evidence?

@ryansteakley
Contributor

There is no way to prove it. Can you describe the ml-pipeline pod one more time? I would suggest restarting from a fresh cluster and following the cluster prerequisites listed above.

@TranThanh96
Author

Yes, this is the third time I have re-installed Kubeflow on AWS EKS from a fresh cluster, and this error keeps occurring.

@ryansteakley
Contributor

Sorry you are running into these problems. If you can, please share the logs from the latest CrashLoopBackOff ml-pipeline pod. Which version of Kubeflow on AWS are you running? I will try to reproduce your issue on my end and see if there is an underlying issue.

@TranThanh96
Author

Sorry you are running into these problems. If you can, please share the logs from the latest CrashLoopBackOff ml-pipeline pod. Which version of Kubeflow on AWS are you running? I will try to reproduce your issue on my end and see if there is an underlying issue.

How can I get these logs? I can provide them to you.
I am using this version:
KUBEFLOW_RELEASE_VERSION=v1.5.1
AWS_RELEASE_VERSION=v1.5.1-aws-b1.0.1
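For context, a rough sketch of how these version variables are typically used when checking out the manifests; the exact commands are in the deployment guide, and the assumption here is that the AWS release version exists as a git tag in awslabs/kubeflow-manifests:

export KUBEFLOW_RELEASE_VERSION=v1.5.1
export AWS_RELEASE_VERSION=v1.5.1-aws-b1.0.1
# clone the AWS manifests repo and check out the release tag (assumed tag name)
git clone https://github.com/awslabs/kubeflow-manifests.git && cd kubeflow-manifests
git checkout ${AWS_RELEASE_VERSION}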

@ryansteakley
Contributor

kubectl logs <ml-pipeline-pod> -n kubeflow

I also see you are running 2 t3.xlarge nodes; we recommend a minimum of 5 nodes of m5.xlarge, as stated here: https://awslabs.github.io/kubeflow-manifests/docs/deployment/prerequisites/. If you have time, try to re-create the cluster following the suggested cluster create command.
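A minimal sketch of a cluster matching that recommendation; the cluster name, region, and Kubernetes version below are placeholders or assumptions, so prefer the exact eksctl command from the prerequisites page:

# <cluster-name>, <aws-region>, and the --version value are placeholders
eksctl create cluster \
  --name <cluster-name> \
  --region <aws-region> \
  --version 1.21 \
  --node-type m5.xlarge \
  --nodes 5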

@TranThanh96
Author

kubectl logs <ml-pipeline-pod> -n kubeflow

(screenshot)
This is the log from ml-pipeline.

@surajkota
Contributor

surajkota commented Aug 25, 2022

@ryansteakley @TranThanh96 I think this is because of a bug related to the missing MySQL deployment in the S3-only deployment option. It was fixed in the main branch recently but not backported to the release branch: #310

@TranThanh96 Can you comment out the line - disable-mysql-pv-claim.yaml in awsconfigs/apps/pipeline/s3/kustomization.yaml and run

kustomize build awsconfigs/apps/pipeline/s3 | kubectl apply -f -

Please delete the pods that are in CrashLoopBackOff after doing this so that new pods get created.
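A sketch of the workaround as a sequence of commands; the commented-out line shows the intended change to kustomization.yaml, and the pod name in the delete step is a placeholder:

# in awsconfigs/apps/pipeline/s3/kustomization.yaml, comment out the patch, e.g.:
#   # - disable-mysql-pv-claim.yaml

# rebuild and apply the S3 pipeline manifests
kustomize build awsconfigs/apps/pipeline/s3 | kubectl apply -f -

# delete the crashing pods so new ones get created with the fix
kubectl delete pod <crashing-pod-name> -n kubeflow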

@TranThanh96
Author

@ryansteakley @TranThanh96 I think this is because of a bug related to the missing MySQL deployment in the S3-only deployment option. It was fixed in the main branch recently but not backported to the release branch: #310

@TranThanh96 Can you comment out the line - disable-mysql-pv-claim.yaml in awsconfigs/apps/pipeline/s3/kustomization.yaml and run

kustomize build awsconfigs/apps/pipeline/s3 | kubectl apply -f -

Please delete the pods that are in CrashLoopBackOff after doing this so that new pods get created.

Yes, I tried the RDS + S3 deployment and everything works, so the problem is related to MySQL.

@surajkota
Contributor

Thanks for reporting this issue. We have released a patch version (v1.5.1-aws-b1.0.2) to fix it.
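A hedged sketch of moving to the patch release, assuming it is published as a git tag in awslabs/kubeflow-manifests:

cd kubeflow-manifests
git fetch --tags
git checkout v1.5.1-aws-b1.0.2
# then rebuild and apply the manifests following the deployment guide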
