-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Build hadoop from scratch #7
Conversation
The apache/hadoop:3.3.6 container is built on a 5+ year old version of CentOS and doesn't run on MacOS. Hopefully this will fix things. ``` I have no name!@5c6eec42ebf6:/opt/bitnami/spark$ ./bin/spark-submit --master yarn --conf spark.hadoop.yarn.resourcemanager.hostname=yarn-resourcemanager --conf spark.hadoop.yarn.resourcemanager.address=yarn-resourcemanager:8032 --conf spark.hadoop.fs.s3a.endpoint=http://minio:9002 --conf spark.hadoop.fs.s3a.access.key=minio --conf spark.hadoop.fs.s3a.secret.key=minio123 --conf spark.hadoop.fs.s3a.path.style.access=true --conf spark.hadoop.fs.s3a.impl=org.apache.hadoop.fs.s3a.S3AFileSystem --conf spark.yarn.stagingDir=s3a://yarn --deploy-mode client examples/src/main/python/pi.py 10 2>/dev/null Pi is roughly 3.138560 ```
## OS notes: | ||
|
||
* The Hadoop containers don't seem to start correctly on Mac machines. Ubuntu linux works | ||
normally. | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Counting my chickens before they're hatched here
Not sure if this is useful, but here is a dockerfile that shows how they build it https://github.com/apache/hadoop/blob/docker-hadoop-3/Dockerfile |
Looks like the top of the image stack from |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good.
But on my Mac:
yarn-resourcemanager | ERROR: JAVA_HOME /usr/lib/jvm/java-8-openjdk-amd64/jre/ does not exist.
yarn-nodemanager | ERROR: JAVA_HOME /usr/lib/jvm/java-8-openjdk-amd64/jre/ does not exist.
yarn-resourcemanager exited with code 1
Can you run the images and check if the directory exists? |
|
I just double checked and the images start normally for me on this commit |
It does exist. Trying to figure our what's going on here. I cannot
|
Well that's bizarre |
What link are you talking about? There's no link in the path that I see |
I mean the
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍
java-1.8.0 links to java-8, but the path goes through java-8 so there's no links in $JAVA_HOME AFAICT |
Merging this since it's better than before, and no worse on MacOS. If we can figure out what's going on with MacOS we can make another PR. |
The apache/hadoop:3.3.6 container is built on a 5+ year old version of CentOS and doesn't run on MacOS. Hopefully this will fix things.