How to correctly set up SparkDataSet using S3 #1352
paulofbmarcon
started this conversation in
Idea
Replies: 1 comment
-
So when it comes to spark we've found managing cloud credentials (AWS and often Azure from experience) we have found that this approach is typically easier to get up and running. IAM roles make this even easier, but that's not always possible. The solution you've ended up is good - the only thing we could recommend is that you do the credentials stuff in |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I tried the below configuration in catalog.yml but I couldn't login to AWS S3. The only way I could get to access s3 files was to set com.amazonaws.auth.profile.ProfileCredentialsProvider in spark.yml. Is this the expected behavior?
It looks like SparkDataSet should be reading my credentials configuration file.
with the credentials.yml as follow:
Beta Was this translation helpful? Give feedback.
All reactions