Skip to content

Conversation

@koeninger
Copy link
Contributor

No description provided.

@codecov-io
Copy link

Current coverage is 84.98%

Merging #32 into master will increase coverage by +0.29% as of 9ebb0b9

@@            master     #32   diff @@
======================================
  Files           10      10       
  Stmts          307     313     +6
  Branches        67      67       
  Methods          0       0       
======================================
+ Hit            260     266     +6
  Partial          0       0       
  Missed          47      47       

Review entire Coverage Diff

Powered by Codecov

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another test coverage issue :(

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, there's a separate earlier PR just for that one line... got pulled into this branch by accident as I'm trying to get things working

@marmbrus
Copy link
Contributor

/cc @kai-zeng who has a WIP branch with a similar fix.

@JoshRosen
Copy link
Contributor

Hey @koeninger, do you have a regression test for this? Now that we have the infra from #41 in place I think it would be a good idea to test this.

@JoshRosen JoshRosen added the bug label Aug 21, 2015
@JoshRosen
Copy link
Contributor

Hmm, I'm wondering why the fact that I'm missing this fix in my end-to-end tests has not led to a test failure yet.

@eduardoramirez
Copy link
Contributor

@JoshRosen I came across this issue as well. It worked fine when running the job an EC2 instance that was allowed to read/write from S3. However, when running the job locally, I needed to set the hadoop configuration with my keys.

@JoshRosen
Copy link
Contributor

@eduardoramirez @koeninger, given that both of you have reported this issue and fix I'm inclined to merge this now and figure out the regression tests later. I'm still puzzled over why I haven't been able to get this to fail in Travis, especially given that I purposely picked environment variables that don't match the ones that AWS expects. I'll take a closer look at our S3 bucket permissions tomorrow to see if that could explain the problem.

@JoshRosen
Copy link
Contributor

I think I've spotted the problem: during test setup, we end up setting some Hadoop Configuration access key properties on the global SparkContext.hadoopConfiguration. I'm going to remove those lines, verify that the tests fail, then will rebase and merge this fix.

@JoshRosen
Copy link
Contributor

I've opened #55 to bring this up to date and to fix the tests so that they're capable of detecting the bug being fixed here.

@JoshRosen JoshRosen closed this Aug 26, 2015
JoshRosen pushed a commit that referenced this pull request Aug 26, 2015
…st unload

This is an updated version of #32 with two additional changes:

- Do not mutate the SparkContext's `hadoopConfiguration`.
- Refactor tests so that they're capable of catching this bug.

Author: cody koeninger <cody@koeninger.org>
Author: Josh Rosen <joshrosen@databricks.com>

Closes #55 from JoshRosen/set-credentials.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants