-
Notifications
You must be signed in to change notification settings - Fork 321
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enable batch sampling #709
Enable batch sampling #709
Conversation
Codecov Report
@@ Coverage Diff @@
## issue-692-sample-remaining-columns #709 +/- ##
=====================================================================
Coverage ? 65.93%
=====================================================================
Files ? 36
Lines ? 2657
Branches ? 0
=====================================================================
Hits ? 1752
Misses ? 905
Partials ? 0 Continue to review full report at Codecov.
|
83c4049
to
c394863
Compare
129b19f
to
f79231a
Compare
46b70f7
to
1f8ebbd
Compare
bc6e6e2
to
b797faf
Compare
1f8ebbd
to
30bd3cd
Compare
30bd3cd
to
0fbd9fc
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks pretty good to me, nice job 👍
b797faf
to
e3e9d67
Compare
0fbd9fc
to
1a7abb6
Compare
e3e9d67
to
3c7135b
Compare
48c0eec
to
786927d
Compare
3c7135b
to
754b9e1
Compare
786927d
to
ea3d28a
Compare
754b9e1
to
640419c
Compare
ea3d28a
to
50b1585
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One small question but otherwise LGTM!
sampled, num_valid = self._sample_rows( | ||
batch_size_per_try, conditions, transformed_conditions, float_rtol, sampled, | ||
) | ||
|
||
num_increase = min(num_valid - prev_num_valid, remaining) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In what case would remaining
be less than num_valid - prev_num_valid
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since we are sampling up to batch_size_per_try
rows, we could sample more than the number of remaining rows. For example, if we wish to sample 100 rows and we've already sampled 90 valid rows, but our batch size is 50, then we could sample something like 38 rows in the _sample_rows
call above
640419c
to
38c1783
Compare
39754ef
to
1098a0d
Compare
38c1783
to
4c38284
Compare
1098a0d
to
9ecfeae
Compare
4c38284
to
780a293
Compare
9ecfeae
to
f211698
Compare
780a293
to
32dd6ab
Compare
f211698
to
37f962e
Compare
* Add method to sample remaining columns * update integration tests * add unit tests * update tutorials and docs * Enable batch sampling (#709) * Add batch sampling and progress bar * Make sure to close progress bar * Periodically write to file * add unit tests * cr comments * fix test
* Add method to sample remaining columns * update integration tests * add unit tests * update tutorials and docs * Enable batch sampling (#709) * Add batch sampling and progress bar * Make sure to close progress bar * Periodically write to file * add unit tests * cr comments * fix test
* Add method to sample remaining columns * update integration tests * add unit tests * update tutorials and docs * Enable batch sampling (#709) * Add batch sampling and progress bar * Make sure to close progress bar * Periodically write to file * add unit tests * cr comments * fix test
* Add method to sample remaining columns * update integration tests * add unit tests * update tutorials and docs * Enable batch sampling (#709) * Add batch sampling and progress bar * Make sure to close progress bar * Periodically write to file * add unit tests * cr comments * fix test
* Update sample method args * Add unit tests * remove conditioning logic for now * Add error handling * Make integration tests pass * code review comments * Add sample conditions method * add back unit tests with conditions * Update logic for handling multiple conditions * fix lint * fix integration tests * Add method to sample remaining columns (3/3) (#708) * Add method to sample remaining columns * update integration tests * add unit tests * update tutorials and docs * Enable batch sampling (#709) * Add batch sampling and progress bar * Make sure to close progress bar * Periodically write to file * add unit tests * cr comments * fix test
* Update sample method args * Add unit tests * remove conditioning logic for now * Add error handling * Make integration tests pass * code review comments * Add sample conditions method * add back unit tests with conditions * Update logic for handling multiple conditions * fix lint * fix integration tests * Add method to sample remaining columns (3/3) (#708) * Add method to sample remaining columns * update integration tests * add unit tests * update tutorials and docs * Enable batch sampling (#709) * Add batch sampling and progress bar * Make sure to close progress bar * Periodically write to file * add unit tests * cr comments * fix test
* Update sample method args * Add unit tests * remove conditioning logic for now * Add error handling * Make integration tests pass * code review comments * Add sample conditions method * add back unit tests with conditions * Update logic for handling multiple conditions * fix lint * fix integration tests * Add method to sample remaining columns (3/3) (#708) * Add method to sample remaining columns * update integration tests * add unit tests * update tutorials and docs * Enable batch sampling (#709) * Add batch sampling and progress bar * Make sure to close progress bar * Periodically write to file * add unit tests * cr comments * fix test
Resolves #693