Faster, more space-efficient tutorials #1124
Conversation
This is waiting on the 100 image version of EuroSAT, right?
Yep, will start trying to integrate that now.
Force-pushed from 724311f to c026e42
Notebook tests are passing for the first time in almost a year!
I think this PR is mostly complete. I couldn't get caching working, but we can figure that out another day. A few things remaining I'm concerned about:
Depending on whether the notebook was saved locally or on Colab, the indentation changes, meaning every single line is changed (maybe we can find a JSON autoformatter to fix this?). Saving on Colab also adds a ton of extraneous metadata that I don't want. We could remove all outputs, which would fix nbmake and reduce the size of the files, but then you wouldn't be able to see plots without running the tutorials. We could also use nbsphinx to generate the outputs, but this would be slow, require downloads, and need to happen on every commit. We would at least want to make smaller downloads for the last 3 tutorials.
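One low-tech way to address both the indentation churn and the stored outputs is to normalize the notebook JSON before committing. Below is a minimal standard-library sketch; the `normalize_notebook` helper and the choice to clear all per-cell metadata are illustrative, not part of this PR (nbstripout does the same job more robustly):

```python
import json


def normalize_notebook(path: str) -> None:
    """Strip outputs and per-cell metadata, then rewrite with fixed indentation."""
    with open(path) as f:
        nb = json.load(f)
    for cell in nb.get("cells", []):
        if cell.get("cell_type") == "code":
            cell["outputs"] = []  # drop stored outputs (plots, logs, etc.)
            cell["execution_count"] = None
        # drop per-cell metadata such as Colab widget/cell ids
        cell.get("metadata", {}).clear()
    with open(path, "w") as f:
        # Jupyter writes .ipynb files with indent=1, so this matches local saves
        json.dump(nb, f, indent=1)
        f.write("\n")
```

Running something like this over the tutorial notebooks before each commit would make diffs stable regardless of whether a notebook was last saved locally or on Colab.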
It would also be nice if we could run isort/flake8/pyupgrade on our notebooks, or even store each notebook in a .py file and auto-convert it to .ipynb on the fly like PyTorch does. But I know less about those options.
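For reference, the .py-to-notebook round trip described above is what tools like jupytext automate. Purely to illustrate the idea (`py_to_ipynb` is a hypothetical helper, not a real API), a percent-format script can be split into notebook code cells like so:

```python
def py_to_ipynb(src: str) -> dict:
    """Toy converter: split a '# %%'-delimited script into notebook code cells."""
    cells: list[list[str]] = []
    chunk: list[str] = []

    def flush() -> None:
        # Only keep chunks that contain non-blank lines
        if any(line.strip() for line in chunk):
            cells.append(chunk.copy())
        chunk.clear()

    for line in src.splitlines():
        if line.startswith("# %%"):  # cell delimiter in percent format
            flush()
        else:
            chunk.append(line)
    flush()

    return {
        "nbformat": 4,
        "nbformat_minor": 5,
        "metadata": {},
        "cells": [
            {
                "cell_type": "code",
                "metadata": {},
                "execution_count": None,
                "outputs": [],
                "source": "\n".join(c).strip("\n"),
            }
            for c in cells
        ],
    }
```

Because the .py source is plain Python, the usual linters (isort, flake8, pyupgrade) would work on it unmodified, which is the main appeal of this layout.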
Opened a discussion for this: #1152. I guess I'm fine merging this PR as is, although it would be nice to decide whether we want to include outputs in tutorials before we merge this PR, which adds 10K lines of code.
Not sure why the tests are crashing again. They were working fine on Colab. May have to try stripping outputs again. |
```diff
 env:
   MLHUB_API_KEY: ${{ secrets.MLHUB_API_KEY }}
-run: pytest --nbmake --nbmake-timeout=3000 docs/tutorials --durations=10
+run: pytest --nbmake --durations=10 --reruns=10 docs/tutorials
```
Can you explain this?
`--durations=10` prints the 10 slowest tests (very useful for seeing which tests are worth speeding up). `--reruns=10` reruns failing tests up to 10 times until they pass.
Even with all the changes in this PR, the tests seem to still fail intermittently. I'm hoping that once treebeardtech/nbmake#80 is solved, the error message will be more useful. Until then, rerunning the tests is necessary to ensure that they pass.
* Speed up notebook tests
* Black fix
* Mock rest of variables
* Undo URL changes
* Update conda deps
* Notebooks also plot images
* Fix undefined variable
* Test with serial data loading
* Use tempfile for all data download directories
* Encode timeout in notebook
* Share datasets across processes
* Fix missing import
* Pretrained Weights: use EuroSAT100
* Transforms: use EuroSAT100
* Trainers: use EuroSAT100
* Blacken
* MPLBACKEND is already Agg by default on Linux
* Indices: use EuroSAT100
* Pretrained Weights: add output
* Pretrained Weights: add output
* Trainers: save output
* Pretrained Weights: ResNet 50 -> 18
* Trainers: better graph
* Indices: add missing plot
* Cache downloads
* Small edit
* Revert "Cache downloads" (reverts commit 5276c53)
* Revert "Revert "Cache downloads"" (reverts commit 137c69e)
* env only
* half env
* Variable with no braces
* Set tmpdir elsewhere
* Give up on tmpdir caching
* Trainers: clear output
* lightning.pytorch package import
* nbstripout
* Rerun upon failure
* Re-add caching
* Rerun failures on release branch too
This PR attempts to simplify and speed up our notebook tests, and includes the following changes:
Closes #665
Closes #1074
Before
Numbers from here.
*failed, likely longer if passed
†duplicate copies of each download
After
Numbers from here.
††shared copies across downloads