BigEarthNet Trainers #211

isaaccorley · 2021-10-29T04:24:21Z

Adds torchgeo.trainers.BigEarthNetClassificationTask and torchgeo.trainers.BigEarthNetDataModule
Adds torchgeo.trainers.tasks.MultiLabelClassficationTask. This differs from ClassificationTask by using torch.nn.BCEWithLogitsLoss and modifies the metrics to handle multilabel outputs
Adds BigEarthNet train configs
Adds unit tests
Added additional BigEarthNet test data to allow for a train/val/test split
Adds estimate band min/max stats based on random samples from the dataset

TODO:

Need to train to get benchmark result and compare to recent papers

adamjstewart

Doesn't this trainer need to be added to train.py?

tests/trainers/test_bigearthnet.py

torchgeo/trainers/__init__.py

conf/bigearthnet.yaml

tests/trainers/test_tasks.py

torchgeo/trainers/bigearthnet.py

torchgeo/trainers/tasks.py

adamjstewart

I still haven't heard a good reason why we can't have a single ClassificationTask and MultiLabelClassificationTask without all of these subclasses.

tests/trainers/test_bigearthnet.py

tests/trainers/test_tasks.py

torchgeo/trainers/__init__.py

isaaccorley · 2021-11-01T19:35:31Z

I refactored test_tasks to use a dummy dataset and datamodule so we can test for varying number of channels but avoid having to repeat tests for bigearthnet and so2sat datamodule specific args. Let me know if this works.

isaaccorley · 2021-11-01T19:37:34Z

You are right thought that there is not a good case for making a separate task for each classification dataset. We can definitely remove them if that's the route you want to take.

adamjstewart · 2021-11-02T01:50:24Z

torchgeo/trainers/bigearthnet.py

+DataLoader.__module__ = "torch.utils.data"
+
+
+class BigEarthNetClassificationTask(MultiLabelClassificationTask):


I think getting rid of this class is the last remaining thing to do before this can be merged.

Personally I would refactor ClassificationTask and MultiLabelClassificationTask first in separate PRs to keep this PR from getting any bigger.

We should merge this because I still need to refactor the num classes as an input arg before I can remove it. If I do that then I will need to remove/refactor all other classification tasks as well.

* add additional bigearthnet test data for train/val/test split * update bigearthnet dataset length test * add MultiLabelClassificationTask * add BigEarthNet trainer and datamodule * add bigearthnet and multilabelclassificationtask tests * mypy and format * add estimated band min/max values for normalization * softmax outputs to correctly compute metrics * update min/max stats for 100k samples * organize imports in torchgeo.trainers.__init__.py * clean up fixtures in test_tasks.py * added bigearthnet to train.py * format * move fixtures into class methods * consolidate bigearthnet fixtures * refactor tasks tests * add scope=class * style/mypy fixes * mypy fixes

isaaccorley added the trainers PyTorch Lightning trainers label Oct 29, 2021

isaaccorley requested a review from calebrob6 October 29, 2021 04:24

isaaccorley self-assigned this Oct 29, 2021

isaaccorley requested a review from adamjstewart October 29, 2021 04:24

adamjstewart requested changes Oct 29, 2021

View reviewed changes

isaaccorley added 14 commits October 31, 2021 21:06

add additional bigearthnet test data for train/val/test split

e70f90d

update bigearthnet dataset length test

fbaa8b5

add MultiLabelClassificationTask

5f50ec2

add BigEarthNet trainer and datamodule

aa3891b

add bigearthnet and multilabelclassificationtask tests

48d1286

mypy and format

48a16d5

add estimated band min/max values for normalization

2141856

softmax outputs to correctly compute metrics

407b4aa

update min/max stats for 100k samples

928d13b

organize imports in torchgeo.trainers.__init__.py

41467c9

clean up fixtures in test_tasks.py

58a09bd

added bigearthnet to train.py

2064955

format

a46d3e7

move fixtures into class methods

d6e548b

isaaccorley force-pushed the trainers/bigearthnet branch from c4d243e to d6e548b Compare November 1, 2021 02:07

consolidate bigearthnet fixtures

05f3230

adamjstewart requested changes Nov 1, 2021

View reviewed changes

tests/trainers/test_bigearthnet.py Outdated Show resolved Hide resolved

tests/trainers/test_tasks.py Outdated Show resolved Hide resolved

tests/trainers/test_tasks.py Outdated Show resolved Hide resolved

torchgeo/trainers/__init__.py Outdated Show resolved Hide resolved

isaaccorley added 3 commits November 1, 2021 14:12

refactor tasks tests

8e2cd4f

add scope=class

14f6e45

merge

23dc466

isaaccorley requested a review from adamjstewart November 1, 2021 19:34

isaaccorley added 2 commits November 1, 2021 14:59

style/mypy fixes

ae16fdb

mypy fixes

503a558

adamjstewart reviewed Nov 2, 2021

View reviewed changes

adamjstewart approved these changes Nov 2, 2021

View reviewed changes

adamjstewart merged commit 3cc63de into microsoft:main Nov 2, 2021

isaaccorley deleted the trainers/bigearthnet branch November 2, 2021 16:29

adamjstewart added this to the 0.1.0 milestone Nov 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BigEarthNet Trainers #211

BigEarthNet Trainers #211

isaaccorley commented Oct 29, 2021

adamjstewart left a comment

adamjstewart left a comment

isaaccorley commented Nov 1, 2021

isaaccorley commented Nov 1, 2021

adamjstewart Nov 2, 2021

adamjstewart Nov 2, 2021 •

edited

Loading

isaaccorley Nov 2, 2021

		DataLoader.__module__ = "torch.utils.data"


		class BigEarthNetClassificationTask(MultiLabelClassificationTask):

BigEarthNet Trainers #211

BigEarthNet Trainers #211

Conversation

isaaccorley commented Oct 29, 2021

adamjstewart left a comment

Choose a reason for hiding this comment

adamjstewart left a comment

Choose a reason for hiding this comment

isaaccorley commented Nov 1, 2021

isaaccorley commented Nov 1, 2021

adamjstewart Nov 2, 2021

Choose a reason for hiding this comment

adamjstewart Nov 2, 2021 • edited Loading

Choose a reason for hiding this comment

isaaccorley Nov 2, 2021

Choose a reason for hiding this comment

adamjstewart Nov 2, 2021 •

edited

Loading