Update FAIR1M dataset and datamodule #1275
Conversation
@adamjstewart can comment on the versionadded stuff, else LGTM
Yes, it does need versionadded.
    don't match

.. versionchanged:: 0.5
   Added *split* and *download* parameters.
The *split* and *download* parameters.
This seems like more of a nitpick. "Added" is more clear. "The" is just a statement.
It should also be versionadded, not versionchanged. The versionadded template already says "New in version 0.5:"
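For reference, a minimal sketch of how the directive might read after switching to versionadded (the signature and surrounding docstring text here are assumptions, not copied from the PR diff):

# Illustrative only: the signature and docstring wording are assumed.
def __init__(self, root: str = "data", split: str = "train", download: bool = False) -> None:
    """Initialize a new FAIR1M dataset instance.

    .. versionadded:: 0.5
       The *split* and *download* parameters.
    """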
from .utils import dataset_split


def collate_fn(batch: list[dict[str, Tensor]]) -> dict[str, Any]:
How is this different from unbind_samples?
Because of the torch.stack on line 27
    .. versionadded:: 0.5
    """
    output: dict[str, Any] = {}
    output["image"] = torch.stack([sample["image"] for sample in batch])
Does this line do anything?
This was mostly copied from nasa_marine_debris.py. I'm assuming it's there for mypy reasons.
But doesn't this just unstack and restack so that the output is identical?
No, this takes the batch (a list of sample dicts) and grabs each image and stacks it into a single tensor along a new batch dimension.
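A minimal, runnable sketch of that stacking behavior (only the "image" key is handled here; how the real collate_fn in this PR treats other keys such as boxes is not shown and is an assumption):

from typing import Any

import torch
from torch import Tensor


def collate_fn(batch: list[dict[str, Tensor]]) -> dict[str, Any]:
    """Stack per-sample images into one batched tensor.

    Each sample holds an 'image' of shape (C, H, W); the output 'image'
    has shape (B, C, H, W) with B == len(batch). Variable-length fields
    (e.g., bounding boxes) would stay as lists rather than being stacked.
    """
    output: dict[str, Any] = {}
    output["image"] = torch.stack([sample["image"] for sample in batch])
    return output


# Two fake 3x4x4 samples collate into a single (2, 3, 4, 4) tensor --
# unlike unbind_samples, which goes the other way (batch -> list of samples).
batch = [{"image": torch.zeros(3, 4, 4)}, {"image": torch.ones(3, 4, 4)}]
print(collate_fn(batch)["image"].shape)  # torch.Size([2, 3, 4, 4])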
os.path.join("validation", "images"), | ||
os.path.join("validation", "labelXml"), | ||
), | ||
"test": (os.path.join("test", "images")), |
Working on a PR now to add type hints to a bunch of stuff. Just noticed this bug. Here "test" is of type str, not tuple[str]. Will fix in my other PR.
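A quick illustration of the bug: without a trailing comma the parentheses are just grouping, so the value is a plain str. The trailing-comma form below is one possible fix, not necessarily the one used in the follow-up PR.

import os

test_value = (os.path.join("test", "images"))
print(type(test_value))  # <class 'str'> -- the parentheses do nothing here

test_tuple = (os.path.join("test", "images"),)
print(type(test_tuple))  # <class 'tuple'> -- trailing comma makes tuple[str]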
Originally we created the FAIR1M dataset when only train/part1 images and labels were available.
This PR updates the FAIR1M dataset and datamodule to work with the latest train/val/test sets.