add support for detection.mask_path #5120

sashankaryal · 2024-11-15T02:28:35Z

What changes are proposed in this pull request?

Continuation of #4693. This PR adds support for detection.mask_path. Works for both grayscale and multi-channel masks.

Until now, this was the only label type which didn't have support for on-disk masks.

How is this patch tested? If it is not, please explain why.

Golden path e2e. Also exported and re-imported the following dataset with and verified to be working.

To test locally, run the following script:

import fiftyone as fo
import numpy as np

from PIL import Image

dataset = fo.load_dataset("test-detection-masks", create_if_necessary=True)
dataset.clear()

samples = []

# sample with bad detection mask
test_image_np = np.linspace(0, 255, 100*100, dtype=np.uint8).reshape(100,100)
test_image = Image.fromarray(test_image_np)
test_image_path = "/tmp/bad-detection-mask-sample.png"
test_image.save(test_image_path)

badDetectionMaskSample = fo.Sample(filepath=test_image_path)
badDetectionMaskSample["ground_truth"] = fo.Detections(
    detections=[
        fo.Detection(
            label="bad_mask_detection",
            bounding_box=[0.0, 0.0, 0.0, 0.0],
            mask=np.empty((0, 0)),
        ),
    ]
)
samples.append(badDetectionMaskSample)

# sample with good detection mask
test_image_np = np.ones((255, 255), dtype=np.uint8) * 120
test_image = Image.fromarray(test_image_np)
test_image_path = "/tmp/good-detection-mask-sample.png"
test_image.save(test_image_path)

goodDetectionMaskSample = fo.Sample(filepath=test_image_path)
goodDetectionMaskSample["ground_truth"] = fo.Detections(
    detections=[
        fo.Detection(
            label="good_mask_detection",
            bounding_box=[0.0, 0.0, 0.5, 0.5],
            mask=np.ones((15, 15)),
        ),
    ]
)
samples.append(goodDetectionMaskSample)

# sample with good detection mask _path_
test_image_np = np.ones((255, 255, 3), dtype=np.uint8) * 60
test_image = Image.fromarray(test_image_np)
test_image_path = "/tmp/good-detection-mask-path-sample.png"
test_image.save(test_image_path)

def generate_alternating_mask(w, h, channels=1):
    # create a 2D alternating pattern (255 and 0)
    pattern = np.tile([255, 0], (h, w // 2 + 1))[:, :w]
    
    # if channels > 1, stack the pattern for each channel
    if channels > 1:
        pattern = np.stack([pattern] * channels, axis=-1)
    
    return Image.fromarray(pattern.astype(np.uint8))

mask = generate_alternating_mask(15, 15, 3)
mask_path = "/tmp/on-disk-detection-mask.png"
mask.save(mask_path)

goodDetectionMaskPathSample = fo.Sample(filepath=test_image_path)
goodDetectionMaskPathSample["prediction"] = fo.Detection(
            label="good_mask_detection_path",
            bounding_box=[0.0, 0.0, 0.5, 0.5],
            mask_path=mask_path,
        )
samples.append(goodDetectionMaskPathSample)

dataset.add_samples(samples)

You should get a dataset with samples that look like:

Release Notes

Is this a user-facing change that should be mentioned in the release notes?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release
notes for FiftyOne users.

What areas of FiftyOne does this PR affect?

App: FiftyOne application changes
Build: Build and test infrastructure changes
Core: Core fiftyone Python library changes
Documentation: FiftyOne documentation changes
Other

Summary by CodeRabbit

New Features
- Introduced new attributes and methods for better management of detection and segmentation masks, including mask_path, has_mask, get_mask, import_mask, and export_mask.
Improvements
- Enhanced handling of asynchronous operations in detection processing.
- Updated methods across various modules to consistently check for mask validity using has_mask().
- Improved efficiency in pixel processing logic and color assignment based on mask data.
- Expanded test coverage and functionality for detection masks, improving sample management and assertions.
- Enhanced FiftyOne Datasets documentation for clarity and usability, including updates on instance segmentation, dynamic attributes, and batch operations.
Bug Fixes
- Improved robustness in mask-checking logic across multiple functionalities.

coderabbitai · 2024-11-15T02:28:42Z

Walkthrough

The changes introduced in this pull request enhance the handling of mask data within the Detection and Segmentation classes, including the addition of a mask_path attribute. The imputeOverlayFromPath function has been updated to improve asynchronous handling of detections. Additionally, modifications were made in various utility files to ensure consistency in mask validation across different contexts. Overall, the updates focus on improving the management and retrieval of mask data in the FiftyOne library.

Changes

File Path	Change Summary
`app/packages/looker/src/worker/index.ts`	Added constant `DETECTION`; modified `imputeOverlayFromPath` to use `for...of` loop for async handling of detections.
`fiftyone/core/labels.py`	Added `mask_path` attribute to `Detection` and `Segmentation` classes; introduced `has_mask`, `get_mask`, `import_mask`, and `export_mask` methods.
`fiftyone/utils/coco.py`	Updated `from_label` and `_coco_objects_to_detections` methods to check for masks using `has_mask()` instead of direct checks.
`fiftyone/utils/cvat.py`	Modified `_create_detection_shapes` to check for masks using `has_mask()` instead of direct checks.
`fiftyone/utils/eta.py`	Changed mask retrieval in `to_detected_object` from `detection.mask` to `detection.get_mask()`.
`e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts`	Enhanced tests for detection masks with multiple sample images and updated assertions to verify correct loading of masks.
`e2e-pw/src/oss/specs/plugins/histograms.spec.ts`	Added verification for the new `"detections.detections.mask_path"` field in the histograms panel test case.

Possibly related issues

[FR] Allow mask_path in Detection labels #4486: The changes introduce a mask_path in the Detection class, aligning with the proposal to allow mask_path in Detection labels, which would enable the use of masks saved on disk.

Possibly related PRs

Add mask_path to fo.Detection labels #4693: This PR adds a mask_path to the Detection class, which is directly related to the changes in the main PR that modifies the imputeOverlayFromPath function to enhance handling of detections, including the use of the DETECTION constant.
Fix COCO category IDs #4884: The changes in COCO category IDs and the handling of masks are relevant as they pertain to the management of detection masks, which is a focus of the main PR.
performance: use offscreen canvas to decode on-disk masks #5089: This PR introduces an offscreen canvas for decoding on-disk masks, which aligns with the main PR's focus on improving the handling of asynchronous operations related to detections and overlays.

Suggested labels

feature, bug

Suggested reviewers

brimoor
sashankaryal

🐇 In the fields of code we hop,
With masks and paths, we never stop!
Detection's grace, a new embrace,
Segmentation's friend, we find our place.
Asynchronous dreams, we weave and play,
In FiftyOne's world, we leap all day! 🌟

📜 Recent review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 52bde0e and 62763c8.

📒 Files selected for processing (1)

fiftyone/core/labels.py (6 hunks)

🔇 Additional comments (5)

fiftyone/core/labels.py (5)

44-48: Well-designed mixin class!

The _HasMedia mixin follows good OOP principles by providing a reusable interface for label classes that contain media fields.

Line range hint 398-420: LGTM! Well-documented changes

The class documentation clearly explains the new mask_path attribute and its purpose.

652-654: LGTM! Consistent usage of new mask property

The code correctly uses the new has_mask property and provides appropriate warning messages.

🧰 Tools

🪛 Ruff

654-654: No explicit stacklevel keyword argument found

(B028)

Line range hint 4-6: Verify test coverage for new mask functionality

While the implementation looks solid, ensure comprehensive test coverage for the new mask functionality, including:

Edge cases for mask file handling
Error scenarios for file I/O
Integration tests with the existing detection pipeline

Run the following script to check test coverage:

✅ Verification successful

Let me gather more information about the test coverage by checking the actual test implementations and any related mask functionality in the codebase.

Test coverage for mask functionality appears comprehensive

The codebase shows thorough test coverage for mask-related functionality including:

Mask transformation tests (int-to-int, int-to-rgb, rgb-to-int, rgb-to-rgb)
Instance mask parsing and validation
Mask targets handling with validation for both numeric and RGB formats
Edge cases for mask file handling in import/export scenarios
Integration with the detection pipeline through segmentation tests

The test suite adequately covers:

Data validation and error scenarios
Format conversions and transformations
File I/O operations
Integration with the broader detection system

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check test coverage for mask-related functionality

# Look for test files
echo "Searching for test files..."
fd -e py test_ --exec rg -l "mask_path|has_mask|import_mask|export_mask" {}

# Look for specific test cases
echo "Searching for mask-related test cases..."
rg -A 5 "def test.*mask"

Length of output: 1088

Script:

#!/bin/bash
# Check the implementation of mask-related tests
echo "Checking mask test implementations..."
rg -B 2 -A 10 "test.*mask" tests/unittests/label_tests.py tests/unittests/dataset_tests.py

# Look for mask-related functionality in core code
echo -e "\nChecking core mask functionality..."
rg -l "mask" fiftyone/core/

# Check for mask-related assertions and test utilities
echo -e "\nChecking test assertions and utilities..."
rg "assert.*mask" tests/

Length of output: 8182

428-478: 🛠️ Refactor suggestion

Add error handling for mask operations

The mask import/export methods should handle potential I/O errors gracefully.

Apply this diff to improve error handling:

     def import_mask(self, update=False):
         if self.mask_path is not None:
-            self.mask = _read_mask(self.mask_path)
+            try:
+                self.mask = _read_mask(self.mask_path)
+            except (IOError, OSError) as e:
+                raise IOError(f"Failed to read mask from {self.mask_path}: {e}")

             if update:
                 self.mask_path = None

     def export_mask(self, outpath, update=False):
         if self.mask_path is not None:
-            etau.copy_file(self.mask_path, outpath)
+            try:
+                etau.copy_file(self.mask_path, outpath)
+            except (IOError, OSError) as e:
+                raise IOError(f"Failed to copy mask from {self.mask_path} to {outpath}: {e}")
         else:
-            _write_mask(self.mask, outpath)
+            try:
+                _write_mask(self.mask, outpath)
+            except (IOError, OSError) as e:
+                raise IOError(f"Failed to write mask to {outpath}: {e}")

             if update:
                 self.mask = None
                 self.mask_path = outpath

Likely invalid or redundant comment.

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (1)

app/packages/looker/src/worker/index.ts (1)

114-125: Consider parallel processing for better performance

While the current sequential processing is safe, consider using Promise.all with proper error handling for better performance when processing multiple detections:

-    for (const detection of label.detections) {
-      await imputeOverlayFromPath(
-        field,
-        detection,
-        coloring,
-        customizeColorSetting,
-        colorscale,
-        buffers,
-        {},
-        DETECTION
-      );
-    }
+    await Promise.all(
+      label.detections.map((detection) =>
+        imputeOverlayFromPath(
+          field,
+          detection,
+          coloring,
+          customizeColorSetting,
+          colorscale,
+          buffers,
+          {},
+          DETECTION
+        ).catch(error => {
+          console.error('Failed to process detection:', error);
+          // Handle or rethrow based on requirements
+          throw error;
+        })
+      )
+    );

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between d0f2abd and 22593b9.

📒 Files selected for processing (5)

app/packages/looker/src/worker/index.ts (2 hunks)
fiftyone/core/labels.py (2 hunks)
fiftyone/utils/coco.py (2 hunks)
fiftyone/utils/cvat.py (1 hunks)
fiftyone/utils/eta.py (1 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

app/packages/looker/src/worker/index.ts (1)

Pattern **/*.{ts,tsx}: Review the Typescript and React code for conformity with best practices in React, Recoil, Graphql, and Typescript. Highlight any deviations.

🔇 Additional comments (10)

app/packages/looker/src/worker/index.ts (2)

8-8: LGTM: Import addition is correct and well-placed

The addition of the DETECTION constant import aligns with the PR's objective of enhancing detection mask support.

114-125: LGTM: Improved async handling for detection masks

The change from forEach to for...of with await ensures proper sequential processing of detection masks, preventing potential race conditions and memory leaks.

fiftyone/utils/eta.py (1)

599-599: LGTM! Proper encapsulation of mask access.

The change from direct mask access to using get_mask() method aligns with the PR objective of supporting on-disk masks.

Let's verify the implementation of the get_mask() method:

✅ Verification successful

Verified: get_mask() properly handles both in-memory and on-disk masks

The implementation correctly handles both cases:

Returns self.mask if the mask is stored in memory
Falls back to loading from self.mask_path if mask is stored on disk
Returns None if neither is available

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check the implementation of get_mask() method in Detection class
# Expected: Method that handles both in-memory and on-disk masks

ast-grep --pattern 'class Detection {
  $$$
  get_mask($$$) {
    $$$
  }
  $$$
}'

Length of output: 84

Script:

#!/bin/bash
# Let's try a broader search for Detection class and get_mask method
rg "class Detection" -A 20

# Also search for get_mask method definition separately
rg "def get_mask"

# Search for any imports of Detection class to find its location
rg "from.*Detection.*import"

Length of output: 10373

Script:

#!/bin/bash
# Let's check the implementation of get_mask in Detection class
rg "def get_mask" -A 10 fiftyone/core/labels.py

Length of output: 656

fiftyone/utils/coco.py (2)

1307-1307: Good use of encapsulation with has_mask() method

Replacing direct mask attribute access with the has_mask() method improves encapsulation and makes the code more maintainable.

2119-2119: Consistent use of has_mask() method

Good consistency in using the has_mask() method here as well, maintaining the encapsulation pattern throughout the codebase.

fiftyone/utils/cvat.py (1)

6403-6403: Verify the implementation of has_mask() method

The change from det.mask to det.has_mask() looks good as it aligns with the support for detection.mask_path. This change allows for checking both in-memory masks and on-disk masks.

Run the following script to verify the implementation:

fiftyone/core/labels.py (4)

421-424: Good implementation of the has_mask property

The has_mask property accurately checks for the presence of a mask, whether it's in-memory (mask) or stored on disk (mask_path).

426-439: get_mask method correctly retrieves the mask

The get_mask method properly returns the mask from the mask attribute or reads it from mask_path when mask is not available.

455-472: export_mask method handles mask exporting appropriately

The export_mask method effectively exports the mask to the specified outpath, managing both cases where the mask is in-memory or referenced by mask_path.

527-528: Enhanced code clarity by using get_mask() in to_segmentation

Utilizing self.get_mask() in the to_segmentation method ensures that the mask is correctly obtained from either mask or mask_path, improving code maintainability.

fiftyone/core/labels.py

coderabbitai

Actionable comments posted: 2

🧹 Outside diff range and nitpick comments (1)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (1)

10-14: Consider using dynamic temp directories instead of hardcoded paths

Using hardcoded paths in /tmp might cause issues in different environments or parallel test runs. Consider using Node's os.tmpdir() with unique subdirectories for test isolation.

-const badDetectionMaskSampleImage = "/tmp/detection-bad-mask-img.png";
-const goodDetectionMaskSampleImage = "/tmp/detection-good-mask-img.png";
-const goodDetectionMaskPathSampleImage = "/tmp/detection-mask-path-img.png";
-const goodDetectionMaskOnDisk = "/tmp/detection-mask-on-disk.png";
+const testDir = path.join(os.tmpdir(), `detection-mask-${Date.now()}`);
+const badDetectionMaskSampleImage = path.join(testDir, "detection-bad-mask-img.png");
+const goodDetectionMaskSampleImage = path.join(testDir, "detection-good-mask-img.png");
+const goodDetectionMaskPathSampleImage = path.join(testDir, "detection-mask-path-img.png");
+const goodDetectionMaskOnDisk = path.join(testDir, "detection-mask-on-disk.png");

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 22593b9 and db07f00.

⛔ Files ignored due to path filters (1)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts-snapshots/grid-detections-chromium-darwin.png is excluded by !**/*.png, !**/*.png

📒 Files selected for processing (1)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (4 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (1)

Pattern **/*.{ts,tsx}: Review the Typescript and React code for conformity with best practices in React, Recoil, Graphql, and Typescript. Highlight any deviations.

🔇 Additional comments (1)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (1)

Line range hint 54-90: Well-structured test data covering multiple scenarios

The test data setup effectively covers three important scenarios:

Invalid mask (empty array)
Valid in-memory mask
Valid on-disk mask path

This provides good coverage for the new detection.mask_path feature.

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts

coderabbitai

Actionable comments posted: 2

🧹 Outside diff range and nitpick comments (2)

e2e-pw/src/oss/specs/plugins/histograms.spec.ts (1)

Line range hint 16-43: Enhance test data to include mask_path samples.

The test data setup creates Detection objects but doesn't include mask_path values. Consider enhancing the test coverage by including samples with mask paths.

Here's a suggested modification to the test data setup:

    samples = []
    for i in range(0, 10):
        sample = fo.Sample(
            filepath=f"{i}.png",
-           detections=fo.Detections(detections=[fo.Detection(label=f"label-{i}")]),
+           detections=fo.Detections(detections=[
+               fo.Detection(
+                   label=f"label-{i}",
+                   mask_path=f"masks/{i}_mask.png"
+               )
+           ]),
            classification=fo.Classification(label=f"label-{i}"),

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (1)

26-40: Add error handling for image creation

The Promise.all for image creation should include error handling to ensure test stability and proper cleanup on failure.

 await Promise.all(
   [
     badDetectionMaskSampleImage,
     goodDetectionMaskSampleImage,
     goodDetectionMaskPathSampleImage,
   ].map((img, index) => {
     const fillColor = colors[index];
-    mediaFactory.createBlankImage({
-      outputPath: img,
-      width: 25,
-      height: 25,
-      fillColor: fillColor,
-    });
+    return mediaFactory.createBlankImage({
+      outputPath: img,
+      width: 25,
+      height: 25,
+      fillColor: fillColor,
+    }).catch(error => {
+      console.error(`Failed to create test image ${img}:`, error);
+      throw error;
+    });
   })
 );

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between db07f00 and 11fb58d.

⛔ Files ignored due to path filters (1)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts-snapshots/grid-detections-chromium-darwin.png is excluded by !**/*.png, !**/*.png

📒 Files selected for processing (7)

app/packages/looker/src/worker/index.ts (2 hunks)
e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (4 hunks)
e2e-pw/src/oss/specs/plugins/histograms.spec.ts (1 hunks)
fiftyone/core/labels.py (2 hunks)
fiftyone/utils/coco.py (2 hunks)
fiftyone/utils/cvat.py (1 hunks)
fiftyone/utils/eta.py (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (3)

fiftyone/utils/coco.py
fiftyone/utils/cvat.py
fiftyone/utils/eta.py

🧰 Additional context used

📓 Path-based instructions (3)

app/packages/looker/src/worker/index.ts (1)

Pattern **/*.{ts,tsx}: Review the Typescript and React code for conformity with best practices in React, Recoil, Graphql, and Typescript. Highlight any deviations.

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (1)

Pattern **/*.{ts,tsx}: Review the Typescript and React code for conformity with best practices in React, Recoil, Graphql, and Typescript. Highlight any deviations.

e2e-pw/src/oss/specs/plugins/histograms.spec.ts (1)

Pattern **/*.{ts,tsx}: Review the Typescript and React code for conformity with best practices in React, Recoil, Graphql, and Typescript. Highlight any deviations.

🔇 Additional comments (11)

e2e-pw/src/oss/specs/plugins/histograms.spec.ts (1)

62-62: LGTM! Field addition is consistent with PR objectives.

The addition of "detections.detections.mask_path" to the verification list aligns with the PR's goal of adding mask path support.

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (1)

73-74: Validate bounding box coordinates and mask dimensions

The bounding box coordinates [0.0, 0.0, 0.5, 0.5] and mask dimensions (15x15) are hardcoded. Consider:

Adding validation for bounding box coordinates (should be normalized [0-1])
Making mask dimensions proportional to the image size (25x25)

app/packages/looker/src/worker/index.ts (2)

8-8: LGTM: Clean import addition

The addition of the DETECTION constant import is well-placed and aligns with the PR's objective of supporting detection mask paths.

114-125: LGTM: Improved async handling for detection processing

The changes improve the handling of detections by:

Using for...of loop to properly support async/await
Using the type-safe DETECTION constant
Ensuring proper sequential processing of detection masks

This enhancement prevents potential race conditions in mask processing and aligns with the PR's objective.

fiftyone/core/labels.py (7)

412-413: LGTM! Class attribute follows established patterns.

The _MEDIA_FIELD class attribute correctly identifies the media path field and follows the same pattern used by other classes in the codebase.

404-405: LGTM! Field declaration and documentation are appropriate.

The mask_path field is correctly declared as a StringField and well-documented in the class docstring.

Also applies to: 417-417

421-424: LGTM! Property implementation is clean and consistent.

The has_mask property correctly checks both in-memory and on-disk mask representations.

426-438: LGTM! Method handles mask retrieval appropriately.

The get_mask method implements a clear priority between in-memory and on-disk masks, with proper documentation.

449-454: LGTM! Method correctly handles mask import.

The implementation properly handles importing the mask from disk and updating the path attribute based on the update parameter.

455-471: LGTM! Method handles mask export comprehensively.

The export_mask method correctly handles both file copying and mask writing scenarios, with proper state management based on the update parameter.

527-528: LGTM! Method properly uses the new mask retrieval approach.

The to_segmentation method has been correctly updated to use the new get_mask method for consistent mask handling.

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (3)

fiftyone/core/labels.py (3)

421-424: Enhance the docstring for the has_mask property

Consider expanding the docstring to clarify that it checks both in-memory and on-disk masks.

 @property
 def has_mask(self):
-    """Whether this instance has a mask."""
+    """Whether this instance has a mask, either in memory or on disk."""
     return self.mask is not None or self.mask_path is not None

426-438: Enhance the docstring for the get_mask method

Consider expanding the docstring to specify the return type and clarify the priority order.

 def get_mask(self):
-    """Returns the detection mask for this instance.
+    """Returns the detection mask for this instance, prioritizing in-memory mask over on-disk mask.
 
     Returns:
-        a numpy array, or ``None``
+        a numpy array of shape (height, width) containing the mask data, or ``None`` if no mask exists
     """

440-470: Add mask format specification in docstrings

The methods would benefit from explicit documentation of the mask format.

 def import_mask(self, update=False):
     """Imports this instance's mask from disk to its :attr:`mask`
-    attribute.
+    attribute. The mask should be a 2D binary or 0/1 integer numpy array.

     Args:
         update (False): whether to clear this instance's :attr:`mask_path`
             attribute after importing
     """

 def export_mask(self, outpath, update=False):
     """Exports this instance's mask to the given path.
+
+    The mask is written as a 2D binary image file.

     Args:
         outpath: the path to write the mask
         update (False): whether to clear this instance's :attr:`mask`
             attribute and set its :attr:`mask_path` attribute when
             exporting in-database segmentations
     """

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 11fb58d and 5d2dedc.

📒 Files selected for processing (1)

fiftyone/core/labels.py (2 hunks)

🔇 Additional comments (2)

fiftyone/core/labels.py (2)

404-405: LGTM: Well-structured media field implementation

The addition of mask_path attribute and _MEDIA_FIELD class variable follows the established pattern for media fields in the codebase.

Also applies to: 412-413

526-527: LGTM: Clean integration with new mask handling

The update to use get_mask() properly integrates with the new mask handling system.

coderabbitai

Actionable comments posted: 4

🧹 Outside diff range and nitpick comments (2)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (2)
8-8: Consider adding JSDoc comment for the colors array

Add documentation explaining the purpose of each color in the array and its relationship to the test cases.
+/** Colors used for creating test images:
+ * - Red (#ff0000): Bad detection mask
+ * - Green (#00ff00): Good detection mask
+ * - Blue (#0000ff): Good detection mask with path
+ */
const colors = ["#ff0000", "#00ff00", "#0000ff"];
Line range hint 54-65: Consider parameterizing the test data

The hardcoded values for bounding boxes and mask dimensions could be defined as constants at the top of the file for better maintainability.
+const TEST_CONFIG = {
+  emptyMask: {
+    bbox: [0.0, 0.0, 0.0, 0.0],
+    maskDims: [0, 0]
+  },
+  goodMask: {
+    bbox: [0.0, 0.0, 0.5, 0.5],
+    maskDims: [15, 15]
+  }
+};

# Then use in the code:
-                    bounding_box=[0.0, 0.0, 0.0, 0.0],
-                    mask=np.empty((0, 0)),
+                    bounding_box=TEST_CONFIG.emptyMask.bbox,
+                    mask=np.empty(TEST_CONFIG.emptyMask.maskDims),

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 5d2dedc and 9fe5978.

⛔ Files ignored due to path filters (1)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts-snapshots/grid-detections-chromium-darwin.png is excluded by !**/*.png, !**/*.png

📒 Files selected for processing (7)

app/packages/looker/src/worker/index.ts (2 hunks)
e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (4 hunks)
e2e-pw/src/oss/specs/plugins/histograms.spec.ts (1 hunks)
fiftyone/core/labels.py (2 hunks)
fiftyone/utils/coco.py (2 hunks)
fiftyone/utils/cvat.py (1 hunks)
fiftyone/utils/eta.py (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (4)

app/packages/looker/src/worker/index.ts
e2e-pw/src/oss/specs/plugins/histograms.spec.ts
fiftyone/utils/cvat.py
fiftyone/utils/eta.py

🧰 Additional context used

📓 Path-based instructions (1)

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts (1)

Pattern **/*.{ts,tsx}: Review the Typescript and React code for conformity with best practices in React, Recoil, Graphql, and Typescript. Highlight any deviations.

🔇 Additional comments (6)

fiftyone/core/labels.py (6)

404-405: LGTM! Well-documented mask_path attribute

The mask_path attribute and _MEDIA_FIELD constant are properly implemented and documented.

Also applies to: 412-413, 417-417

421-424: LGTM! Clear and correct has_mask property

The has_mask property correctly checks for both in-memory and on-disk masks.

426-438: LGTM! Well-implemented get_mask method

The get_mask method provides a clean interface for retrieving masks from either memory or disk.

440-452: Skip comment about incorrect docstring

A previous review has already identified the incorrect 'outpath' reference in the docstring.

454-470: LGTM! Comprehensive export_mask implementation

The export_mask method effectively handles all mask export scenarios with proper memory management options.

526-527: LGTM! Proper integration with get_mask

The to_segmentation method correctly uses the new get_mask method for consistent mask handling.

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts

fiftyone/utils/coco.py

coderabbitai

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (2)

fiftyone/core/labels.py (2)

Line range hint 397-418: Add return type hints to docstrings

The class docstring should specify return types for properties and methods that return values.

Line range hint 1150-1250: Consider extracting common mask handling logic

The Detection and Segmentation classes have identical mask handling methods (has_mask, get_mask, import_mask, export_mask). Consider extracting these into a mixin class to reduce code duplication.

Create a new mixin class like this:

class _HasMask(_HasMedia):
    """Mixin for Label classes that have mask functionality."""
    
    @property
    def has_mask(self):
        """Whether this instance has a mask."""
        return self.mask is not None or self.mask_path is not None

    def get_mask(self):
        """Returns the mask for this instance."""
        if self.mask is not None:
            return self.mask

        if self.mask_path is not None:
            return _read_mask(self.mask_path)

        return None

    def import_mask(self, update=False):
        """Imports this instance's mask from disk."""
        if self.mask_path is not None:
            try:
                self.mask = _read_mask(self.mask_path)
                if update:
                    self.mask_path = None
            except (IOError, OSError) as e:
                raise IOError(f"Failed to read mask from {self.mask_path}: {e}")

    def export_mask(self, outpath, update=False):
        """Exports this instance's mask to disk."""
        try:
            if self.mask_path is not None:
                etau.copy_file(self.mask_path, outpath)
            else:
                _write_mask(self.mask, outpath)
                if update:
                    self.mask = None
                    self.mask_path = outpath
        except (IOError, OSError) as e:
            raise IOError(f"Failed to write mask to {outpath}: {e}")

Then update the class definitions:

-class Detection(_HasAttributesDict, _HasID, _HasMedia, Label):
+class Detection(_HasAttributesDict, _HasID, _HasMask, Label):

-class Segmentation(_HasID, _HasMedia, Label):
+class Segmentation(_HasID, _HasMask, Label):

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 653badf and 229296b.

📒 Files selected for processing (1)

fiftyone/core/labels.py (4 hunks)

🔇 Additional comments (1)

fiftyone/core/labels.py (1)

43-48: Well-designed mixin class!

The _HasMedia mixin provides a clean abstraction for labels with media fields, following good OOP principles.

fiftyone/core/labels.py

benjaminpkane

LGTM 🚀

I think some follow up work might need to happen for annotation integrations

benjaminpkane · 2024-11-19T15:27:55Z

fiftyone/core/labels.py

@@ -1044,7 +1100,6 @@ def import_mask(self, update=False):
        attribute.

        Args:
-            outpath: the path to write the map


NIce catch 🚀

fiftyone/core/labels.py

benjaminpkane · 2024-11-19T16:02:57Z

app/packages/looker/src/worker/painter.ts

+      // putImageData results in an UInt8ClampedArray (for both grayscale or RGB masks),
+      // where each pixel is represented by 4 bytes (RGBA)
+      // it's packed like: [R, G, B, A, R, G, B, A, ...]
+      // use first channel info to determine if the pixel is in the mask
+      // skip second (G), third (B) and fourth (A) channels


Ok, we are always loading a 32 bit image when mask_path is used

sashankaryal requested a review from a team November 15, 2024 02:28

sashankaryal self-assigned this Nov 15, 2024

coderabbitai bot reviewed Nov 15, 2024

View reviewed changes

fiftyone/core/labels.py Outdated Show resolved Hide resolved

sashankaryal mentioned this pull request Nov 15, 2024

Add mask_path to fo.Detection labels #4693

Merged

7 tasks

coderabbitai bot reviewed Nov 15, 2024

View reviewed changes

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts Show resolved Hide resolved

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts Show resolved Hide resolved

sashankaryal force-pushed the feat/on-disk-detection-mask-path branch from db07f00 to a79cf96 Compare November 15, 2024 04:12

coderabbitai bot reviewed Nov 15, 2024

View reviewed changes

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts Show resolved Hide resolved

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts Show resolved Hide resolved

coderabbitai bot reviewed Nov 15, 2024

View reviewed changes

sashankaryal force-pushed the feat/on-disk-detection-mask-path branch 2 times, most recently from 7e2e35e to fb8b322 Compare November 15, 2024 16:41

Laurent2916 and others added 6 commits November 15, 2024 10:42

add mask_path to Detection label

3aaff99

apply suggestions

69f60d3

fix docstrings

5fcacd0

fix detection mask path parsing bug

d7a39e7

add e2e for detection mask paths

8d09f0f

add detections.detections.mask_path to histogram e2e test

9fe5978

sashankaryal force-pushed the feat/on-disk-detection-mask-path branch from fb8b322 to 9fe5978 Compare November 15, 2024 16:42

coderabbitai bot reviewed Nov 15, 2024

View reviewed changes

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts Show resolved Hide resolved

e2e-pw/src/oss/specs/overlays/detection-mask.spec.ts Show resolved Hide resolved

fiftyone/utils/coco.py Outdated Show resolved Hide resolved

fiftyone/utils/coco.py Outdated Show resolved Hide resolved

sashankaryal added 7 commits November 15, 2024 17:11

account for 1-channel or multi-channel masks

653555a

fetch as blob instead of arraybuffer and casting into blob again

8967291

add linux screenshot

7b89cb3

add exception handling for mask path decoding

bd9fa59

don't invoke has_mask as method

653badf

_HasMedia mixin in Detection class, too

229296b

collect promises

c981c49

coderabbitai bot reviewed Nov 18, 2024

View reviewed changes

fiftyone/core/labels.py Show resolved Hide resolved

sashankaryal added 3 commits November 18, 2024 15:02

fix name collision bug in self.mask

252961c

document mask_path for instance segmentations

401c7ca

use allSettled

52bde0e

lint

62763c8

benjaminpkane approved these changes Nov 19, 2024

View reviewed changes

sashankaryal merged commit 6125f1d into develop Nov 19, 2024
14 checks passed

sashankaryal deleted the feat/on-disk-detection-mask-path branch November 19, 2024 17:33

sashankaryal mentioned this pull request Nov 22, 2024

[FR] Support non-binary/grayscale masks in fo.Detection #4961

Open

6 tasks

Laurent2916 mentioned this pull request Nov 25, 2024

[FR] Allow mask_path in Detection labels #4486

Closed

6 tasks

This was referenced Nov 26, 2024

Merge release/v1.1.0 to develop #5190

Merged

[Docs] Features + release notes #5188

Merged

This was referenced Dec 6, 2024

fix import timeouts on increasing datasets by precomputing batch size #5231

Closed

Support on-disk instance segmentations in SDK #5256

Merged

Merge release/v1.2.0 to develop #5265

Merged

coderabbitai bot mentioned this pull request Dec 20, 2024

Merge release/v1.2.0 to develop #5302

Merged

coderabbitai bot mentioned this pull request Jan 10, 2025

add detection mask threshold to dataset app config #5369

Open

7 tasks

coderabbitai bot mentioned this pull request Jan 28, 2025

Add author/license to all zoo models/datasets #5438

Merged

coderabbitai bot mentioned this pull request Feb 23, 2025

map_samples() and update_samples() #5480

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add support for detection.mask_path #5120

add support for detection.mask_path #5120

sashankaryal commented Nov 15, 2024 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 15, 2024 •

edited

Loading

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot left a comment

benjaminpkane left a comment

benjaminpkane Nov 19, 2024

benjaminpkane Nov 19, 2024

add support for detection.mask_path #5120

add support for detection.mask_path #5120

Conversation

sashankaryal commented Nov 15, 2024 • edited by coderabbitai bot Loading

What changes are proposed in this pull request?

How is this patch tested? If it is not, please explain why.

Release Notes

Is this a user-facing change that should be mentioned in the release notes?

What areas of FiftyOne does this PR affect?

Summary by CodeRabbit

Summary by CodeRabbit

coderabbitai bot commented Nov 15, 2024 • edited Loading

Walkthrough

Changes

Possibly related issues

Possibly related PRs

Suggested labels

Suggested reviewers

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

Documentation and Community

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

benjaminpkane left a comment

Choose a reason for hiding this comment

benjaminpkane Nov 19, 2024

Choose a reason for hiding this comment

benjaminpkane Nov 19, 2024

Choose a reason for hiding this comment

sashankaryal commented Nov 15, 2024 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 15, 2024 •

edited

Loading