Skip to content

Latest commit

 

History

History
27 lines (20 loc) · 1.1 KB

README.md

File metadata and controls

27 lines (20 loc) · 1.1 KB

Mask_R-CNN

Overview (What is Mask RCNN)

  • Deep residual network which can solve instance segmentation

    • Instance segmentation referes to seperating different objects from a image or a video.
  • Two steps of Mask RCNN

    1. Generates proposals regarding to regions where the object might occupy.
    2. Predicts the class, bounding box, and mask in pixel level of the object.
  • Backbone structure

    • Feature Pyramid Netwrok style deep nerual network.
    • Consists of bottom-up path way, top-bottom pathway, and lateral connections.

Step 1

  • RPN scans all FPN path and prposes regions where it may contain objects
  • Anchors are set of boxes sharing the same center
  • RPN uses the anchors to predict and output the coordinates and thes size of the object.

Step 2

  • ROI align locates the relevent areas of fetaure map
  • One branch generates masks for each object in pixel level

Architecture of Mask RCNN