Reproduce ScanNet200 Results #38

Louis708 · 2024-10-27T12:16:14Z

I try to reproduce ScanNet200 results. I prepare the data and follow your running code instructions. I get the below results:

ScanNet200 Evaluation
################################################
what           :      AP  AP_50%  AP_25%
################################################
Head AP        :   0.254   0.314   0.342
Common AP      :   0.209   0.259   0.282
Tail AP        :   0.212   0.260   0.295
Base AP        :   0.246   0.308   0.342
Novel AP       :   0.218   0.267   0.294
------------------------------------------------
AP             :   0.226   0.279   0.307
################################################

It seems that the results of 0.226, 0.279, 0.307 is a bit different from the paper's 0.237, 0.294, 0.328. Is the gap within an acceptable range? Or am I missing some steps during reproducing?

Here is the model I use:

Grounded-SAM (groundingdino_swint_ogc, sam_vit_h_4b8939), CLIP (ViT-L/14@336px)

I only change the file path in config and the agnostic flag into False. Here is my config:

proposals:
p2d: True # 2D branch
p3d: True # 3D branch
agnostic: False
refined: True

Here is my understanding of running the code:

grounding_2d.sh: Generate the 2D masks (maskGdino) and first stage feature (grounded_feat). This step taks a lot of hours to run.
generate_3d_inst.sh: Generate 3D instances (hier_agglo) from 2D masks using hierarchical agglomerative clustering.
refine_grounding_feat: Refine second stage feature (hier_agglo) from 3D instances, output refine features (refined_grounded_feat)
generate_3d_inst.sh: Finalize the 3D output masks from refine features (refined_grounded_feat). In this step, I change the bool here into False

Open3DIS/tools/generate_3d_inst.py

Line 278 in 4b05043

if True:

and the bool here into True

Open3DIS/tools/generate_3d_inst.py

Line 296 in 4b05043

if False:

to get the final output (final_result_hier_agglo) instead of 3D instances (hier_agglo).

I understand that it's hard to figure out the problem that I encounter. Your work is very cool. If you could help me out and give me some insight I would really appreciate it.

Thanks.

The text was updated successfully, but these errors were encountered:

PhucNDA · 2024-10-27T12:31:27Z

Hi @Louis708,

How did you generate 3D feature from ISBNet?

PhucNDA · 2024-10-27T12:48:15Z

Hi @Louis708,

You may want to independently verify the results for the 3D backbone-only and 2D-only cases to help identify the bug. These specific results are available on our webpage. If you encounter any issues with the source code, please don’t hesitate to reach out to me.

Louis708 · 2024-10-27T13:04:00Z

Hi @PhucNDA ,

Thanks for very quick reply. I generate the 3D features from ISBNet using

cd segmenter3d/ISBNet/
python3 tools/test.py configs/scannet200/isbnet_scannet200.yaml pretrains/scannet200/head_scannetv2_200_val.pth
in https://github.com/VinAIResearch/Open3DIS/blob/main/docs/DATA.md#3d-backbone

I will check the results for the 3D backbone-only and 2D-only on ScanNet 200 later.

Thanks

sgmzhou4 · 2024-11-12T07:42:27Z

Hi @PhucNDA ,

I try to reproduce ScanNet200 results. I prepare the data and follow your running code instructions. I get the below results:
ScanNet200 Evaluation
################################################
what           :      AP  AP_50%  AP_25%
################################################
Head AP        :   0.254   0.314   0.342
Common AP      :   0.209   0.259   0.282
Tail AP        :   0.212   0.260   0.295
Base AP        :   0.246   0.308   0.342
Novel AP       :   0.218   0.267   0.294
------------------------------------------------
AP             :   0.226   0.279   0.307
################################################
It seems that the results of 0.226, 0.279, 0.307 is a bit different from the paper's 0.237, 0.294, 0.328. Is the gap within an acceptable range? Or am I missing some steps during reproducing?

Here is the model I use:

Grounded-SAM (groundingdino_swint_ogc, sam_vit_h_4b8939), CLIP (ViT-L/14@336px)

I only change the file path in config and the agnostic flag into False. Here is my config:

proposals: p2d: True # 2D branch p3d: True # 3D branch agnostic: False refined: True

Here is my understanding of running the code:

grounding_2d.sh: Generate the 2D masks (maskGdino) and first stage feature (grounded_feat). This step taks a lot of hours to run.

generate_3d_inst.sh: Generate 3D instances (hier_agglo) from 2D masks using hierarchical agglomerative clustering.

refine_grounding_feat: Refine second stage feature (hier_agglo) from 3D instances, output refine features (refined_grounded_feat)

generate_3d_inst.sh: Finalize the 3D output masks from refine features (refined_grounded_feat). In this step, I change the bool here into False

Open3DIS/tools/generate_3d_inst.py

Line 278 in 4b05043

if True:

and the bool here into True

Open3DIS/tools/generate_3d_inst.py

Line 296 in 4b05043

if False:

to get the final output (final_result_hier_agglo) instead of 3D instances (hier_agglo).

I understand that it's hard to figure out the problem that I encounter. Your work is very cool. If you could help me out and give me some insight I would really appreciate it.

Thanks.

Would you mind sharing how did you modify the eval.py and your GT file plz?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reproduce ScanNet200 Results #38

Reproduce ScanNet200 Results #38

Louis708 commented Oct 27, 2024 •

edited

Loading

PhucNDA commented Oct 27, 2024

PhucNDA commented Oct 27, 2024

Louis708 commented Oct 27, 2024

sgmzhou4 commented Nov 12, 2024

Reproduce ScanNet200 Results #38

Reproduce ScanNet200 Results #38

Comments

Louis708 commented Oct 27, 2024 • edited Loading

PhucNDA commented Oct 27, 2024

PhucNDA commented Oct 27, 2024

Louis708 commented Oct 27, 2024

sgmzhou4 commented Nov 12, 2024

Louis708 commented Oct 27, 2024 •

edited

Loading