[Enhance] Add pipeline for data loading #430

Wuziyi616 · 2021-04-09T06:36:20Z

In dataset.show/evaluate() functions, we may need to load points/gt. In current implementations, we load them from disk using np.fromfile(), which is incompatible when data is in ceph. To solve this, we add eval_pipeline in configs, and pass it as an argument to these functions. This pipeline purely consists of raw data loading operations (e.g. LoadImage, LoadPoints), eliminating the effects of data augmentation, and can adjust with the file client.

Wuziyi616 · 2021-04-09T06:40:20Z

The added eval_pipeline will be passed as an argument in the eval_hook.

Also support semseg mask loading in scannet-seg dataset now.

I have carefully checked all the configs and add eval_pipeline to configs with custom data pipelines.

codecov · 2021-04-09T06:54:05Z

Codecov Report

❗ No coverage uploaded for pull request base (master@2d9b97b). Click here to learn what that means.
The diff coverage is 67.85%.

❗ Current head a7b8d84 differs from pull request most recent head 27f05ad. Consider uploading reports for the commit 27f05ad to get more accurate results

@@            Coverage Diff            @@
##             master     #430   +/-   ##
=========================================
  Coverage          ?   50.80%           
=========================================
  Files             ?      184           
  Lines             ?    13425           
  Branches          ?     2160           
=========================================
  Hits              ?     6820           
  Misses            ?     6149           
  Partials          ?      456

Flag	Coverage Δ
unittests	`50.80% <67.85%> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
mmdet3d/datasets/waymo_dataset.py	`10.36% <0.00%> (ø)`
mmdet3d/datasets/custom_3d_seg.py	`63.75% <18.18%> (ø)`
mmdet3d/datasets/lyft_dataset.py	`71.28% <70.00%> (ø)`
mmdet3d/datasets/nuscenes_dataset.py	`41.40% <70.00%> (ø)`
mmdet3d/datasets/kitti_dataset.py	`75.89% <72.22%> (ø)`
mmdet3d/datasets/sunrgbd_dataset.py	`76.04% <77.77%> (ø)`
mmdet3d/datasets/custom_3d.py	`72.80% <87.50%> (ø)`
mmdet3d/datasets/scannet_dataset.py	`92.13% <87.87%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2d9b97b...27f05ad. Read the comment docs.

Wuziyi616 · 2021-04-09T08:25:36Z

Actually the commit message shouldn't be "reuse", because I pass into a new pipeline. "Reuse" is for easy understanding since the added eval_pipeline is similar to test_pipeline.

mmdet3d/datasets/custom_3d_seg.py

mmdet3d/datasets/custom_3d.py

mmdet3d/datasets/scannet_dataset.py

Wuziyi616 · 2021-04-15T13:23:51Z

Move the tedious if/else conditions to dataset._extract_data(). Set default value to pipeline.

Wuziyi616 · 2021-04-15T13:57:29Z

I have tried using tools/misc/visualize_results.py to show results by calling dataset.show() and input config.eval_pipeline. The results are the same as before.

Wuziyi616 · 2021-04-15T15:47:11Z

Basic logic now:

if pipeline is given in show/evaluate function, directly use this pipeline
if pipeline is None, if self.pipeline is not None, get_loading_pipeline(self.pipeline)
pipeline and self.pipeline are both None, call _build_default_pipeline() for this dataset

mmdet3d/datasets/custom_3d.py

mmdet3d/datasets/custom_3d_seg.py

mmdet3d/datasets/kitti_dataset.py

mmdet3d/datasets/utils.py

configs/centerpoint/centerpoint_0075voxel_second_secfpn_4x8_cyclic_20e_nus.py

mmdet3d/datasets/custom_3d_seg.py

Wuziyi616

I have carefully checked all the config files and 5 of them needed not to be modified.

ZwwWayne · 2021-04-19T14:19:41Z

PRs can be merged after resolving conflicts.

Wuziyi616 requested review from Tai-Wang and ZwwWayne April 9, 2021 07:40

Tai-Wang reviewed Apr 10, 2021

View reviewed changes

mmdet3d/datasets/custom_3d_seg.py Outdated Show resolved Hide resolved

Tai-Wang reviewed Apr 10, 2021

View reviewed changes

mmdet3d/datasets/custom_3d.py Show resolved Hide resolved

Wuziyi616 requested a review from Tai-Wang April 13, 2021 07:04

ZwwWayne reviewed Apr 15, 2021

View reviewed changes

mmdet3d/datasets/scannet_dataset.py Outdated Show resolved Hide resolved

Wuziyi616 requested a review from ZwwWayne April 16, 2021 03:27

Tai-Wang reviewed Apr 16, 2021

View reviewed changes

mmdet3d/datasets/custom_3d.py Outdated Show resolved Hide resolved

Tai-Wang reviewed Apr 16, 2021

View reviewed changes

mmdet3d/datasets/custom_3d_seg.py Outdated Show resolved Hide resolved

Tai-Wang reviewed Apr 16, 2021

View reviewed changes

mmdet3d/datasets/kitti_dataset.py Outdated Show resolved Hide resolved

Tai-Wang reviewed Apr 16, 2021

View reviewed changes

mmdet3d/datasets/utils.py Show resolved Hide resolved

ZwwWayne reviewed Apr 17, 2021

View reviewed changes

configs/centerpoint/centerpoint_0075voxel_second_secfpn_4x8_cyclic_20e_nus.py Outdated Show resolved Hide resolved

ZwwWayne reviewed Apr 17, 2021

View reviewed changes

mmdet3d/datasets/custom_3d_seg.py Show resolved Hide resolved

Wuziyi616 commented Apr 19, 2021

View reviewed changes

ZwwWayne approved these changes Apr 19, 2021

View reviewed changes

Wuziyi616 added 8 commits April 19, 2021 22:20

reuse pipeline in scannet-det dataset

c8867f4

reuse pipeline in kitti dataset

621521e

reuse pipeline in lyft dataset

270cc7c

reuse pipeline in sunrgbd dataset

232c2cc

reuse pipeline in nuscenes dataset

a68ce6f

reuse pipeline in waymo dataset

808d1ec

reuse pipeline in scannet-seg dataset

f151649

add eval_pipeline in configs which have custom data pipelines

ee6eb5c

Wuziyi616 added 10 commits April 19, 2021 22:24

move data loading via pipeline to dataset._extract_data() for clarity

13ef3ed

use eval_pipeline in tools/misc/visualize_results.py

6e49e56

get_pipeline from self when no pipeline is provided

21580d2

fix small bugs

afaa8df

fix small bugs

8b19cdb

simplify and clear code

05b8776

remove unnecessary eval_pipeline added

02f4be2

add comment about why we set self.test_mode=False

b76dc44

small fix

ad13e54

modify docs about config

27f05ad

Wuziyi616 force-pushed the reuse_pipeline_dataset branch from 54158d1 to 27f05ad Compare April 19, 2021 14:26

ZwwWayne merged commit 78c29c3 into open-mmlab:master Apr 19, 2021

Wuziyi616 deleted the reuse_pipeline_dataset branch April 19, 2021 15:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Enhance] Add pipeline for data loading #430

[Enhance] Add pipeline for data loading #430

Wuziyi616 commented Apr 9, 2021

Wuziyi616 commented Apr 9, 2021 •

edited

Loading

codecov bot commented Apr 9, 2021 •

edited

Loading

Wuziyi616 commented Apr 9, 2021

Wuziyi616 commented Apr 15, 2021

Wuziyi616 commented Apr 15, 2021

Wuziyi616 commented Apr 15, 2021

Wuziyi616 left a comment

ZwwWayne commented Apr 19, 2021

[Enhance] Add pipeline for data loading #430

[Enhance] Add pipeline for data loading #430

Conversation

Wuziyi616 commented Apr 9, 2021

Wuziyi616 commented Apr 9, 2021 • edited Loading

codecov bot commented Apr 9, 2021 • edited Loading

Codecov Report

Wuziyi616 commented Apr 9, 2021

Wuziyi616 commented Apr 15, 2021

Wuziyi616 commented Apr 15, 2021

Wuziyi616 commented Apr 15, 2021

Wuziyi616 left a comment

Choose a reason for hiding this comment

ZwwWayne commented Apr 19, 2021

Wuziyi616 commented Apr 9, 2021 •

edited

Loading

codecov bot commented Apr 9, 2021 •

edited

Loading