Performance issues in examples/

Hello! I've found a performance issue in tensorlayer/examples: `batch()` should be called before `map()`, which could make your program more efficient. Here is [the tensorflow document](https://tensorflow.google.cn/guide/data_performance?hl=zh_cn#vectorized_mapping) to support it.

Detailed description is listed below:

- examples/quantized_net/tutorial_binarynet_cifar10_tfrecord.py: `train_ds = train_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_binarynet_cifar10_tfrecord.py#L168) should be called before `train_ds = train_ds.map(_map_fn_train, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_binarynet_cifar10_tfrecord.py#L164).
- examples/quantized_net/tutorial_binarynet_cifar10_tfrecord.py: `test_ds = test_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_binarynet_cifar10_tfrecord.py#L178) shoule be called before `test_ds = test_ds.map(_map_fn_test, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_binarynet_cifar10_tfrecord.py#L175).
- examples/quantized_net/tutorial_dorefanet_cifar10_tfrecord.py: `train_ds = train_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_dorefanet_cifar10_tfrecord.py#L161) should be called before `train_ds = train_ds.map(_map_fn_train, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_dorefanet_cifar10_tfrecord.py#L157).
- examples/quantized_net/tutorial_dorefanet_cifar10_tfrecord.py: `test_ds = test_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_dorefanet_cifar10_tfrecord.py#L171) should be called before `test_ds = test_ds.map(_map_fn_test, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_dorefanet_cifar10_tfrecord.py#L168).
- examples/quantized_net/tutorial_quanconv_cifar10.py: `train_ds = train_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_quanconv_cifar10.py#L158) should be called before `train_ds = train_ds.map(_map_fn_train, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_quanconv_cifar10.py#L154).
- examples/quantized_net/tutorial_quanconv_cifar10.py: `test_ds = test_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_quanconv_cifar10.py#L168) should be called before `test_ds = test_ds.map(_map_fn_test, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_quanconv_cifar10.py#L165).
- examples/quantized_net/tutorial_ternaryweight_cifar10_tfrecord.py: `train_ds = train_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_ternaryweight_cifar10_tfrecord.py#L167) should be called before `train_ds = train_ds.map(_map_fn_train, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_ternaryweight_cifar10_tfrecord.py#L163).
- examples/quantized_net/tutorial_ternaryweight_cifar10_tfrecord.py: `test_ds = test_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_ternaryweight_cifar10_tfrecord.py#L177) should be called before `test_ds = test_ds.map(_map_fn_test, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/quantized_net/tutorial_ternaryweight_cifar10_tfrecord.py#L174).
- examples/data_process/tutorial_fast_affine_transform.py: `dataset = dataset.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/data_process/tutorial_fast_affine_transform.py#L102) should be called before `dataset = dataset.map(_map_fn, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/data_process/tutorial_fast_affine_transform.py#L101).
- examples/data_process/tutorial_tf_dataset_voc.py: `ds = ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/data_process/tutorial_tf_dataset_voc.py#L95) should be called before `ds = ds.map(_map_fn, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/data_process/tutorial_tf_dataset_voc.py#L92).
- examples/basic_tutorials/tutorial_cifar10_cnn_static.py: `train_ds = train_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/basic_tutorials/tutorial_cifar10_cnn_static.py#L143) should be called before `train_ds = train_ds.map(_map_fn_train, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/basic_tutorials/tutorial_cifar10_cnn_static.py#L139).
- examples/basic_tutorials/tutorial_cifar10_cnn_static.py: `test_ds = test_ds.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/basic_tutorials/tutorial_cifar10_cnn_static.py#L153) should be called before `test_ds = test_ds.map(_map_fn_test, num_parallel_calls=multiprocessing.cpu_count())`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/basic_tutorials/tutorial_cifar10_cnn_static.py#L150).
- examples/deprecated_tutorials/tutorial_imagenet_inceptionV3_distributed.py: `dataset = dataset.batch(batch_size)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/deprecated_tutorials/tutorial_imagenet_inceptionV3_distributed.py#L150) should be called before `dataset = dataset.map(_map_fn, num_parallel_calls=max_cpus)`[(here)](https://github.com/tensorlayer/tensorlayer/blob/73a42a3c4da03715673adb55523c66a7ad826067/examples/deprecated_tutorials/tutorial_imagenet_inceptionV3_distributed.py#L148).

Besides, you need to check the function called in `map()`(e.g., `_map_fn` called in `dataset.map()`) whether to be affected or not to make the changed code work properly. For example, if `_map_fn` needs data with shape (x, y, z) as its input before fix, it would require data with shape (batch_size, x, y, z).

Looking forward to your reply. Btw, I am very glad to create a PR to fix it if you are too busy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Performance issues in examples/ #1139

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Performance issues in examples/ #1139

Description

Activity

zsdonghao commented on Aug 20, 2021

DLPerf commented on Aug 31, 2021

hanjr92 commented on Sep 8, 2021

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Issue actions