Taking too long to process images. How to leverage GPU properly? #23

aman-captech · 2018-10-17T11:19:08Z

I am using TensorFlow 1.8.0. with a Nvidia 960M GPU. I have obtained the following results on two sets of 5 images with different resolutions:
No. of images; Resolution; Runtime; GPU Memory used:
5; 3280x2460; 220secs; 3176Mib
5; 167x215; 61secs; 380Mib

Can anyone suggest why is it taking so long to process these images. The inference time seems to be way off from the benchmark results given in the repo.
Also for the second set of low res. images why isn't more GPU memory being utilised to process faster?
Am I unable to leverage the GPU properly? If yes, please help me out with how to go about this?
Has anyone else experienced such results?

thelastfunction · 2019-09-13T11:11:03Z

Hi aman,
Have you solved the problem as I am also facing the same problem. The inference time is way more than what is in the benchmark.

lishangqiu · 2019-12-23T20:25:51Z

Hi aman,
I am facing the same problem too. Did you solve it? I wonder if we can predict on batch.

thelastfunction · 2020-01-17T06:47:59Z

Hi,
I figured it out , you may not be doing inference on Batch on Images or It may be using CPU as default make sure that you have installed tensorflow-gpu and performed Inference on Batch of Images

photoszzt · 2020-04-25T23:00:24Z

@thelastfunction How do you do batch of images? detect_faces doesn't accept a batch of images.

JiaHeng-DLUT · 2020-09-30T11:13:40Z

@photoszzt I am facing the same problem too. Did you solve it? How to predict on batch?

photoszzt · 2020-09-30T15:19:16Z

@JiaHeng-DLUT I switch to RetinaFace: https://github.com/deepinsight/insightface/tree/master/RetinaFace with this implementation: https://github.com/biubug6/Pytorch_Retinaface

JiaHeng-DLUT · 2020-10-01T01:20:07Z

@photoszzt Thanks!

chaudharyachint08 · 2022-04-12T15:27:13Z

First repository appeared to be removed, for me across three it looks like MTCNN is faster on CPU than GPU

…tch processing support - Completely refactored the MTCNN implementation following best coding practices. - Optimized code by removing unnecessary transpositions, resulting in faster computation. Fixes #22. - Transposed convolutional layer weights to eliminate the need for additional transpositions during preprocessing and postprocessing, improving overall efficiency. - Converted preprocessing and postprocessing functions into matrix operations to accelerate computation. Fixes #14, #110. - Added batch processing support to enhance performance for multiple input images. Fixes #9, #71. - Migrated network architecture to TensorFlow >= 2.12 for improved compatibility and performance. Fixes #80, #82, #90, #91, #93, #98, #104, #112, #114, #115, #116. - Extensively documented the project with detailed explanations of thresholds and parameters. Fixes #12, #41, #52, #57, #99, #122, #117. - Added support for selecting computation backends (CPU, GPU, etc.) with the `device` parameter. Fixes #23. - Added new parameters to control the result format (support for x1, y1, x2, y2 instead of x1, y1, width, height) and the ability to return tensors instead of dictionaries. Fixes #72. - Configured PyLint support to ensure code quality and style adherence. - Organized functions into specific modules (`mtcnn.utils.*` and `mtcnn.stages.*`) for better modularity. - Created Jupyter notebooks for visualization and ablation studies of each stage, allowing detailed exploration of layers, weights, and intermediate results. Fixes #88, #102. - Added a comprehensive training guide for the model. Fixes #35, #39. - Updated README with information on the new version, including the complete Read the Docs documentation that describes the process, theoretical background, and usage examples. Fixes #53, #73. - Configured GitHub Actions for continuous integration and delivery (CI/CD). - Fixed memory leak by switching to a more efficient TensorFlow method (`model(tensor)` instead of `model.predict(tensor)`). Fixes #87, #109, #121, #125, #128. - Made TensorFlow an optional dependency to prevent conflicts with user-installed versions. Fixes #95. - Added comprehensive unit tests for increased reliability and coverage.

ipazc mentioned this issue Oct 8, 2024

MTCNN v1.0.0 #132

Closed

ipazc mentioned this issue Oct 8, 2024

Refactored MTCNN codebase with significant optimizations. Version 1.0.0 #133

Merged

ipazc closed this as completed in #133 Oct 8, 2024

ipazc closed this as completed in b6eba4b Oct 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Taking too long to process images. How to leverage GPU properly? #23

Taking too long to process images. How to leverage GPU properly? #23

aman-captech commented Oct 17, 2018 •

edited

Loading

thelastfunction commented Sep 13, 2019

lishangqiu commented Dec 23, 2019

thelastfunction commented Jan 17, 2020

photoszzt commented Apr 25, 2020

JiaHeng-DLUT commented Sep 30, 2020

photoszzt commented Sep 30, 2020

JiaHeng-DLUT commented Oct 1, 2020

chaudharyachint08 commented Apr 12, 2022

Taking too long to process images. How to leverage GPU properly? #23

Taking too long to process images. How to leverage GPU properly? #23

Comments

aman-captech commented Oct 17, 2018 • edited Loading

thelastfunction commented Sep 13, 2019

lishangqiu commented Dec 23, 2019

thelastfunction commented Jan 17, 2020

photoszzt commented Apr 25, 2020

JiaHeng-DLUT commented Sep 30, 2020

photoszzt commented Sep 30, 2020

JiaHeng-DLUT commented Oct 1, 2020

chaudharyachint08 commented Apr 12, 2022

aman-captech commented Oct 17, 2018 •

edited

Loading