Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

best practice for performance #27

Open
yanggthomas opened this issue Jun 12, 2024 · 4 comments
Open

best practice for performance #27

yanggthomas opened this issue Jun 12, 2024 · 4 comments

Comments

@yanggthomas
Copy link

Hi, I am trying to verify the performance on an A100 PCIe version.
However, I can't get the expected performance reported in your paper.

Currently I am getting 467 MLUPs for cavity example with 400^3 cubic and 86 MLUPs for non-sparse case for 2phase with 256^3.

Besides, in your paper, it is reported that "The performance of the NVIDIA A100 GPU reached over 900 MLUPS for single-phase flow and 500 for two-phase flow with surface tension." in conclusion section. However in table 1, the max MLUPs for 2-phase is 310. Is this typo or other reasons?

@yanggthomas
Copy link
Author

I am using Taichi v1.7.1 and removed dynamic_index parameter in ti.init()

@yjhp1016
Copy link
Owner

yjhp1016 commented Jun 12, 2024 via email

@yanggthomas
Copy link
Author

so what's your suggested best practice to achieve good performance?

I think the performance depends on geometry (will affect data continuity in memory), configuration of your computer (software and hardware), your library versions etc. I think as these configurations are not the same, so we got different results.

On Wed, 12 Jun 2024 at 11:44, yanggthomas @.> wrote: Hi, I am trying to verify the performance on an A100 PCIe version. However, I can't get the expected performance reported in your paper. Currently I am getting 467 MLUPs for cavity example with 400^3 cubic and 86 MLUPs for non-sparse case for 2phase with 256^3. Besides, in your paper, it is reported that "The performance of the NVIDIA A100 GPU reached over 900 MLUPS for single-phase flow and 500 for two-phase flow with surface tension." in conclusion section. However in table 1, the max MLUPs for 2-phase is 310. Is this typo or other reasons? — Reply to this email directly, view it on GitHub <#27>, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJEDKQHCV5Q7JZJOQJSQEBLZHARA3AVCNFSM6AAAAABJGC2DWKVHI2DSMVQWIX3LMV43ASLTON2WKOZSGM2DQNBTG42TQMY . You are receiving this because you are subscribed to this thread.Message ID: @.>

@yjhp1016
Copy link
Owner

yjhp1016 commented Jun 12, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants