[performance] loading VOTable in CARTA requires more RAM comparing to TOPCAT #1217

kswang1029 · 2022-11-17T05:09:05Z

Based on the psrecord test, loading a VOTable in CARTA requires more RAM comparing to TOPCAT. CARTA is faster than TOPCAT but it has a stricter memory requirement. It would be good to study a bit on how to reduce the memory usage while retaining the excellent performance.

veggiesaurus · 2022-11-17T13:30:41Z

I think this is the easiest solution: https://pugixml.org/docs/manual.html#dom.memory.compact, but we would need to add pugixml as a third-party lib instead of an installed dependency.

kswang1029 · 2022-11-17T14:27:10Z

So no way we can provide both modes (yes I am greedy) at the same time, right? (as a user configurable)

veggiesaurus · 2022-11-17T16:10:48Z

So no way we can provide both modes (yes I am greedy) at the same time, right? (as a user configurable)

it would be rather complicated. We'd need to build the library twice and muck around with linking stuff I think

kswang1029 · 2022-11-18T01:57:55Z

So no way we can provide both modes (yes I am greedy) at the same time, right? (as a user configurable)

it would be rather complicated. We'd need to build the library twice and muck around with linking stuff I think

I see. Then let's see if the performance is still good enough with PUGIXML_COMPACT.

kswang1029 · 2022-11-30T10:15:20Z

@jolopezl would you be able to investigate this with the suggested build flag and make comparisons without that flag?

jolopezl · 2022-12-04T13:03:16Z

PUGIXML_COMPACT seems to have the desired impact, halving the amount of memory usage:

The changes are minimal: just adding pugixml as a third-party library and forcing the PUGIXML_COMPAT variable ON. I don't observe any significant performance detriment, but I would prefer some e2e test checked to assert this strongly.

kswang1029 · 2022-12-05T11:46:17Z

PUGIXML_COMPACT seems to have the desired impact, halving the amount of memory usage:

The changes are minimal: just adding pugixml as a third-party library and forcing the PUGIXML_COMPAT variable ON. I don't observe any significant performance detriment, but I would prefer some e2e test checked to assert this strongly.

based on the plots, it seems with the PUGIXML_COMPACT flag ON, it uses less ram (~50% less) and the e2e time is also reduced? When the PUGIXML_COMPACT flag is OFF, do you see your OS is swapping? If so that might explain the e2e time difference.

jolopezl · 2022-12-05T22:04:12Z

@kswang1029, a small update with better CPU sampling:

based on the plots, it seems with the PUGIXML_COMPACT flag ON, it uses less ram (~50% less) and the e2e time is also reduced? When the PUGIXML_COMPACT flag is OFF, do you see your OS is swapping? If so that might explain the e2e time difference.

The rectangles on the plots have the same width, so I'd say that roughly there's the same e2e time. For the case PUGIXML_COMPAT=OFF, I can effectively observe some swapping from 250MB up to 1GB (from several trials), so I would say yes, there are some OS-dependent effects.

veggiesaurus · 2022-12-06T08:40:33Z

@jolopezl which branch is this with? And which file? I'd like to try with my machine and ubuntu

kswang1029 · 2022-12-06T10:14:54Z

@kswang1029, a small update with better CPU sampling:

based on the plots, it seems with the PUGIXML_COMPACT flag ON, it uses less ram (~50% less) and the e2e time is also reduced? When the PUGIXML_COMPACT flag is OFF, do you see your OS is swapping? If so that might explain the e2e time difference.

The rectangles on the plots have the same width, so I'd say that roughly there's the same e2e time. For the case PUGIXML_COMPAT=OFF, I can effectively observe some swapping from 250MB up to 1GB (from several trials), so I would say yes, there are some OS-dependent effects.

The "e2e" time should count between the initial 0% CPU usage and the final 0% CPU usage, not just duration of the 400% CPU usage. As your OS was swapping, it won't be a fair comparison of the e2e time between the two tests. I will perform some tests here.

kswang1029 · 2022-12-13T07:34:56Z

@jolopezl @veggiesaurus Here is a comparison with and without the PUGIXML_COMPACT=ON flag (jolopezl/1217_pugixml_compact vs dev)

This is tested with a desktop having 64GB of RAM. No swapping is observed during the test.

For the backend execution time, we see similar results. With PUGIXML_COMPACT=ON, it is very slightly slower but not that significant (sampling rate is 0.02s so the difference should be real).

For the peak RAM usage (excluding the RAM usage for the image itself), it is reduced by 50% roughly. Note that the image itself occupies 267MB.

For the final RAM usage (excluding the RAM usage for the image itself), it is reduced by 75% roughly. Note that the image itself occupies 267MB.

Based on this, it is a promising and significant improvement. 👍 @jolopezl please file a PR and I will do some other manual tests.

kswang1029 added question Further information is requested R&D labels Nov 17, 2022

kswang1029 assigned jolopezl Nov 30, 2022

jolopezl mentioned this issue Dec 13, 2022

add pugixml as third-party library with PUGIXML_COMPACT enabled #1228

Merged

5 tasks

jolopezl linked a pull request Dec 13, 2022 that will close this issue

add pugixml as third-party library with PUGIXML_COMPACT enabled #1228

Merged

5 tasks

confluence closed this as completed in #1228 Jan 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[performance] loading VOTable in CARTA requires more RAM comparing to TOPCAT #1217

[performance] loading VOTable in CARTA requires more RAM comparing to TOPCAT #1217

kswang1029 commented Nov 17, 2022 •

edited

Loading

veggiesaurus commented Nov 17, 2022

kswang1029 commented Nov 17, 2022 •

edited

Loading

veggiesaurus commented Nov 17, 2022

kswang1029 commented Nov 18, 2022

kswang1029 commented Nov 30, 2022

jolopezl commented Dec 4, 2022

kswang1029 commented Dec 5, 2022

jolopezl commented Dec 5, 2022

veggiesaurus commented Dec 6, 2022

kswang1029 commented Dec 6, 2022

kswang1029 commented Dec 13, 2022 •

edited

Loading

[performance] loading VOTable in CARTA requires more RAM comparing to TOPCAT #1217

[performance] loading VOTable in CARTA requires more RAM comparing to TOPCAT #1217

Comments

kswang1029 commented Nov 17, 2022 • edited Loading

veggiesaurus commented Nov 17, 2022

kswang1029 commented Nov 17, 2022 • edited Loading

veggiesaurus commented Nov 17, 2022

kswang1029 commented Nov 18, 2022

kswang1029 commented Nov 30, 2022

jolopezl commented Dec 4, 2022

kswang1029 commented Dec 5, 2022

jolopezl commented Dec 5, 2022

veggiesaurus commented Dec 6, 2022

kswang1029 commented Dec 6, 2022

kswang1029 commented Dec 13, 2022 • edited Loading

kswang1029 commented Nov 17, 2022 •

edited

Loading

kswang1029 commented Nov 17, 2022 •

edited

Loading

kswang1029 commented Dec 13, 2022 •

edited

Loading