Query Builder tests for OOM and memory limit (from PR #2053) #2054
base: master
Conversation
    The query basically will do aggregation of half of dataframe
    """
    q = QueryBuilder()
    return (
Why are these brackets here? It's inconsistent with query_resample from this file and most of the codebase.
It's a way to have decent multiline formatting ...
I don't think you need them in this case. The function quoted above, query_resample,
doesn't use brackets. We should keep the style homogeneous.
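For illustration, a minimal sketch of the two styles being discussed (the import path and the aggregation spec here are placeholders, not the PR's actual code):

from arcticdb import QueryBuilder

def query_with_brackets() -> QueryBuilder:
    # Parentheses let the chained calls be split across lines
    q = QueryBuilder()
    return (
        q.groupby("uint8")
        .agg({"uint32": "mean"})
    )

def query_without_brackets() -> QueryBuilder:
    # The style used by query_resample: one unbracketed expression
    q = QueryBuilder()
    return q.groupby("uint8").agg({"uint32": "mean"})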
lib: Library = arctic_library_lmdb
df = generate_big_dataframe(300000)
lib.write(symbol, df)
del df
I don't think this is needed. It should be deleted when it goes out of scope.
In theory, yes. In practice, I'm not sure. One extra delete doesn't hurt.
I think we should have a good understanding of the code we're putting in, both in theory and in practice. It's unusual to delete things in Python this way. We should prove there is a reason to do this and document why it is needed; if not, we should omit it.
Let's see @alexowens90's take on this as well.
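For context, a sketch of the pattern under discussion (the function and fixture names follow the PR; the comments are my reading of CPython's reference-counting behavior, not something the PR states):

def setup_symbol(lib, symbol):
    df = generate_big_dataframe(300000)
    lib.write(symbol, df)
    # CPython frees df as soon as its last reference drops, which would
    # happen at function exit anyway; the explicit del only moves the
    # release earlier. That matters only if the memory profiler measures
    # allocations made by code that runs after this point.
    del df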
everything not relevant to mem leak measurement out of the
test, so it does as little work as possible
"""
lib: Library = arctic_library_lmdb
Why are some things type annotated, e.g. lib: Library,
while others are not, e.g. df is not df: pd.DataFrame?
Omissions, or in some places not needed. Fixed here and there.
""" | ||
q = QueryBuilder() | ||
return ( | ||
q[q["strings"] != "QASDFGH"] |
The chances of this filtering anything out are very low; is that deliberate?
Yes, this is deliberate: it attempts to filter by an impossible condition which should still be evaluated. And as we know, with strings things are always buggier than with numbers.
My issue is that the condition is not impossible, just unlikely. Making it actually impossible is fine (e.g. use a different character set than the generated dataframe), but we have optimisations in place for when nothing is filtered out, which could then skew the results if you get unlucky and the dataframe does contain QASDFGH
Yes, it is very unlikely, thus making the result 99.99% of the dataframe. (Note that by default I think we generate strings from a 10-char array and this is 7 chars, so it's perhaps really impossible to achieve unless there is a bug.)
Anyway, for our purposes we do not need exactness at all, so whether the actual returned result is 99% or 100% of rows really does not matter.
It does matter. Returning 100% and 99.999% of a dataframe can take totally different code paths
Let's see what @alexowens90 thinks
Added an additional query without any filter. Thus we now have 3 queries that do groupby aggregation: with a filter (50%), with a filter (effectively 100%), and with no filter ...
There is also an additional functional test that covers the case where something is fishy with the queries.
@grusev the query needs to be changed to one of:
- Return 100% of the dataframe
- Return <100% of the dataframe
And it has to be consistent. We have optimisations in place so that a filter that doesn't remove anything is much more efficient than a filter that removes even 1 row.
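For example, one way to make the filter deterministically keep 100% of rows, assuming the generated strings are alphanumeric (an assumption about generate_big_dataframe, not something stated in this thread):

from arcticdb import QueryBuilder

def query_filter_nothing_out() -> QueryBuilder:
    # '#' is outside the assumed generated alphabet, so the != condition
    # can never remove a row and the query consistently returns 100% of
    # the dataframe, always taking the nothing-filtered-out code path.
    q = QueryBuilder()
    return q[q["strings"] != "#IMPOSSIBLE#"]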
We have it already. Look at:
def query_no_filter_only_groupby_with_aggregations() -> QueryBuilder:
    """
    groupby composite aggregation query for QueryBuilder memory tests.
    The query basically will do aggregation of half of dataframe
    """
    q = QueryBuilder()
    return (
        q.groupby("uint8")
        .agg({"uint32": "mean",
              "int32": "sum",
              "strings": "count",
              "float64": "sum",
              "float32": "min",
              "int16": "max"})
    )
Only the docstring comment is misleading, but all is OK.
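For reference, the docstring could simply be corrected along these lines (a suggestion, not the PR's actual fix):

def query_no_filter_only_groupby_with_aggregations() -> QueryBuilder:
    """
    groupby composite aggregation query for QueryBuilder memory tests.
    No filter is applied; the whole dataframe is aggregated.
    """
    ...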
Pass size of dataframe and it will generate random row range
"""
percentage_rows_returned = 0.57
start_percentage = random.uniform(0.01, 1.0 - percentage_rows_returned)
This will probably prevent starting from the 0-th row. Is this the desired behavior?
The same question applies to query_date_range_57percent.
start_percentage = random.uniform(0.01, 1.0 - percentage_rows_returned)
result_size_rows = int(0.57 * size)
q = QueryBuilder()
a = random.randint(0, int((size - 1) * start_percentage))
- As the row range is closed on the left side and open on the right side, it is safe to pass size here. It's also the correct thing to do, because the current code represents the dataframe without its last row.
- Why is the conversion to int needed? Looking at the type hint, size is already an integer.
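Putting both comments together, a sketch of how the helper could look (the 0.57 constant mirrors the PR; the helper name and the uniform start over the full range, which keeps the 0-th row reachable, are my assumptions):

import random

def random_57percent_row_range(size: int) -> tuple:
    result_size_rows = int(0.57 * size)
    # randint's lower bound of 0 keeps the 0-th row reachable
    start = random.randint(0, size - result_size_rows)
    # row ranges are half-open [start, end), so end may equal size
    return (start, start + result_size_rows)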
Reference Issues/PRs

What does this implement or fix?
Query Builder tests for OOM and memory limit
The PR contains code that introduces the memray and pytest-memray libraries. With their help it is possible to create 2 types of tests (sketched below):
memory leaks test - the memory leak tests will accept one or two parameters:
memory limit test - the memory limit test is about enforcing memory efficiency of code. Currently I have set the CURRENT memory usage as the limit for this QueryBuilder code. If the test fails in the future, that would mean our code has become LESS memory efficient there, and why that happened needs to be investigated. Also, the number currently set accepts the situation AS IS; it does not make the claim that we are memory efficient. No such study has been done, and it is outside the test's purpose. If such a thing is needed, an investigative task needs to be created.
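A minimal sketch of what the two test types look like with pytest-memray (the thresholds, test names, and bodies are illustrative only, and limit_leaks is only available in recent pytest-memray versions):

import pytest

@pytest.mark.limit_memory("800 MB")  # illustrative threshold, not a studied bound
def test_query_memory_limit(arctic_library_lmdb):
    # runs under memray; fails if peak allocated memory exceeds the limit
    ...

@pytest.mark.limit_leaks("1 MB")  # illustrative threshold
def test_query_memory_leaks(arctic_library_lmdb):
    # fails if more than the given amount of memory is left unreleased
    ...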
Checklist

Have you updated the relevant docstrings, documentation and copyright notice?
Is this contribution tested against all ArcticDB's features?
Do all exceptions introduced raise appropriate error messages?
Are API changes highlighted in the PR description?
Is the PR labelled as enhancement or bug so it appears in autogenerated release notes?