Testing plan 18.2 #2522

sebastian · 2018-04-04T08:33:45Z

sebastian · 2018-04-04T08:35:14Z

If there are any particular things you would like to test, please go ahead and add your name to the corresponding item. Some are probably better done by me.

obrok · 2018-04-04T10:14:54Z

I wanted to extend the fuzzer some, compared to last milestone, before running the tests. Maybe give me till the end of this week to add more features to it, and I will do the extended run next week?

sebastian · 2018-04-04T10:36:27Z

Maybe give me till the end of this week to add more features to it, and I will do the extended run next week?

Sure, sounds good

sebastian · 2018-04-04T14:53:34Z

Ran make perftest

Query: SELECT COUNT(*) FROM notes_changes
18.1.2: AVERAGE duration: 1.216s, STDDEV: 0.023s
18.2.0: AVERAGE duration: 0.534s, STDDEV: 0.019s

Query: SELECT COUNT(*) FROM notes
18.1.2: AVERAGE duration: 0.368s, STDDEV: 0.005s
18.2.0: AVERAGE duration: 0.151s, STDDEV: 0.006s

In other words, despite being a small and locally executed test (and one can argue with how representative these queries are), the performance seems to have improved.

Also the data for the performance run in diffix0 is also more or less like last time we did the test. Unfortunately getting the same test to run on 18.1.2 didn't work out of the box. That would have been a more interesting comparison.

sasa1977 · 2018-04-05T09:07:18Z

I was able to execute compliance tests for 1000 users. The test took almost one hour, and reported 10 errors. I think that some of them have the same cause, so I suspect the actual number of errors is small. I'll create a separate issues for these errors.

sasa1977 · 2018-04-05T09:35:06Z

I created the corresponding issues and tagged them with compliance and bug labels. You can see them here.

sasa1977 · 2018-04-09T07:29:46Z

I've encountered a lot of problems trying to run the compliance tests for 10k users. In the end I had to do a reduced scope over 5k users and without sql server odbc and tds, mongo 3.0 and 3.2, and sap hana.

In total there were 20 failures. I'll go through all of them and create issues for the errors I haven't seen in the previous run.

Even in such reduced scope, the test took 63 hours (for comparison, a test over 1k users took 1 hour). I can't explain why such a huge increase, but I think it's safe to say that 10k doesn't seem attainable at the moment. I think that if we want to run compliance for a larger user set, we should rethink our input. Perhaps having less records per-user, and using smaller input sets (e.g. less names) could help us here.

obrok · 2018-04-16T12:39:29Z

I ran the fuzz tests for 2000 queries on top of the runs I've been doing during last week while developing the fuzzer. I found quite a number of issues, most rather obscure.

I tested with postgres and mysql only. Currently the fuzzer is set to run as a compliance test and expects the results to be the same for all datasources. This is made somewhat difficult for the different datasources which have their own little idiosyncrasies. Even mysql handles booleans somewhat differently from postgres and is more permissive in terms of operations crashing in the DB.

The most useful work items I foresee for the fuzzer:

Making an effort to select a uid from the inner query - because this is only done randomly, many queries are rejected because of the lack of such and so the fuzzer is bad at exploring queries with subqueries.
Shrinking the failing queries - it takes quite some work to extract a useful minimal example from a reported query. Currently it seems to me that there is a bug in stream_data that makes this impossible - Shrinking doesn't seem to work as advertised for one_of/frequency/tree whatyouhide/stream_data#97

sebastian · 2018-04-23T08:59:23Z

Release has been made.

sebastian added the ready label Apr 4, 2018

sebastian added this to the Release 18.2 milestone Apr 4, 2018

sasa1977 mentioned this issue Apr 10, 2018

improving compliance tests #2585

Closed

sebastian closed this as completed Apr 23, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Testing plan 18.2 #2522

Testing plan 18.2 #2522

sebastian commented Apr 4, 2018 •

edited

Loading

sebastian commented Apr 4, 2018

obrok commented Apr 4, 2018

sebastian commented Apr 4, 2018

sebastian commented Apr 4, 2018

sasa1977 commented Apr 5, 2018

sasa1977 commented Apr 5, 2018

sasa1977 commented Apr 9, 2018

obrok commented Apr 16, 2018

sebastian commented Apr 23, 2018

Testing plan 18.2 #2522

Testing plan 18.2 #2522

Comments

sebastian commented Apr 4, 2018 • edited Loading

sebastian commented Apr 4, 2018

obrok commented Apr 4, 2018

sebastian commented Apr 4, 2018

sebastian commented Apr 4, 2018

sasa1977 commented Apr 5, 2018

sasa1977 commented Apr 5, 2018

sasa1977 commented Apr 9, 2018

obrok commented Apr 16, 2018

sebastian commented Apr 23, 2018

sebastian commented Apr 4, 2018 •

edited

Loading