[WIP] Improved robustness of concurrent schema updates #3186

brimoor · 2023-06-12T04:05:01Z

As explained in the @todo, this is a bit of a hack to specifically avoid issues when concurrently modifying a dataset's schema. However, the underlying problem still exists whenever list fields other than DatasetDocument.sample_fields and DatasetDocument.frame_fields are concurrently edited without first reloading.

I think it makes sense to go ahead and merge this particular patch because schema updates are by far the most likely case where concurrent list edits may arise, since in many workflows dataset objects may be held in-memory for long periods of time without reloading them. Unlike samples, which are generally only loaded + modified on-demand.

codecov · 2023-06-12T04:07:04Z

Codecov Report

Patch and project coverage have no change.

Comparison is base (d0c3dff) 15.56% compared to head (4ad5b8f) 15.56%.

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #3186   +/-   ##
========================================
  Coverage    15.56%   15.56%           
========================================
  Files          564      564           
  Lines        69319    69319           
  Branches       681      681           
========================================
  Hits         10791    10791           
  Misses       58528    58528

Flag	Coverage Δ
app	`15.56% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

allenleetc · 2023-06-12T11:53:00Z

Still orienting but LGTM. One question -- theoretically race conditions are still possible as the dataset doc is not locked between reload() and the subsequent schema update (not sure exactly where that happens)?

brimoor · 2023-06-12T13:36:41Z

Still orienting but LGTM. One question -- theoretically race conditions are still possible as the dataset doc is not locked between reload() and the subsequent schema update (not sure exactly where that happens)?

yes theoretically there still could be issues if two processes update the schema within the few lines of code between the reload() I added here and when the db transaction has happened. However, this is substantially better than status quo, where this issue can happen if any concurrent updates happen between the time when the dataset is loaded in one process and when a schema change is made (which could be hours or days).

I'd like us to properly address this issue as per #3187; ;this is just attempting to place a bandaid on the most likely case.

stale

* update sidebar test, fix small regressions (#3250) * remove next index ref, fix string filter is matching default (#3249) * release notes * package bumps * disabled e2e * lint * fix mac arm64 * Teams v1.3.2 release note * rm e2e * add main

* kill server on session close * close desktop app * cleanup * lint * lint * is_open * rm remote warning

* add frame cases to dynamic label tests * embedded frame label fixes * linting

Minor typo fixes

Adding merge_sample() method

* only matches * db_field bug and path fix

* don't throw error * add debug msg

* fixing filter_keypoints() bug * adding dynamic doc test

…into release/v0.21.3

* only matches * db_field bug and path fix * base image sample tests * cleanup * exclude bug * rm onlyMatch * frame and dynamic tests * adding coverage * keypoints fixes * cleanup * base image sample tests * cleanup * exclude bug * rm onlyMatch * frame and dynamic tests * adding coverage * keypoints fixes * cleanup * tweaks * exclude no only matches

brimoor · 2023-07-17T01:38:21Z

Closing in favor of #3308.

brimoor added the bug Bug fixes label Jun 12, 2023

brimoor requested review from allenleetc and a team June 12, 2023 04:05

brimoor self-assigned this Jun 12, 2023

brimoor changed the title ~~Improved robustness of concurrent schema updates~~ [WIP] Improved robustness of concurrent schema updates Jun 12, 2023

allenleetc previously approved these changes Jun 12, 2023

View reviewed changes

brimoor marked this pull request as draft June 12, 2023 15:02

voxel51 deleted a comment from allenleetc Jun 14, 2023

brimoor force-pushed the bugfix/iss-3185 branch 2 times, most recently from a55af08 to 244a959 Compare June 18, 2023 14:16

brimoor force-pushed the bugfix/iss-3185 branch from 244a959 to ac766ab Compare June 30, 2023 03:03

Base automatically changed from release/v0.21.1 to main June 30, 2023 20:18

brimoor force-pushed the bugfix/iss-3185 branch from ac766ab to 7f21f2c Compare July 1, 2023 04:41

benjaminpkane and others added 13 commits July 5, 2023 13:22

Release v0.21.2 (#3251)

20b0b6e

* update sidebar test, fix small regressions (#3250) * remove next index ref, fix string filter is matching default (#3249) * release notes * package bumps * disabled e2e * lint * fix mac arm64 * Teams v1.3.2 release note * rm e2e * add main

adding new vector integrations

505753f

bumping package versions

ae19131

adding release notes

19946ee

or count label tags (#3267)

95b491d

has group slices only when group dataset (#3262)

2ecdea1

Updating Session.close() (#3253)

57e7103

* kill server on session close * close desktop app * cleanup * lint * lint * is_open * rm remote warning

Embedded frame label fixes (#3256)

3910ba6

* add frame cases to dynamic label tests * embedded frame label fixes * linting

adding a merge_sample() method

fc96c8a

Minor typo fixes

d1606a2

Merge pull request #3272 from NeoKish/fix-faq-typo-pr

f77bbcc

Minor typo fixes

adding test for one() method

4aa2a5f

Merge pull request #3274 from voxel51/feature/merge-sample

dfa4d96

Adding merge_sample() method

brimoor and others added 20 commits July 10, 2023 18:14

updating release notes

9946b67

tweak

7b4ce09

documenting list bucket perms

1b28d03

Fix sidebar matching on label fields (#3270)

1dde7ee

* only matches * db_field bug and path fix

Suppress errors due to none fields (#3275)

ae6bcfc

* don't throw error * add debug msg

import order

a23b864

Fixing #3277 (#3279)

ca552ff

* fixing filter_keypoints() bug * adding dynamic doc test

interactive typo fix for Dave

8269f7b

removing persistent dataset usage in unit tests

fddb3cf

Merge branch 'release/v0.21.3' of https://github.com/voxel51/fiftyone …

4c08de1

…into release/v0.21.3

add to release notes

95b0e8f

whitespace

2638bb8

Merge branch 'main' into release/v0.21.3

86b5fc4

updating release notes

0778dc6

Minor fixes (#3283)

f20b523

docs tweaks

d44d015

documenting cache=True feature

5e534e8

always reload schema before modifying it

636b3e2

tweaking message

28daa74

brimoor force-pushed the bugfix/iss-3185 branch from 7f21f2c to 28daa74 Compare July 12, 2023 04:41

Merge branch 'develop' into bugfix/iss-3185

a5a7051

brimoor changed the base branch from main to develop July 13, 2023 00:17

brimoor added 2 commits July 12, 2023 20:19

Merge branch 'develop' into bugfix/iss-3185

d2907ae

Merge branch 'develop' into bugfix/iss-3185

4ad5b8f

brimoor closed this Jul 17, 2023

brimoor deleted the bugfix/iss-3185 branch July 17, 2023 01:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Improved robustness of concurrent schema updates #3186

[WIP] Improved robustness of concurrent schema updates #3186

brimoor commented Jun 12, 2023 •

edited

Loading

codecov bot commented Jun 12, 2023 •

edited

Loading

allenleetc commented Jun 12, 2023

brimoor commented Jun 12, 2023

brimoor commented Jul 17, 2023

[WIP] Improved robustness of concurrent schema updates #3186

[WIP] Improved robustness of concurrent schema updates #3186

Conversation

brimoor commented Jun 12, 2023 • edited Loading

codecov bot commented Jun 12, 2023 • edited Loading

Codecov Report

allenleetc commented Jun 12, 2023

brimoor commented Jun 12, 2023

brimoor commented Jul 17, 2023

brimoor commented Jun 12, 2023 •

edited

Loading

codecov bot commented Jun 12, 2023 •

edited

Loading