Fix syncing dirty pages on IOS #5993

nicola-cab · 2022-11-03T15:59:09Z

What, How & Why?

On IOS, we have been seeing a huge increase of database corruption issues.

I went down bisecting a little bit our code base, and it seems like the problem is related to the fact that as per the current implementation we never sync dirty pages up until this barrier is hit:

void GroupWriter::commit(ref_type new_top_ref) {
....
....
flush_all_mappings();
if (!disable_sync)
    m_alloc.get_file().barrier();
.....
....
}

The barrier implementation itself is platform specific, and on Linux for example this does not cause any issues, because msync() is basically equal to fsync(). On IOS, this does not look like the normal behaviour. Thus, there is the chance of losing some data if the dirty page is not msynced before we hit the barrier. Moreover, on IOS, even the barrier does not guarantee that the page will actually reach disk. We need to explicitly call msync for each page.

The easiest way to reproduce this is to start a write transaction, that writes several bytes of data (my test consisted of 2 reasonably long strings), and crash the platform in the middle of the commit itself. Most likely, some of the dirty pages won't be written, causing havoc in the database file.

The same behaviour could happen during a normal commit, thus some insertion or deletion could fail apparently spuriously because of some data that got corrupted during a previous commit.

Calling msync for each page proved to be effective, and the database was never corrupted.

We need to add a specific tests for this case, in which we write to disk, and we crash the platform.

Corruption:
Fixes: #5972
Fixes: #5718
Fixes: #5859
Fixes: #5976
FIxes: #5975
Fixes: #5970
Fixes: #5758
Fixes: #5761
Fixes: #5298
Fixes: #5299
Fixes: #5941

Encryption
Fixes: #5810
Fixes: #5811

☑️ ToDos

📝 Changelog update
🚦 Tests (or not relevant)
C-API, if public C++ API changed.

src/realm/group_writer.cpp

…into nc/fix_corruptions_ios

…uptions_ios

call msync for syncing dirty pages

0146fcc

nicola-cab requested review from jedelbo and finnschiermer November 3, 2022 15:59

cla-bot bot added the cla: yes label Nov 3, 2022

finnschiermer added 2 commits November 4, 2022 11:55

handle multiple platforms and add a barrier at the end of commit

3ee89ce

cleanup

e166d5d

finnschiermer force-pushed the nc/fix_corruptions_ios branch from e98a351 to e166d5d Compare November 4, 2022 12:28

finnschiermer added 4 commits November 4, 2022 14:42

remember to unmap when needed

1ed5e16

Explicitly control flush/sync of MapWindows

4fb26c4

we must flush encryption cache before unmapping

7f5d48c

fix misleading comment

2d61168

nicola-cab commented Nov 4, 2022

View reviewed changes

src/realm/group_writer.cpp Outdated Show resolved Hide resolved

nicola-cab added 2 commits November 7, 2022 12:53

changelog entry

20852ee

pull from master and resolve conflicts

468308a

nicola-cab commented Nov 7, 2022

View reviewed changes

src/realm/group_writer.cpp Outdated Show resolved Hide resolved

fix ensuring timely flush/sync in encryption layer

0ae00c6

kneth mentioned this pull request Nov 8, 2022

App crashes on startup realm/realm-js#5083

Closed

nicola-cab added 3 commits November 8, 2022 12:55

pull master and fix conflicts

3d938ad

Merge branch 'nc/fix_corruptions_ios' of github.com:realm/realm-core …

9e39c23

…into nc/fix_corruptions_ios

Merge branch 'master' of github.com:realm/realm-core into nc/fix_corr…

55e6c55

…uptions_ios

This was referenced Nov 9, 2022

Realm.init crash on realm->_realm = Realm::get_shared_realm(config); #5937

Closed

Crash on app start realm/realm-swift#8007

Closed

This was referenced Nov 9, 2022

Realm Crashed With realm.delete #5976

Closed

Got crash removing object #5975

Closed

finnschiermer and others added 6 commits November 9, 2022 14:13

minimal fix

21d8362

just a little bit better

72d29d2

sync when you have to

92c98cf

merge minimal fix from fsa/new_corruption_fix

985d61a

restored original files

e133560

select only changes from minimal branch pushed by Finn

7334dd4

finnschiermer and others added 2 commits November 9, 2022 16:16

extended comments

f80560a

Update CHANGELOG.md

41b32ff

tgoyne approved these changes Nov 9, 2022

View reviewed changes

nicola-cab merged commit 495dfb4 into master Nov 9, 2022

nicola-cab deleted the nc/fix_corruptions_ios branch November 9, 2022 18:27

bmunkholm linked an issue Nov 11, 2022 that may be closed by this pull request

Object has been deleted or invalidated. #5809

Closed

BlueCobold mentioned this pull request Dec 2, 2022

Decryption failed - page zero has wrong checksum #5810

Closed

This was referenced Jan 4, 2023

Fatal Exception: realm::KeyNotFound in Realm notification listener thread realm/realm-swift#7704

Closed

App crashes on launch realm/realm-swift#8089

Closed

This was referenced Jan 17, 2023

App keeps crashing on startup #6207

Closed

Merge realm-core #5993 PR realm/realm-js#5281

Closed

brenmcnamara mentioned this pull request Feb 1, 2023

Issue using Realm with Swift Actors realm/realm-swift#7901

Open

finnschiermer mentioned this pull request Jul 25, 2023

Freelist corruption issues #6813

Closed

sync-by-unito bot mentioned this pull request Sep 28, 2023

Overlapping blocks on freelist (on stack: GroupWriter::recreate_freelist()) #7006

Open

github-actions bot locked as resolved and limited conversation to collaborators Mar 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix syncing dirty pages on IOS #5993

Fix syncing dirty pages on IOS #5993

nicola-cab commented Nov 3, 2022 •

edited

Loading

Fix syncing dirty pages on IOS #5993

Fix syncing dirty pages on IOS #5993

Conversation

nicola-cab commented Nov 3, 2022 • edited Loading

What, How & Why?

☑️ ToDos

nicola-cab commented Nov 3, 2022 •

edited

Loading