IndexedDB support for wasm/browser environments #414

gnunicorn · 2021-11-17T20:43:21Z

Hi,

this PR attempts to implement a permanent storage solution for web browser environments using the browser's own IndexedDB.

Current progress:

Things I'd very much like an opinion/input on:

this is currently hidden behind a target+feature-flag. Do we want to keep it this way or assume the target also means we are in the browser and thus assume that indexeddb should be used (effectively dropping the feature-flag)?
the current base client uses a sync new to instantiate, but for indexeddb it must be async. Right now, this takes a feature-specific approach, meaning the API is async for the browser but sync otherwise. Do we want to streamline the API and make it async in both cases?
do we still want the option of having an in-memory-store in wasm, too? Right now the feature flag is either-or
I am re-using the sled-crypto to encrypt on the fly, maybe we should streamline that over multiple database implementations?

poljar · 2021-11-18T10:14:29Z

this is currently hidden behind a target+feature-flag. Do we want to keep it this way or assume the target also means we are in the browser and thus assume that indexeddb should be used (effectively dropping the feature-flag)?

Yeah the browser might be a special place where everybody just wants to use the indexeddb store, I don't think anyone will want to run bots/bridges in the browser.

the current base client uses a sync new to instantiate, but for indexeddb it must be async. Right now, this takes a feature-specific approach, meaning the API is async for the browser but sync otherwise. Do we want to streamline the API and make it async in both cases?

Would using runBlocking be horrible there? I try to avoid async constructors and new() shouldn't do any real writing either way. The feature-specific approach would be hard to document, so we most likely want to have the same function signature.

do we still want the option of having an in-memory-store in wasm, too? Right now the feature flag is either-or

I don't think so, no. The memory store is there for two reasons:

1. Because some platforms may not support the sled store
2. So people don't need to set a store path for ephemeral clients, i.e. scripts that may not even sync.

Number 1 is largely going away and people won't use number 2 in a browser.

I am re-using the sled-crypto to encrypt on the fly, maybe we should streamline that over multiple database implementations?

We could put the StoreKey into a separate crate so each store can depend on that crate. We should also move each store implementation, except the memory store, into a separate crate.

It was always the intention to have only the memory store be part of the SDK crates but due to time constraints shortcuts were taken.

improve security (currently keys leak information)

This is a bit of a tricky one and we'll likely need to measure the performance impact. It was also skipped due to time constraints and some things are deliberately left unencrypted. For example, the user ids that are part of a room, those need to be fetched every time a message is sent. We likely could keep the performance good enough using some smarter caching.

A scheme that avoids leaking the keys I had in mind is this:

Remove any occurrence of key decoding
Generate a random salt when we create the store and store it as well.
Modify EncodeKey to hash each part of the key separately and join the keys as usual using a null byte (this ensures that we can still use scan_prefix()).

The salt, of course, will be used for the key hashing.
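The scheme above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the SDK's EncodeKey: the names hash_part and encode_key are hypothetical, the hash is written out as hex so the null-byte separator stays unambiguous, and each part is terminated (rather than only joined) by the separator so that a shorter key is a byte prefix of a longer one. std's DefaultHasher is used purely as a stand-in to keep the example dependency-free; it is not a cryptographic hash and its output is not guaranteed to be stable across Rust releases, so a real store would want a proper keyed hash.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::Hasher;

/// Separator between key parts (a null byte, as suggested above).
const SEPARATOR: u8 = 0x00;

/// Hash a single key part together with the store-wide salt.
/// ASSUMPTION: DefaultHasher is only a placeholder for a real keyed hash.
fn hash_part(salt: &[u8], part: &str) -> String {
    let mut hasher = DefaultHasher::new();
    hasher.write(salt);
    hasher.write(part.as_bytes());
    // Hex output never contains the separator byte.
    format!("{:016x}", hasher.finish())
}

/// Hash every part separately and terminate each with the separator,
/// so the encoding of ["a"] is a byte prefix of the encoding of ["a", "b"].
fn encode_key(salt: &[u8], parts: &[&str]) -> Vec<u8> {
    let mut out = Vec::new();
    for part in parts {
        out.extend_from_slice(hash_part(salt, part).as_bytes());
        out.push(SEPARATOR);
    }
    out
}
```

Because each part is hashed on its own, encode_key(salt, &[room_id]) can still serve as the prefix for a scan over all (room_id, user_id) keys, while the stored key no longer contains the plain identifiers.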

That all being said, thanks for working on this.

jplatte · 2021-11-18T10:42:08Z

Re. feature flag, I think the question isn't "does somebody want to run a bridge / bot in a browser", it's "does somebody want to use the SDK with state storage in a non-browser wasm environment". I think it's definitely realistic that somebody would want that, so keeping this behind a feature flag seems like a good idea.

poljar · 2021-11-18T11:03:00Z

Re. feature flag, I think the question isn't "does somebody want to run a bridge / bot in a browser", it's "does somebody want to use the SDK with state storage in a non-browser wasm environment". I think it's definitely realistic that somebody would want that, so keeping this behind a feature flag seems like a good idea.

That's a fair point, I'm a bit biased since I would just use it natively in that case, but someone is bound to be interested in the sandboxing features a WASM environment provides.

gnunicorn · 2021-11-18T12:14:23Z

Re feature-flag: yeah, @jplatte, that was my thinking. Especially since more and more non-browser platforms offer some form of WASM support, I don't think we can infer one from the other. Node, for example, has wasm support but won't have indexeddb support... Keeping it as a feature-flag then. Thanks for the input!

In light of this way of thinking of it, I'd leave the memory store as an option for wasm32 non-indexeddb environments (like nodejs), too...
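To make the outcome of the feature-flag discussion concrete, here is a minimal sketch of gating on both the target and a feature, with the memory store as the fallback. The feature name indexeddb_store, the StoreKind enum and default_store_kind are hypothetical names chosen for illustration; they are not taken from the PR.

```rust
/// Which state store a build would use by default (illustrative only).
#[derive(Debug, PartialEq)]
enum StoreKind {
    IndexedDb,
    Memory,
}

/// Pick IndexedDB only when BOTH conditions hold: we compile for wasm32
/// and the (hypothetical) `indexeddb_store` feature is enabled.
/// A wasm32 build without the feature, e.g. for a non-browser runtime,
/// falls back to the memory store, as does every native build here.
fn default_store_kind() -> StoreKind {
    if cfg!(all(target_arch = "wasm32", feature = "indexeddb_store")) {
        StoreKind::IndexedDb
    } else {
        StoreKind::Memory
    }
}
```

Keeping the target check and the feature check separate is what allows a wasm32 build for a non-browser runtime to opt out of IndexedDB.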

Would using runBlocking be horrible there? I try to avoid async constructors and new() shouldn't do any real writing either way. The feature-specific approach would be hard to document, so we most likely want to have the same function signature.

While I agree that new being async is a bit awkward and something that should generally be avoided, the API level we are talking about (the client) already has a bunch of async constructors, namely new_from_user_id and new_from_user_id_with_config, which can't be converted easily. In light of these, I'd argue that it actually makes usage of the API more consistent if all constructors were async and/or we dropped the generic new in favor of specific async constructors only. I fear that a runBlocking for indexeddb won't stand the test of time anyway if we want to support more and other storages... (and blocking in startup procedures sounds really bad, though I am not sure whether asking for indexeddb would ever actually be blocking - but afaik a browser could prompt the user before allowing usage and thus significantly delay the startup process...)

improve security (currently keys leak information)

This, as well as

We should also move each store implementation, except the memory store, into a separate crate.

I'll leave for another PR another time. For simplicity I'll try to get the indexeddb to feature parity with sled store first and then we can investigate these further.

jplatte · 2021-11-18T12:23:09Z

👍🏼 from me for async constructor, regardless of feature flags. I don't really see what harm there would be in the additional .await.

poljar · 2021-11-18T12:51:14Z

In light of this way of thinking of it, I'd leave the memory store as an option for wasm32 non-indexeddb environments (like nodejs), too...

nodejs would probably use neon to create bindings; this way they get access to native threads. I was thinking more of something like wasmer, which provides an embeddable WASM runtime.

While I agree that new being async is a bit awkward and something that should generally be avoided, the API level we are talking about (the client) already has a bunch of async constructors, namely new_from_user_id and new_from_user_id_with_config, which can't be converted easily. In light of these, I'd argue that it actually makes usage of the API more consistent if all constructors were async and/or we dropped the generic new in favor of specific async constructors only.

The difference is that those methods send out requests. I prefer the non-async constructor because a client might want to create the object in the UI thread, which might not be a Tokio thread. Then again, a more full-featured client will certainly use Client::new_from_user_id(), which makes my point moot. If people need to create the object on such a UI thread, they can certainly use runBlocking themselves.
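For readers following the constructor discussion, here is a rough sketch of what "async constructor plus blocking on it yourself" could look like on a native thread. Everything in it is an assumption for illustration: the Client struct, its store_name field and open_store are hypothetical and unrelated to the SDK's real types, and block_on is a tiny std-only executor standing in for whatever a runtime provides. It only applies to native threads; a browser main thread cannot park itself while waiting for IndexedDB callbacks.

```rust
use std::future::Future;
use std::pin::pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};
use std::thread::{self, Thread};

/// Wakes the blocked thread when the future can make progress.
struct ThreadWaker(Thread);

impl Wake for ThreadWaker {
    fn wake(self: Arc<Self>) {
        self.0.unpark();
    }
}

/// Minimal executor: drive one future to completion on the current thread.
fn block_on<F: Future>(fut: F) -> F::Output {
    let mut fut = pin!(fut);
    let waker = Waker::from(Arc::new(ThreadWaker(thread::current())));
    let mut cx = Context::from_waker(&waker);
    loop {
        match fut.as_mut().poll(&mut cx) {
            Poll::Ready(out) => return out,
            Poll::Pending => thread::park(),
        }
    }
}

/// Hypothetical client; not the SDK's Client.
struct Client {
    store_name: String,
}

impl Client {
    /// Async constructor: opening the store may have to wait on the backend.
    async fn new(store_name: &str) -> Result<Client, String> {
        let store_name = open_store(store_name).await?;
        Ok(Client { store_name })
    }
}

/// Placeholder for an asynchronous store-opening step.
async fn open_store(name: &str) -> Result<String, String> {
    if name.is_empty() {
        return Err("store name must not be empty".to_owned());
    }
    Ok(name.to_owned())
}
```

Async callers simply write Client::new("name").await, while code on a non-async native thread can wrap the same call as block_on(Client::new("name")), so a single signature serves both cases.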

gnunicorn · 2021-12-27T15:40:15Z

@ara4n yes, I was messing with that in the CI as well. It appears this only works with Node 14 and webpack5 reliably because of various factors (the long-name-stack-size-problem as well as the env-runtime-problem that is coming from libolm). As we have a truly-rust-olm on the horizon we've agreed to not waste more time on this version (other than the CI job for Node 14 showing it working) and instead switch to the other lib asap.

ara4n · 2021-12-27T21:35:03Z

turns out that the magic to make this work for me is:

increase the stack size (as per above)
downgrade emscripten to 2.0.27, such that it shakes out unused symbols correctly (otherwise it chokes trying to pull in an external snprintf & malloc/free which don't exist)
and also I needed to work around a webpack 5 bug similar to rust-random/getrandom#224 ("This crate hits a buggy Webpack 5 warning when building for wasm32-unknown"), which I did by adding this to webpack.config.js:

  ignoreWarnings: [
    (warning) =>
      warning.message ===
      "Critical dependency: the request of a dependency is an expression",
  ],

Alternatively, one can manually snip out the unneeded code with:

# Snip each unneeded symbol out of the wasm binary, one symbol per pass
for i in _ZN3olm7Session8describeEPcm olm_session_describe aes_encrypt_ccm aes_decrypt_ccm
do
	wasm-snip index_bg.wasm -o index_bg_fixed.wasm "$i"
	mv index_bg_fixed.wasm index_bg.wasm
done

gnunicorn · 2021-12-27T22:36:39Z

note: you can get wasm-snip here.

gnunicorn · 2022-02-02T14:28:29Z

Updated to latest main, including the newly added remove-room feature. The only failing test is marked as continue-on-error to remind us that default wasm-crypto doesn't work (though this isn't blocking merges), while there's a test within an environment (node14, emcc2.0) that is working.

should be good to merge now.

poljar · 2022-02-02T14:57:27Z

.github/workflows/ci.yml

+    # hence the tests
+    name: ${{ matrix.name }} WASM test
+    runs-on: ubuntu-latest
+    continue-on-error: ${{ matrix.experimental }}


This will still turn our CI state into a ❌, oh well.

yeah, github actions don't have a proper ignore-failure switch ... what I could offer is to comment the specific test (which would also mean we don't waste these resources every time) until we have it fixed up ...?!?

Yeah, sounds good.

gnunicorn added 4 commits November 15, 2021 20:05

first steps

8a2732a

big batch

b8af613

activating browser tests

5bba24c

generic tests passing'

e7ec861

gnunicorn added 2 commits November 18, 2021 12:40

implement custom value

e603496

move and unify usage of store_key

a2b80f0

gnunicorn added 3 commits November 18, 2021 13:36

fixing style

3e11e0d

first attempt at creating a CI job for wasm tests

7df36e3

fixing typo

b16c660

gnunicorn added 2 commits November 18, 2021 13:54

minor clippy fixes

3fcfe98

clarify API

10d4fe5

stoically mentioned this pull request Nov 18, 2021

WASM tracking #35

Closed

8 tasks

gnunicorn added 12 commits November 18, 2021 17:40

style fix

76454e6

clean up API and corresponding docs

917e901

fixing browser test

9e83bcb

infrastructure for indexeddb cryptostore

94d3ffa

Merge remote-tracking branch 'upstream/main' into ben-wasm-store

4c0bbeb

Adding result-signature support for wasm32 async-test-macro

5cf56ad

fix up CI test

4268016

Implement helper for wasm32 MilliSecondsSinceUnixEpoch

4c60db9

minor wasm32 fixes

32a8ec7

update trait impl of indexeddb

47dff21

various minor cleanups of unused imports

6d9920c

Implement saving

229a81b

trying older emscripten

3cde28a

gnunicorn added 2 commits December 27, 2021 23:27

fix broken now call

15cdaea

testing emcc versions

9ff4609

gnunicorn added 18 commits December 27, 2021 23:37

fixin style

8ce622d

create sync token store

3468773

Merge remote-tracking branch 'upstream/main' into ben-wasm-store

e1ad8fe

[fix ci] remove unused import

0937c2e

fix broken merge

278d934

fixing style again

fb81ebf

fixing style again

92044ce

Merge remote-tracking branch 'upstream/main' into ben-wasm-store

a0f2e38

fixing build warnings and clippy lints

93c75c1

Merge remote-tracking branch 'upstream/main' into ben-wasm-store

e2c6dc3

Merge remote-tracking branch 'upstream/main' into ben-wasm-store

ea959a1

fixing style

b4d5ad9

fixing docs for await

c07c284

fixing linux tests

7e008d0

fixing indexeddb types merge

666bec4

implementing room removal for indexeddb

b8d93d0

fixing style

64709f1

switch tokio:test to async_test

990b897

gnunicorn requested a review from poljar February 2, 2022 14:26

poljar approved these changes Feb 2, 2022


disable broken test

fa60881

gnunicorn merged commit 1286357 into matrix-org:main Feb 3, 2022

richvdh mentioned this pull request Dec 13, 2023

Configure Instant wasm polyfill to use monotonic time #2935

Merged
