Add offline support #137
Conversation
😻 This is a huge step forward, and the reload times are fantastic with e.g. pyodide. Works as advertised when turning off wifi! We'd probably want to enable even more goodies, e.g. have a pyodide kernel shared between multiple tabs. But indeed, the various deployment gotchas are very real, so it needs to be easy to turn off at build time for someone who knows they will be deploying someplace not so fun. Also, I think before landing this, we'd also want to land #118 to get deduplicated, cache-busting assets, so at least first-party stuff doesn't have surprising stale experiences on the docs site, especially since we're tracking upstream alphas now. As we want this to work on every page, and likely know whether we are in a service worker, perhaps we move all this to the … |
Yep, this all makes sense. Let's wait a bit until we merge this. I'll try to keep this up to date. I think it would be nice to also add a Manifest so that users can install this as a web-app. Maybe the service worker should only be active when this app is served as a web-app. This would be another method to circumvent some of the issues. Btw, the documentation generated a working example of this: https://jupyterlite--137.org.readthedocs.build/en/137/_static/lab/index.html |
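Something like this sketch could gate the service worker on web-app mode (illustrative only, not code from this PR; the service-worker.js path is a placeholder):

```ts
// Sketch: only register the service worker when the site is running as an
// installed web app (launched via a manifest with "display": "standalone").
const isStandalone = window.matchMedia('(display-mode: standalone)').matches;

if (isStandalone && 'serviceWorker' in navigator) {
  // 'service-worker.js' is a placeholder path, not this PR's actual file.
  navigator.serviceWorker.register('service-worker.js');
}
```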
Oh yeah, tried it out immediately, did the ceremonial network-turn-off and everything! This really makes slightly-larger-than-trivial compute reasonable for a documentation site. |
So with #173, we've got some more structure in place for configuring how a site builds, etc. I left a placeholder for the service worker stuff, but wasn't sure how to proceed. It would be interesting to get … |
Another angle: the webpack docs point out workbox which seems to make some of the stuff a little more manageable over time. Not sure how this would play with our desire to be able to tweak things after the webpack build, but might still be interesting. |
I'd like to take over this PR and rebase it, if that's fine with everyone. I've been looking at service workers for the past few working days, and at how to make use of their advantages from the Python kernel. I'm not only interested in the caching logic they bring (for offline support etc.); I'm also interested in the Python kernel being able to synchronously access data from the virtual file system (local storage). Since the Python kernel runs in a web worker, it can make blocking HTTP requests that are intercepted by the service worker, and the service worker can answer with whatever data it finds. We could monkey-patch the open global function in Python … |
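To illustrate the blocking-request trick (a sketch, not code from this PR): synchronous XMLHttpRequest is still permitted inside web workers, so the kernel's worker can block until the service worker answers. The /api/drive endpoint below is a hypothetical name:

```ts
// Sketch: runs inside the kernel's web worker. Synchronous XHR is still
// permitted in workers, so the kernel can block until the service worker
// responds. The '/api/drive' endpoint is hypothetical.
function readFileSync(path: string): string {
  const xhr = new XMLHttpRequest();
  xhr.open('POST', '/api/drive', false); // third argument false => synchronous
  xhr.setRequestHeader('Content-Type', 'application/json');
  xhr.send(JSON.stringify({ method: 'readFile', path }));
  if (xhr.status !== 200) {
    throw new Error(`readFile failed for ${path}: ${xhr.status}`);
  }
  return xhr.responseText;
}
```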
@martinRenou I was just thinking about this. I think the right approach for flexibility would be to register a very simple service worker which just allows the registration of URL handlers. So one could make a cache handler extension, a sleep extension, an import extension. It would need some way to identify which main browser thread owns a given web worker client; I'm not sure if that can be done through fetch request detection in the service worker, or if it needs a wrapper for the web worker. |
Oh hang on, the client ID, at least in Chrome, appears to be for a whole session, i.e. the main page and its workers. That's easier than I thought. |
I just checked, and it is trivial to associate the client IDs of web workers with the client ID of the main window.
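Something like this sketch, relying on the Chrome behavior above (illustrative, not code from this PR):

```ts
// Sketch (service worker): in Chrome, fetches coming from a page's
// dedicated workers report the page's client ID, so the owning window
// can be looked up directly from any intercepted request.
declare const self: ServiceWorkerGlobalScope;

self.addEventListener('fetch', (event: FetchEvent) => {
  event.waitUntil(
    self.clients.get(event.clientId).then((client) => {
      // Same ID whether the request came from the page or its workers.
      console.log(event.request.url, 'belongs to client', client?.id);
    })
  );
});
```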
So a basic URL handler API for pyodide or similar stuff, where you essentially want synchronous calls to async JavaScript things (files, sleep, etc.), could look something like the sketch below.
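A possible shape for that API, as a sketch only; every name here (registerURLHandler, the /sw-api/ prefix) is illustrative and not something defined by this PR:

```ts
// Sketch: a barebones service worker that only delegates special URLs to
// registered handlers. registerURLHandler and '/sw-api/' are illustrative.
declare const self: ServiceWorkerGlobalScope;

type URLHandler = (request: Request) => Promise<Response>;
const handlers = new Map<string, URLHandler>();

function registerURLHandler(prefix: string, handler: URLHandler): void {
  handlers.set(prefix, handler);
}

self.addEventListener('fetch', (event: FetchEvent) => {
  const { pathname } = new URL(event.request.url);
  for (const [prefix, handler] of handlers) {
    if (pathname.startsWith(prefix)) {
      event.respondWith(handler(event.request));
      return;
    }
  }
  // Anything unrecognized falls through to the network as usual.
});

// Example: emulate a synchronous sleep for a blocked web worker by simply
// delaying the response to its blocking request.
registerURLHandler('/sw-api/sleep', async (request) => {
  const { ms } = await request.json();
  await new Promise((resolve) => setTimeout(resolve, ms));
  return new Response('ok');
});
```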
I think caching should maybe be handled in the main service worker for performance reasons. But the rest of the service worker should stay absolutely barebones: it just takes requests and turns them into Jupyter messages if they are a special PUT request. |
Thanks for your comments @joemarshall! My comments about monkey-patching the open global function in Python are invalidated now, so please discard them. I'm exploring implementing a custom FileSystem (in the emscripten sense) that we mount for Python to use, and that exposes the files of the current JupyterLab drive. The work is done in #655 (not all my code is pushed yet). The problem is that emscripten file systems must be synchronous (there is currently some work in emscripten to make those APIs async, but it's not finished/released yet; see the discussion in the PR mentioned above). So I think using a service worker the way you're describing above is needed, in order to turn async file/directory fetches into synchronous tasks:

> 1. Serviceworker gets a POST request to a special URL with some content (as JSON or something).
> 2. It makes this into a promise and postMessages it to the correct main window (and returns a new unresolved promise to the fetch request).
> 3. The main app converts that to a Jupyter message and sends it wherever it needs to go.
> 4. The handler extension makes a response and sends that back to the main app, which posts it back to the serviceworker, which then resolves the promise made in step 2.
> 5. Tada, the XMLHttpRequest in the web worker finishes, we're all good.

This makes perfect sense. I was reading this morning about BroadcastChannel (https://developer.mozilla.org/en-US/docs/Web/API/BroadcastChannel), which I think will be perfect for that:

* it's bidirectional
* you can have multiple channels (one for input, one for sleep, one for the file system, etc.) |
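As a sketch of what steps 1, 2, and 4 above could look like inside the service worker (all names, including the /sw-api/ prefix and the pending map, are illustrative, not code from this PR):

```ts
// Sketch (service worker): steps 1-2 and 4 above. A blocking POST from a
// web worker is parked as an unresolved promise, forwarded to the owning
// window, and resolved when the main app posts the result back.
declare const self: ServiceWorkerGlobalScope;

let nextId = 0;
const pending = new Map<number, (response: Response) => void>();

self.addEventListener('fetch', (event: FetchEvent) => {
  const url = new URL(event.request.url);
  if (!url.pathname.startsWith('/sw-api/')) return; // normal traffic

  const id = nextId++;
  event.respondWith(
    (async () => {
      const client = await self.clients.get(event.clientId);
      const body = await event.request.text();
      // Step 2: forward to the main window, return an unresolved promise.
      client?.postMessage({ id, path: url.pathname, body });
      return new Promise<Response>((resolve) => pending.set(id, resolve));
    })()
  );
});

self.addEventListener('message', (event: ExtendableMessageEvent) => {
  // Step 4: the main app posts the handler's result back; resolving the
  // parked promise completes the worker's blocking request.
  const { id, result } = event.data;
  pending.get(id)?.(new Response(JSON.stringify(result)));
  pending.delete(id);
});
```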
I think normal postMessage makes more sense, because e.g. if you call input, you only want to read from things in the same window the kernel was launched from. Instead of channels, the messages could just have a URL included, so that they can be routed to server or client plugins by JupyterLiteServer the same way other messages are routed (I guess messages start off at the front end, because presumably that is what has access to the window object). I think that makes more sense than adding another form of communication to the whole thing. |
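A sketch of that URL-based routing on the main-app side; the route table and message shape are illustrative, not JupyterLiteServer's actual API:

```ts
// Sketch (main app): route messages from the service worker to handlers
// by URL, mirroring how other requests are routed. The route table and
// message shape are illustrative, not JupyterLiteServer's actual API.
type MessageHandler = (body: unknown) => Promise<unknown>;

const routes = new Map<string, MessageHandler>();
routes.set('/sw-api/stdin', async () => prompt('input:'));

navigator.serviceWorker.addEventListener('message', async (event) => {
  const { id, path, body } = event.data;
  const handler = routes.get(path);
  const result = handler ? await handler(body) : null;
  // Send the result back so the service worker can resolve the request.
  navigator.serviceWorker.controller?.postMessage({ id, result });
});
```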
This PR adds a service worker to the lab folder. It will intercept all requests to the server and store the results in a cache. The next time you load the website, the content will be loaded from the cache instead of from the server (while updating the cache in the background). If there is no internet access, the /lab address should still work.

Note that this might make it harder to debug, because it will serve stale content by default. When debugging the application (e.g. in dev_mode on localhost) you should enable "Bypass for network" in the Application panel of the browser dev tools.
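The strategy described above is essentially stale-while-revalidate; a minimal sketch, where the cache name is a placeholder:

```ts
// Sketch (service worker): stale-while-revalidate, as described above.
// Serve cached content immediately and refresh the cache in the background.
declare const self: ServiceWorkerGlobalScope;

const CACHE = 'jupyterlite-offline'; // placeholder cache name

self.addEventListener('fetch', (event: FetchEvent) => {
  if (event.request.method !== 'GET') return; // only cache GET requests

  event.respondWith(
    (async () => {
      const cache = await caches.open(CACHE);
      const cached = await cache.match(event.request);

      // Kick off a background refresh; swallow failures when offline.
      const refresh = fetch(event.request)
        .then((response) => {
          cache.put(event.request, response.clone());
          return response;
        })
        .catch(() => undefined);

      // Serve stale content immediately if we have it, else wait for the
      // network; fail with 503 only if both are unavailable.
      return cached ?? (await refresh) ?? new Response('offline', { status: 503 });
    })()
  );
});
```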
Maybe we could disable the service worker for localhost altogether.
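A sketch of that opt-out at registration time; the service-worker.js path is a placeholder, not this PR's file name:

```ts
// Sketch (main app): skip service worker registration during local
// development so debugging always hits the live server.
const isLocalhost = ['localhost', '127.0.0.1'].includes(location.hostname);

if (!isLocalhost && 'serviceWorker' in navigator) {
  // 'service-worker.js' is a placeholder path, not this PR's file name.
  navigator.serviceWorker.register('service-worker.js');
}
```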