Migrate to Dask #242

Closed
dlqqq opened this issue Jun 23, 2023 · 0 comments · Fixed by #244
Labels: enhancement (New feature or request)

dlqqq (Member) commented Jun 23, 2023

Problem

We are currently using Ray in our backend to parallelize certain tasks. However, for our use case, Dask may be more appropriate. Here is our team's rationale:

  • Dask is primarily a single-node parallel computing library for Python, whereas Ray is oriented toward distributed (multi-node) compute.
    • In general, Dask seems better optimized for our single-node use case, offering more granular control over how workers are run on the hardware.
  • Ray is blocking cross-platform Python 3.11 compatibility for Jupyter AI.
  • Ray does not allow for multithreading.
    • From our recent investigation (use Dask to parallelize document learning #182), learning tasks (via /learn) see a ~2x performance gain from multiprocessing and a ~10x gain from multithreading (see the sketch after this list).
    • Forking processes incurs very significant overhead due to the size of the libraries Jupyter AI imports: the Dask dashboard shows roughly 3-4 GB of Python code being pickled and unpickled for each process.
    • Furthermore, most of our long-running tasks spend the majority of their lifetime waiting on I/O, so Python's GIL is not the limiting factor in performance here.
  • Spawning Ray actors significantly slows server start time, because each initialized Ray actor requires spawning a new process, which, as discussed above, is expensive for Jupyter AI. Furthermore, Ray actors consume a substantial amount of memory (again, because each is a separate process) despite being idle for most of their lifetime.
    • This is not a problem with Ray itself, but with our initial implementation, which consists almost exclusively of Ray actors.
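
To make the thread-based direction concrete, here is a minimal sketch of a single-node, threads-only Dask setup. The worker/thread counts and the load_document helper are illustrative placeholders, not the actual Jupyter AI implementation:

```python
from dask.distributed import Client, LocalCluster

# Threads, not forked processes: avoids pickling gigabytes of imported
# libraries into each worker, and is sufficient because /learn is I/O-bound.
cluster = LocalCluster(
    n_workers=1,           # single worker on the Jupyter server node
    threads_per_worker=8,  # parallelism comes from threads, not forks
    processes=False,       # keep everything in-process
)
client = Client(cluster)

def load_document(path: str) -> str:
    """I/O-bound work; the GIL is released while waiting on the filesystem."""
    with open(path) as f:
        return f.read()

futures = client.map(load_document, ["a.md", "b.md", "c.md"])
docs = client.gather(futures)
```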

Proposed Solution

Implement a comprehensive proof-of-concept for migrating the backend to Dask.
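
As a rough illustration of the direction (not the final design), a call site that previously invoked a Ray actor method could instead submit work to a shared Dask client. learn_documents and the file name below are hypothetical placeholders:

```python
from dask.distributed import Client

client = Client(processes=False)  # single-node, in-process threaded workers

def learn_documents(paths: list[str]) -> int:
    """Stand-in for the /learn work previously performed inside a Ray actor."""
    return len(paths)

# Instead of something like `actor.learn.remote(paths)`, submit the function
# to Dask and resolve the resulting future.
future = client.submit(learn_documents, ["notebook.ipynb"])
print(future.result())
```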

Additional context

We have to be careful to avoid thread starvation, i.e. to handle the case in which a worker thread (for whatever reason) spends a significant amount of time executing code that blocks its kernel thread.
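
One possible mitigation, sketched with dask.distributed's secede()/rejoin() APIs: a task that is about to run long-blocking code can secede from the worker's thread pool so it does not hold a scheduling slot, then rejoin once it finishes. blocking_embed is a hypothetical placeholder, and this is only one option, not a settled design:

```python
import time
from dask.distributed import Client, secede, rejoin

def blocking_embed(chunk: str) -> str:
    time.sleep(5)  # stand-in for kernel-thread-blocking work
    return chunk.upper()

def guarded_task(chunk: str) -> str:
    secede()  # free this worker thread's slot so other tasks can run
    try:
        return blocking_embed(chunk)
    finally:
        rejoin()  # block until a slot frees up, then return to the pool

client = Client(processes=False)
print(client.submit(guarded_task, "hello").result())
```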
