
[$500] [HOLD for payment 2023-12-20] The whole queue of offline requests are processed when you come back online, delaying online requests #28172

Closed
muttmuure opened this issue Sep 25, 2023 · 33 comments
Assignees
Labels
  • Awaiting Payment: Auto-added when associated PR is deployed to production
  • Daily KSv2
  • External: Added to denote the issue can be worked on by a contributor
  • Help Wanted: Apply this label when an issue is open to proposals by contributors
  • NewFeature: Something to build that is a new item.

Comments

@muttmuure
Contributor

muttmuure commented Sep 25, 2023

https://expensify.slack.com/archives/C05LX9D6E07/p1695105894373289

Problem

When we're offline, we'll build up a ton of requests if we keep navigating. OpenReport is probably the biggest offender. That's bad because we'll process the whole queue once we're back online, so requests that I actually need to execute while online have to wait until the whole queue is processed.

Let's say you have navigated to 5 reports while offline.

When you go online:

We'll start processing the queue.
OpenReport usually takes a couple of seconds per request, so we'll wait roughly 10 seconds for all of the requests to finish.
Because we also queue the onyxUpdates for write requests, we'll need the write operations for all of them to finish before we actually see anything in the report we just navigated to.

Solution

Only call OpenReport exactly once per report.

Use a report metadata key to store whether we have ever called OpenReport for a specific report.
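
A minimal sketch of how that could work, assuming a hypothetical reportMetadata_ Onyx collection key and a hasOnceCalledOpenReport flag (both names are illustrative, not the actual implementation):

```ts
// Hypothetical sketch: skip queueing OpenReport if we've already called it for this report.
import Onyx from 'react-native-onyx';

// Illustrative collection key; the real app defines its keys centrally in ONYXKEYS.
const REPORT_METADATA_PREFIX = 'reportMetadata_';

let reportMetadata: Record<string, {hasOnceCalledOpenReport?: boolean}> = {};
Onyx.connect({
    key: REPORT_METADATA_PREFIX,
    waitForCollectionCallback: true,
    callback: (value) => (reportMetadata = value ?? {}),
});

function openReportIfNeeded(reportID: string) {
    if (reportMetadata[`${REPORT_METADATA_PREFIX}${reportID}`]?.hasOnceCalledOpenReport) {
        // We've already fetched this report once; don't queue another OpenReport.
        return;
    }

    // Record that OpenReport has been called so repeat visits while offline don't re-queue it.
    Onyx.merge(`${REPORT_METADATA_PREFIX}${reportID}`, {hasOnceCalledOpenReport: true});
    // API.write('OpenReport', {reportID}, ...); // actual call elided
}
```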

Current Issue Owner: @sakluger
Upwork Automation - Do Not Edit
  • Upwork Job URL: https://www.upwork.com/jobs/~01697b67b9aa3280da
  • Upwork Job ID: 1742376039165644800
  • Last Price Increase: 2024-01-03
@melvin-bot melvin-bot bot added the Monthly KSv2 label Sep 29, 2023
@muttmuure muttmuure changed the title [HOLD on #reliable-updates] The whole queue of offline requests are processed when you come back online, delaying online requests The whole queue of offline requests are processed when you come back online, delaying online requests Oct 5, 2023
@roryabraham
Contributor

roryabraham commented Oct 19, 2023

One idea for a solution to help with the scenario where there are many essentially duplicate requests in the queue would be to add an idempotencyKey to the request object. In the case of OpenReport, I think this could be some combination of the command name and reportID: OpenReport?reportID=1234. Then whenever we're adding a request to the sequentialQueue, we look for any requests with a matching idempotencyKey and only keep the newest one, replacing any older duplicate requests in the queue with the newest one.

Using this technique we would de-dupe requests in the queue to help reduce the size of the queue when we come back online

[Before / After screenshots of the request queue]

Edit: You wouldn't just keep the newest instance of an idempotent request; I think you'd want to replace the previous instance with the newest one, so that the order of operations is preserved.
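
A rough sketch of that de-dupe-and-replace behavior (the Request shape and queue here are simplified stand-ins for the app's real persisted request queue):

```ts
type Request = {
    command: string;
    data: Record<string, unknown>;
    idempotencyKey?: string; // e.g. 'OpenReport?reportID=1234'
};

const queue: Request[] = [];

function push(request: Request) {
    if (request.idempotencyKey) {
        const existingIndex = queue.findIndex((queued) => queued.idempotencyKey === request.idempotencyKey);
        if (existingIndex !== -1) {
            // Replace the older duplicate in place so the original ordering is preserved.
            queue[existingIndex] = request;
            return;
        }
    }
    queue.push(request);
}
```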

@roryabraham
Contributor

roryabraham commented Oct 19, 2023

If we wanted to take it a step further, I have another idea building off an idea I've proposed before in a different context...

Solution:

Replace the sequential queue with a directed acyclic graph (DAG) of requests, which we can call the RequestGraph.

  • Each request is an object with an ID and an arbitrary set of data. Each request object represents a node in the graph.
  • Each request can have zero or more optional parentRequests. The presence of a parentRequest represents a directed edge in the graph.
  • API.write will add a node (and potentially an edge) to the graph. It will throw an error if the edge introduces a cycle in the graph. 🚫 ♻️
  • The RequestGraph will continuously perform a parallel breadth-first search from every root node in the graph. i.e: whenever a request is added to the graph, find all root nodes in the graph, and for each:
    • Process the request, then process all child requests, then process all their children, etc...
    • If any parent request fails, then all its descendant requests are cancelled
    • Only once there are no more children originating from this root, delete the parent request and all descendants
  • Requests with a parentRequest can reference data from the parent request. Depending on the server response, a parent request can update its data and dependent children will be able to reference that updated data (eliminating the need for the HandleUnusedOptimisticID middleware).
  • Just like with the sequentialQueue, requests can have an idempotencyKey, and if a node with the matching idempotencyKey is found in the graph we replace the old request with the new one.

This means that instead of processing write requests one-by-one in the sequential queue, we can process non-conflicting requests in parallel and only have to process requests that depend on other parent requests to run in sequence. This could massively improve the speed at which we process requests made offline, especially on a slower network.
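
A very rough sketch of what such a RequestGraph could look like (all names and shapes here are illustrative, not a concrete design):

```ts
type GraphRequest = {
    id: string;
    command: string;
    data: Record<string, unknown>;
    parentRequestIDs: string[]; // empty array = root node, can run immediately
    idempotencyKey?: string;
};

class RequestGraph {
    private nodes = new Map<string, GraphRequest>();

    add(request: GraphRequest): string {
        if (this.introducesCycle(request)) {
            throw new Error(`Request ${request.id} would introduce a cycle`);
        }
        // De-dupe: replace an existing node that shares the idempotencyKey,
        // reusing its ID and position so edges from its children stay valid.
        for (const [id, node] of this.nodes) {
            if (request.idempotencyKey && node.idempotencyKey === request.idempotencyKey) {
                this.nodes.set(id, {...request, id});
                return id;
            }
        }
        this.nodes.set(request.id, request);
        return request.id;
    }

    // Roots are nodes whose parents are either absent (already processed) or never specified.
    getRootRequests(): GraphRequest[] {
        return [...this.nodes.values()].filter((node) => node.parentRequestIDs.every((id) => !this.nodes.has(id)));
    }

    // Walk up from the new request's parents; if we ever reach its own ID, adding it would create a cycle.
    private introducesCycle(request: GraphRequest): boolean {
        const stack = [...request.parentRequestIDs];
        const seen = new Set<string>();
        while (stack.length > 0) {
            const current = stack.pop() as string;
            if (current === request.id) {
                return true;
            }
            if (seen.has(current)) {
                continue;
            }
            seen.add(current);
            stack.push(...(this.nodes.get(current)?.parentRequestIDs ?? []));
        }
        return false;
    }
}
```

Processing would then repeatedly take getRootRequests(), send those requests in parallel, remove completed nodes (or cancel the descendants of failed ones), and repeat until the graph is empty.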

[Before / After diagrams of sequential vs. parallel request processing]

It's worth noting that this would be a pretty big change, so if we were going to roll this out I think the prudent thing to do would be to make the default behavior of the RequestGraph the same as the SequentialQueue. i.e: any time we add a request to the RequestGraph, we assume that its parentRequest is the last request added to the graph. Thus at first the graph ends up just being a linear queue as it is today.

flowchart LR
    subgraph first requests
        A(Request A)
    end

    subgraph second requests
        B(Request B)
    end

    subgraph third requests
        C(Request C)
    end

    subgraph fourth requests
        D(Request D)
    end

    subgraph fifth requests
        E(Request E)
    end

    subgraph sixth requests
        F(Request F)
    end

    A --> B --> C --> D --> E --> F
    

Only when we're certain which parentRequest a new request depends on would the developer call API.write and provide the ID of that parentRequest. This means the request can run as soon as its parent request is done, instead of having to wait for unrelated requests to finish.

flowchart LR
    subgraph first requests
        A(Request A)
    end

    subgraph second requests
        B(Request B)
        D(Request D)
    end

    subgraph third requests
        C(Request C)
    end

    subgraph fourth requests
        E(Request E)
    end

    subgraph fifth requests
        F(Request F)
    end

    A --> B --> C
    A --> D
    C --> E --> F
    

If we know that a request is independent of any other requests in the graph, we can provide an empty array for parentRequests, and that would allow it to run immediately the next time the RequestGraph processes.

flowchart LR
    subgraph first requests
        A(Request A)
        F(Request F)
    end

    subgraph second requests
        B(Request B)
        D(Request D)
    end

    subgraph third requests
        C(Request C)
    end

    subgraph fourth requests
        E(Request E)
    end

    A --> B --> C
    A --> D
    C --> E
    

@WojtekBoman
Contributor

@muttmuure Can you assign me and @kosmydel to this issue?

@melvin-bot

melvin-bot bot commented Oct 23, 2023

📣 @WojtekBoman! 📣
Hey, it seems we don’t have your contributor details yet! You'll only have to do this once, and this is how we'll hire you on Upwork.
Please follow these steps:

  1. Make sure you've read and understood the contributing guidelines.
  2. Get the email address used to login to your Expensify account. If you don't already have an Expensify account, create one here. If you have multiple accounts (e.g. one for testing), please use your main account email.
  3. Get the link to your Upwork profile. It's necessary because we only pay via Upwork. You can access it by logging in, and then clicking on your name. It'll look like this. If you don't already have an account, sign up for one here.
  4. Copy the format below and paste it in a comment on this issue. Replace the placeholder text with your actual details.
    Format:
Contributor details
Your Expensify account email: <REPLACE EMAIL HERE>
Upwork Profile Link: <REPLACE LINK HERE>

@kosmydel
Contributor

Comment to enable the assignment :)

@WojtekBoman
Contributor

@roryabraham I have one question about the first proposed approach to solving this problem. Is the reportId parameter sufficient to make requests idempotent? I checked the params of this request and, besides the reportId, it also accepts params such as emailList, accountIDList, etc. In the first approach, if we send the OpenReport request twice with the same reportID but different additional parameters, we will get the response only from the last sent request. Is this okay? I'd like to be sure that this is how it should work.

@roryabraham
Contributor

Is the reportId parameter sufficient to make requests idempotent?

It depends on the request. Each request may have its own idempotencyKey, and it's up to us as developers to determine what data is appropriate to use for the idempotencyKey.

In the first approach, if we send the OpenReport request twice with the same reportID but different additional parameters, we will get the response only from the last sent request. Is this okay?

I think that is indeed ok for OpenReport. The only data that's written in OpenReport is when the user last read the report, so we only care about the last call.

@roryabraham
Contributor

if other params than reportID are included that means we are creating a new report

@mountiny made a great point here that if we make a report optimistically, OpenReport will have more params that we don't want to just throw away.

So maybe, instead of just throwing away the earlier request and only keeping the later one, we should:

  • merge the params, optimisticData, successData, failureData of the later request into the earlier one
  • Keep the request in the same position in the queue that the earlier request had

@roryabraham roryabraham self-assigned this Oct 25, 2023
@roryabraham roryabraham added Weekly KSv2 and removed Monthly KSv2 labels Oct 25, 2023
@WojtekBoman
Contributor

WojtekBoman commented Oct 25, 2023

@roryabraham Okay, so I would like to define a strategy for merging data from the old and new OpenReport requests. For optimisticData, failureData and successData it's easy: we can merge the data by the key property, and objects from the old request that are not included in the new one get added to it. I have a couple of questions about how it should work when we want to merge params from two requests.

  1. How do we determine which parameters should be replaced between the old and new request? Take the OpenReport request: if we send it with a value defined for the emailList param and the next request is sent without it, what should the result of merging the two requests be? Should the merged request still have a value for that param?
  2. Is there a risk that merging request params will cause side effects?

@kosmydel
Contributor

Hey @roryabraham. Together with Wojciech Boman, we have another set of questions about the GraphQueue implementation.

General questions:

  • Is it possible that the request will have more than one parent? If so:
    • Is the following approach correct? We would wait until all of the parent requests are resolved and then proceed with the children. We are thinking about a counter that tracks how many parent requests a request is still waiting for.
    • In which cases can it happen?
  • How should we enable children to reference data from parent requests? In which cases can it be helpful?
  • How can a parent request update dependent children's data? In which cases can it happen?
  • From where would a developer obtain the requests’ IDs to pass to the parentRequests?

Idempotency questions:
What if an offline user makes two actions: post and then delete? Do we need to send these requests at all when we know that the second request cancels the first? For example, when we add and then remove an emoji from a text message.

@roryabraham
Contributor

How do we determine which parameters should be replaced between the old and new request? Take the OpenReport request: if we send it with a value defined for the emailList param and the next request is sent without it, what should the result of merging the two requests be? Should the merged request still have a value for that param?

Yes, I think the merged request should have the value for that param defined. Basically just use lodashMerge I think

Is there a risk that merging request params will cause side effects?

Possibly; we have to think about it and test it carefully. Of course, we should make this feature opt-in by providing an idempotencyKey only for requests that can safely be de-duped. So we take it one request at a time and migrate each carefully.
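
To make the merge strategy concrete, here is a hedged sketch along the lines discussed above: params are deep-merged with lodash's merge so the newer request's values win, Onyx update arrays are merged by key, and the result keeps the older request's queue position (all shapes and names are illustrative):

```ts
import lodashMerge from 'lodash/merge';

type OnyxUpdate = {onyxMethod: string; key: string; value: unknown};

type QueuedRequest = {
    command: string;
    data: Record<string, unknown>;
    optimisticData?: OnyxUpdate[];
    successData?: OnyxUpdate[];
    failureData?: OnyxUpdate[];
};

// Merge Onyx updates by key: updates from the newer request win, and updates
// only present in the older request are kept.
function mergeOnyxUpdates(older: OnyxUpdate[] = [], newer: OnyxUpdate[] = []): OnyxUpdate[] {
    const newerKeys = new Set(newer.map((update) => update.key));
    return [...older.filter((update) => !newerKeys.has(update.key)), ...newer];
}

// The merged request replaces the older one in place, so it keeps the older
// request's queue position while picking up the newer request's params.
function mergeDuplicateRequests(older: QueuedRequest, newer: QueuedRequest): QueuedRequest {
    return {
        ...older,
        data: lodashMerge({}, older.data, newer.data),
        optimisticData: mergeOnyxUpdates(older.optimisticData, newer.optimisticData),
        successData: mergeOnyxUpdates(older.successData, newer.successData),
        failureData: mergeOnyxUpdates(older.failureData, newer.failureData),
    };
}
```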

@roryabraham
Contributor

Is it possible that the request will have more than one parent?

I can't think of any cases when this would happen, but it seems likely that there may be such a case. I don't think there's much complexity that will be added by having parentRequests be an array of requests instead of just a single request. So instead of just parentRequest.then(...) it would be Promise.allSettled(parentRequests).then(...)
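
As a small sketch, waiting on multiple parents could look like this (assuming each in-flight parent request exposes a promise):

```ts
// Hypothetical: a child request waits for all of its parents before it is sent.
// allSettled is used so one failed parent doesn't reject the whole wait; the
// caller can then decide to cancel the descendants of any failed parent.
function waitForParents(parentPromises: Array<Promise<unknown>>): Promise<PromiseSettledResult<unknown>[]> {
    return Promise.allSettled(parentPromises);
}
```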

@roryabraham
Contributor

How should we enable children to reference data from parent requests? In which cases can it be helpful?

One case when it would be helpful is already solved in another way by the src/libs/Middleware/HandleUnusedOptimisticID middleware. If you disable the HandleUnusedOptimisticID middleware, you can reproduce the following issue:

  1. Alice and Bob have never chatted before.
  2. Alice goes offline
  3. Bob (online) sends a DM to Alice
  4. Alice sends several messages to Bob
  5. Alice comes back online, then:
    1. Due to some back-end code we have, OpenReport will succeed, but the optimistic reportID passed as a param is not used because the report already exists
    2. The queued AddComment requests from Alice will all fail, because they are referencing the unused optimistic reportID.

This could be a case when AddComment can reference data from the parent, eliminating the need for the slightly hacky HandleUnusedOptimisticID middleware
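
A rough illustration of how a child request might reference its parent's data at send time instead of a stale optimistic ID (everything here is hypothetical, not the actual middleware replacement):

```ts
// Hypothetical: the parent OpenReport node updates its data with the real
// reportID from the server response, and queued AddComment children read the
// reportID lazily when they are about to be sent.
type ParentRequest = {
    data: {reportID: string};
};

function buildAddCommentRequest(parent: ParentRequest, comment: string) {
    return {
        command: 'AddComment',
        // Resolved only after the parent has been processed, so we pick up the
        // real reportID even if the optimistic one went unused.
        getData: () => ({reportID: parent.data.reportID, reportComment: comment}),
        parentRequests: [parent],
    };
}
```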

@roryabraham
Contributor

roryabraham commented Oct 30, 2023

From where would a developer obtain the requests’ IDs to pass to the parentRequests?

I suppose it would be returned from API.write. Due to the design of our network code, that function should not return a promise, but could be updated to synchronously return the requestID
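
A tiny sketch of that (signatures and names are purely illustrative):

```ts
// Hypothetical: API.write adds the request to the graph and synchronously
// returns the new request's ID so callers can chain dependent requests.
type GraphRequest = {id: string; command: string; data: Record<string, unknown>; parentRequestIDs: string[]};

const requestGraph: GraphRequest[] = []; // stand-in for the real RequestGraph

let nextID = 0;
function write(command: string, data: Record<string, unknown>, parentRequestIDs: string[] = []): string {
    const requestID = `request_${nextID++}`;
    requestGraph.push({id: requestID, command, data, parentRequestIDs});
    return requestID; // returned synchronously; no promise involved
}

// Usage: AddComment depends on the OpenReport request that (optimistically) created the chat.
const openReportRequestID = write('OpenReport', {reportID: 'optimistic123'});
write('AddComment', {reportID: 'optimistic123', reportComment: 'Hi!'}, [openReportRequestID]);
```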

@melvin-bot melvin-bot bot added Reviewing Has a PR in review Weekly KSv2 and removed Weekly KSv2 labels Oct 31, 2023
@muttmuure
Contributor Author

#30425


melvin-bot bot commented Nov 22, 2023

⚠️ Looks like this issue was linked to a Deploy Blocker here

If you are the assigned CME please investigate whether the linked PR caused a regression and leave a comment with the results.

If a regression has occurred and you are the assigned CM follow the instructions here.

If this regression could have been avoided please consider also proposing a recommendation to the PR checklist so that we can avoid it in the future.


@melvin-bot melvin-bot bot changed the title [HOLD for payment 2023-11-30] The whole queue of offline requests are processed when you come back online, delaying online requests [HOLD for payment 2023-12-20] [HOLD for payment 2023-11-30] The whole queue of offline requests are processed when you come back online, delaying online requests Dec 13, 2023
@melvin-bot melvin-bot bot removed the Reviewing Has a PR in review label Dec 13, 2023

melvin-bot bot commented Dec 13, 2023

Reviewing label has been removed, please complete the "BugZero Checklist".


melvin-bot bot commented Dec 13, 2023

The solution for this issue has been 🚀 deployed to production 🚀 in version 1.4.11-25 and is now subject to a 7-day regression period 📆. Here is the list of pull requests that resolve this issue:

If no regressions arise, payment will be issued on 2023-12-20. 🎊

After the hold period is over and BZ checklist items are completed, please complete any of the applicable payments for this issue, and check them off once done.

  • External issue reporter
  • Contributor that fixed the issue
  • Contributor+ that helped on the issue and/or PR

For reference, here are some details about the assignees on this issue:

@melvin-bot melvin-bot bot added the Overdue label Dec 21, 2023
@roryabraham roryabraham changed the title [HOLD for payment 2023-12-20] [HOLD for payment 2023-11-30] The whole queue of offline requests are processed when you come back online, delaying online requests [HOLD for payment 2023-12-20] The whole queue of offline requests are processed when you come back online, delaying online requests Dec 22, 2023
@roryabraham
Contributor

New PR is on prod: #32246

@melvin-bot melvin-bot bot removed the Overdue label Dec 22, 2023
@roryabraham
Contributor

So I believe C+ payment is due here to @alitoshmatov

@roryabraham
Contributor

My mistake @alitoshmatov, just realized that we don't have anyone assigned to this issue to help issue payment for the C+ review of #32246. Let's get that sorted...

@melvin-bot melvin-bot bot removed the Overdue label Jan 1, 2024
@roryabraham roryabraham added NewFeature Something to build that is a new item. Overdue labels Jan 1, 2024

melvin-bot bot commented Jan 1, 2024

@melvin-bot melvin-bot bot removed the Overdue label Jan 1, 2024
@roryabraham
Contributor

@sakluger only action-item for you here is to issue a standard C+ review payment to @alitoshmatov for #32246. Thanks!

@sakluger sakluger added the External Added to denote the issue can be worked on by a contributor label Jan 3, 2024

melvin-bot bot commented Jan 3, 2024

Job added to Upwork: https://www.upwork.com/jobs/~01697b67b9aa3280da

@melvin-bot melvin-bot bot changed the title [HOLD for payment 2023-12-20] The whole queue of offline requests are processed when you come back online, delaying online requests [$500] [HOLD for payment 2023-12-20] The whole queue of offline requests are processed when you come back online, delaying online requests Jan 3, 2024
@melvin-bot melvin-bot bot added the Help Wanted Apply this label when an issue is open to proposals by contributors label Jan 3, 2024

melvin-bot bot commented Jan 3, 2024

Current assignee @alitoshmatov is eligible for the External assigner, not assigning anyone new.

@melvin-bot melvin-bot bot added Daily KSv2 and removed Weekly KSv2 labels Jan 3, 2024
@sakluger
Contributor

sakluger commented Jan 3, 2024

@alitoshmatov I sent you an offer on Upwork, please let me know once you've accepted. Thanks!

@alitoshmatov
Contributor

@sakluger Accepted the offer

@sakluger
Contributor

sakluger commented Jan 4, 2024

Completed payment, thanks!

@sakluger sakluger closed this as completed Jan 4, 2024