Process billing batch move db writes #230

StuAA78 · 2023-05-17T13:37:21Z

https://eaflood.atlassian.net/browse/WATER-3996

This PR aims to make ProcessBillingBatchService more performant by moving database writes out of the main loop. As part of this we undertake additional refactoring to clean up the main loop.

This PR aims to make `ProcessBillingBatchService` more performant by moving database writes out of the main loop.

In order to perform all db writing outside of the main processing loop, we want to change how we structure the stored billing data. The first part of this is refactoring our functions to avoid mutating data

Instead of having a `currentBillingData` object which we update as we iterate over the charge versions, we have a `billingData` object which is a keyed equivalent where the key is the billing invoice licence id and the value is effectively the `currentBillingData`. We update this as we iterate over the charge versions, then iterate over it after the loop to perform the db writing. Although this doesn't yet reduce the number of db writes, it does move them outside of the loop. As a bonus, we can simply update the desired billing invoice licence's data on each iteration and we no longer need to check for a change in id

The next step of separating out our db writes so we can do them in as few writes as possible is to refactor `_finaliseCurrentInvoiceLicence()` so that instead of reading the db to get previous transactions to cleanse then writing the results to the db, we have a separate `_generateCleansedTransactions()` function which does the reading and cleansing, leaving `_finaliseCurrentInvoiceLicence()` to do the writing

Instead of persisting data one item at a time as we iterate over `billingData`, we instead store all the transactions, invoices and invoice licences in arrays which we can then batch insert

When we loop over the billing data, we previously set the `isEmpty` flag to `false` before we add transactions, billing invoices and billing invoice licences to our `dataToPersist` object. Instead of setting the flag we can instead calculate it based on whether `dataToPersist.billingInvoiceLicences` is populated. We also rename it to `billingBatchIsEmpty` to make its intent a little clearer

On reflection, having a variable in our `go()` function to track whether the billing batch is empty isn't something we need. Instead, we refactor to pass the billing invoice licences through to `_finaliseBillingBatch()` and check in that function whether or not the array is empty

Previously we went through everything in the main loop, then checked to see if we should generate calculate transactions based on whether the charge version status is `current`. Since this last bit is the only part of the loop that does anything, we move the status check to the top of the loop so we can skip the iteration entirely if the status isn't `current`

Instead of blindly overwriting `billingData[billingInvoiceLicenceId]` with either the data we provided or some empty initial data, we check first to see if it exists and only write our defaults if it doesn't. This makes it clearer what we're doing (ie. "only set the initial data if something doesn't already exist")

We move the main loop into its own function and refactor it to use `.reduce()`

We move the billing data handling loop out of `go()` and into its own function

Cruikshanks

In my opinion I think we could pass around more specific values than the bulk data objects in some places. I also think with all that is going on in this service there are a few things we could do elsewhere.

But I also know we need to move some stuff out ASAP to support the multi-year work. And with that done what remains I think screams out for splitting into smaller, more focused services.

So, I'm keeping those details to myself and approving this! We can catch up after on the final refactoring/cleanup 😁

Other than that, this is going to really help 💪

StuAA78 added 13 commits May 15, 2023 17:18

ProcessBillingBatchService refactor - consolidate db writes

0354571

This PR aims to make `ProcessBillingBatchService` more performant by moving database writes out of the main loop.

Refactor out mutation of billing data

bd342d8

In order to perform all db writing outside of the main processing loop, we want to change how we structure the stored billing data. The first part of this is refactoring our functions to avoid mutating data

Batch insert data

bb4e550

Instead of persisting data one item at a time as we iterate over `billingData`, we instead store all the transactions, invoices and invoice licences in arrays which we can then batch insert

Refactor main loop

159790d

We move the main loop into its own function and refactor it to use `.reduce()`

Refactor billing data handling loop

e251791

We move the billing data handling loop out of `go()` and into its own function

Refactor into functions

5bb285f

Renaming and commenting

59d1261

StuAA78 added the housekeeping Refactoring, tidying up or other work which supports the project label May 17, 2023

StuAA78 self-assigned this May 17, 2023

Line length fix

73c712b

StuAA78 marked this pull request as ready for review May 17, 2023 14:01

Merge branch 'main' into process-billing-batch-move-db-writes

e47d796

StuAA78 requested review from Cruikshanks, Jozzey and Beckyrose200 May 17, 2023 15:50

Cruikshanks approved these changes May 17, 2023

View reviewed changes

Merge branch 'main' into process-billing-batch-move-db-writes

8ad66e0

StuAA78 merged commit 51fd91e into main May 17, 2023

StuAA78 deleted the process-billing-batch-move-db-writes branch May 17, 2023 18:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Process billing batch move db writes #230

Process billing batch move db writes #230

StuAA78 commented May 17, 2023 •

edited

Loading

Cruikshanks left a comment

Process billing batch move db writes #230

Process billing batch move db writes #230

Conversation

StuAA78 commented May 17, 2023 • edited Loading

Cruikshanks left a comment

Choose a reason for hiding this comment

StuAA78 commented May 17, 2023 •

edited

Loading