Reduce SQL calls when incrementing/decrementing run counters #881

bkiahstroud · 2023-10-30T15:58:39Z

Story

SQL logs before changes (using #increment!):

Bulkrax::ImporterRun Load (8.9ms)  SELECT  "bulkrax_importer_runs".* FROM "bulkrax_importer_runs" WHERE "bulkrax_importer_runs"."id" = $1 LIMIT $2  [["id", 1], ["LIMIT", 1]]
Bulkrax::ImporterRun Update All (2.5ms)  UPDATE "bulkrax_importer_runs" SET "processed_records" = COALESCE("processed_records", 0) + 1 WHERE "bulkrax_importer_runs"."id" = $1  [["id", 1]]

SQL logs after changes (using #increment_counter):

Bulkrax::ImporterRun Update All (6.5ms)  UPDATE "bulkrax_importer_runs" SET "processed_records" = COALESCE("processed_records", 0) + 1 WHERE "bulkrax_importer_runs"."id" = $1  [["id", 1]]

The SELECT statement serves no purpose since the UPDATE statement is atomic.

I haven't benchmarked these changes, but since fewer SQL statements seems like an obvious win and we update counters constantly, I'm hoping we'll see at least some speed improvement.



jeremyf · 2023-10-31T16:40:45Z

app/jobs/bulkrax/create_relationships_job.rb

-        # rubocop:disable Rails/SkipsModelValidations
-        Bulkrax::ImporterRun.find(importer_run_id).increment!(:processed_relationships, number_of_successes)
-        # rubocop:enable Rails/SkipsModelValidations
+        ImporterRun.connection.execute(<<-SQL)


Should this call ImporterRun.increment_counter(:processed_relationships, importer_run_id, amount: number_of_successes) ?

I don't like the raw SQL so far from the model.

Oops…the increment_counter is a Rails method. Hmm.

Wondering about using the update_counters method: https://api.rubyonrails.org/classes/ActiveRecord/CounterCache/ClassMethods.html#method-i-update_counters

#update_counters seems good to me. Same SQL, just gives you control over the amount, which is nice.

I'm curious, why do you not like raw SQL?

Does this raw SQL work in MySQL? All versions of Postgres? MSSQL? Oracle DB? SQLite? The old version does, and that's the reason we avoid raw SQL whenever we can.

Replaced the raw SQL with #update_counters and CI is still green so I think we're good!
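
For readers following the thread, here is a minimal sketch of the statement shape that `#update_counters` is expected to produce, modeled on the log lines in the PR description. `counter_update_sql` is a hypothetical helper written only for illustration; it is not part of Rails or Bulkrax, and the real SQL is generated by the database adapter, so quoting and bind parameters may differ.

```ruby
# Hypothetical helper (NOT a Rails or Bulkrax API): builds a single UPDATE
# statement in the same shape as the "Update All" log lines above.
# Assumption: negative amounts are rendered as a subtraction.
def counter_update_sql(table, id, counters)
  assignments = counters.map do |column, amount|
    operator = amount.negative? ? "-" : "+"
    %("#{column}" = COALESCE("#{column}", 0) #{operator} #{amount.abs})
  end
  %(UPDATE "#{table}" SET #{assignments.join(', ')} WHERE "#{table}"."id" = #{Integer(id)})
end

# The ActiveRecord call discussed in this thread would look roughly like:
#   Bulkrax::ImporterRun.update_counters(importer_run_id, processed_relationships: number_of_successes)
# No record is loaded first, so no SELECT is issued.
sql = counter_update_sql("bulkrax_importer_runs", 1, processed_relationships: 3)
puts sql
```

The point of the sketch is that the amount is folded into one statement, which is why no prior `find` is needed and why the call stays inside ActiveRecord rather than relying on hand-written, database-specific SQL.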

orangewolf · 2023-10-31T18:08:20Z

app/jobs/bulkrax/import_file_set_job.rb

@@ -21,14 +21,14 @@ def perform(entry_id, importer_run_id)
      entry.build
      if entry.succeeded?
        # rubocop:disable Rails/SkipsModelValidations
-        ImporterRun.find(importer_run_id).increment!(:processed_records)
-        ImporterRun.find(importer_run_id).increment!(:processed_file_sets)
+        ImporterRun.increment_counter(:processed_records, importer_run_id)


these are great

orangewolf

I'm tempted to say that avoiding the find isn't a good enough reason to go to raw SQL. If we can do it in an Active Record way without the find, great, but having the find is better than maintaining the raw query.

* main: (24 commits)
  Retry and delete take 2 (#894)
  🎁 Add `Bulkrax.persistence_adapter` (#895)
  💸 Mint v6.0.1 (#892)
  🐛 Fix #work_identifier_search_field logic (#891)
  💸 Bump to v6.0.0 (#889)
  make search string used to look up objects configurable (#884)
  💸 v5.5.0 (#888)
  unpin dry-monads. its not a dependency of bulkrax (#885)
  fix syntax error in ERB (#883)
  add support for Rails 6, Hyrax 4, and Blacklight 7 (#782)
  Reduce SQL calls when incrementing/decrementing run counters (#881)
  Update readme to remove references to samvera-labs (#880)
  add Compatibility section to readme (#879)
  🐛 Fix tabs for Hydra application (#875)
  Nav-tabs event scoping (#874)
  📚 Update docs in preparation for best practices seminar (#873)
  use the `GlobalID` library tooling to determine global id (#869)
  Avoid NoMethodError in Bulkrax::Importers::Controller#create. (#870)
  preparing to deploy v5.4.1 (#868)
  5.4.0-bug-fixes (#865)
  ...

bkiahstroud added the patch-ver for release notes label Oct 30, 2023

bkiahstroud added 6 commits October 30, 2023 11:36

replace #decrement! with #decrement_counter

e359ada

put rubocop disable comments back

ae65cb7

WIP: update counter-related specs

3edd18c

fix more specs

e99708e

make some counter specs more specific

658ff3c

remove references to "children" ImporterRun columns

d2d46d7

These columns (`processed_children` and `failed_children`) were renamed in the RenameChildrenCountersToRelationships migration; they no longer exist

bkiahstroud marked this pull request as ready for review October 31, 2023 15:57

jeremyf reviewed Oct 31, 2023


orangewolf reviewed Oct 31, 2023


orangewolf requested changes Oct 31, 2023


bkiahstroud added 2 commits October 31, 2023 11:18

prefer avoiding raw SQL

321ca84

rubocop

de3b9b4

bkiahstroud requested review from orangewolf and jeremyf October 31, 2023 19:34

jeremyf approved these changes Oct 31, 2023


bkiahstroud merged commit 1c39f9f into main Oct 31, 2023
6 checks passed

bkiahstroud deleted the SQLlence branch October 31, 2023 20:50
