Bug: Batch ordering - Regression introduced in version 13.26.1, insertAll() leaves batchMode=true enabled ... when it should not #3487

nPraml · 2024-10-02T12:20:17Z

We are currently updating from ebean 13 to 14 and encountered a bug; see the test case.

The entity Contact has a @ManyToOne property of type ContactGroup (without any cascade type).

In a transaction we store some elements using DB.insertAll(). Then we save a contact (without a group), create a group with a certain id, save it, and then assign the group to the contact.

On transaction flush, however, the order in the queue is not correct: the contact (now with its group) is saved first, but the group does not yet exist in the DB.

What we discovered during debugging: after DB.insertAll(), txn.batchMode = true is set (this was not the case with ebean 13) and during flush the objects contact and group are in the wrong order in the queue.

With ebean 13 you have to explicitly call txn.setBatchMode(true) for the test to fail.
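
The failure mode described above can be modelled without ebean. The following is a minimal sketch in plain Java, not ebean code: `ToyTxn`, `Contact` and `replay` are hypothetical names made up for illustration, and the "tables" are just in-memory sets. It assumes only what is reported here, namely that queued statements run in the order they were saved and read the bean's state at flush time. The later update of the contact is left out for brevity.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Toy model only (not ebean code): shows why flush order matters once batching is on. */
final class BatchOrderingSketch {

  /** Hypothetical stand-in for the Contact entity; groupId plays the foreign key. */
  static final class Contact {
    final int id;
    Integer groupId; // null means "no group"
    Contact(int id) { this.id = id; }
  }

  /** Hypothetical transaction that either executes a statement at once or queues it. */
  static final class ToyTxn {
    boolean batchMode;
    final Set<Integer> groupRows = new HashSet<>();   // stands in for the group table
    final Set<Integer> contactRows = new HashSet<>(); // stands in for the contact table
    private final List<Runnable> queue = new ArrayList<>();

    void saveGroup(int groupId) {
      submit(() -> groupRows.add(groupId));
    }

    void saveContact(Contact c) {
      submit(() -> {
        // The foreign key is read when the statement executes, not when save was called.
        if (c.groupId != null && !groupRows.contains(c.groupId)) {
          throw new IllegalStateException("FK violation: group " + c.groupId + " does not exist");
        }
        contactRows.add(c.id);
      });
    }

    private void submit(Runnable statement) {
      if (batchMode) {
        queue.add(statement);
      } else {
        statement.run();
      }
    }

    /** Executes queued statements in the order they were saved. */
    void flush() {
      List<Runnable> pending = new ArrayList<>(queue);
      queue.clear();
      pending.forEach(Runnable::run);
    }
  }

  /** Replays the reported steps; returns true if the flush succeeds. */
  static boolean replay(boolean batchModeLeftOn) {
    ToyTxn txn = new ToyTxn();
    txn.batchMode = batchModeLeftOn; // state left behind by the earlier insertAll()
    Contact contact = new Contact(1);
    txn.saveContact(contact); // contact saved without a group
    txn.saveGroup(42);        // group saved second
    contact.groupId = 42;     // group assigned to the contact afterwards
    try {
      txn.flush();
      return true;
    } catch (IllegalStateException e) {
      return false;
    }
  }

  public static void main(String[] args) {
    System.out.println("batchMode off: " + replay(false));
    System.out.println("batchMode on:  " + replay(true));
  }
}
```

With batching off, the contact row is written before the group is assigned, so nothing fails. With batching left on, the queued contact insert sees the group id at flush time and runs before the group insert.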

Could you please take a look and help us fix this?

Cheers,
Noemi

@rPraml FYI

nPraml · 2024-10-02T13:37:34Z

If I modify insertAll in the same way as saveAll, it fixes the test, but the test case still fails if I use trx.setBatchMode(true) instead of insertAll.

rbygrave · 2024-10-08T06:35:03Z

txn.flushBatchOnCollection()

Yes, this bug is a regression that was introduced in version 13.26.1 by #3324, at this line: https://github.com/ebean-orm/ebean/pull/3324/files#diff-0804968369579e408eea483a9e582485cd7faf94e5d78dc8f7a918c729962a70R1695

insertAll() should have that txn.flushBatchOnCollection() call.
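
As a rough illustration of the idea behind the fix, here is a minimal sketch in plain Java. It is a toy model, not ebean's implementation: `ToyTxn`, `escalateForCollection`, `flushAndRestore`, `insertAllLeaky` and `insertAllFixed` are hypothetical names, and the leaky variant models only the reported symptom (batch mode still on after the call). The assumption is that a collection call turns batching on temporarily and is expected to restore the previous mode when it finishes.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model only: the names below are hypothetical, not ebean's internal API. */
final class BatchEscalationSketch {

  static final class ToyTxn {
    boolean batchMode;         // batch mode currently in effect
    private boolean escalated; // true while a collection call has switched batching on
    final List<String> executed = new ArrayList<>();
    private final List<String> queue = new ArrayList<>();

    void save(String row) {
      if (batchMode) {
        queue.add(row);
      } else {
        executed.add(row);
      }
    }

    void flush() {
      executed.addAll(queue);
      queue.clear();
    }

    /** Switch batching on for a collection call, but only if it was off. */
    private void escalateForCollection() {
      if (!batchMode) {
        batchMode = true;
        escalated = true;
      }
    }

    /** Flush the collection's batch and undo the escalation, if there was one. */
    private void flushAndRestore() {
      if (escalated) {
        flush();
        batchMode = false;
        escalated = false;
      }
    }

    /** Models the reported symptom: batch mode stays on after the call. */
    void insertAllLeaky(List<String> rows) {
      escalateForCollection();
      rows.forEach(this::save);
      flush();
      // The restore step is missing here.
    }

    /** Models the intended behaviour: the previous batch mode is restored. */
    void insertAllFixed(List<String> rows) {
      escalateForCollection();
      rows.forEach(this::save);
      flushAndRestore();
    }
  }

  public static void main(String[] args) {
    ToyTxn leaky = new ToyTxn();
    leaky.insertAllLeaky(List.of("a", "b"));
    System.out.println("leaky batchMode after insertAll: " + leaky.batchMode);

    ToyTxn fixed = new ToyTxn();
    fixed.insertAllFixed(List.of("a", "b"));
    System.out.println("fixed batchMode after insertAll: " + fixed.batchMode);
  }
}
```

In this sketch, a transaction that the application explicitly put into batch mode stays in batch mode after `insertAllFixed`, since no escalation took place.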

> but the test case fails if I use ... trx.setBatchMode(true)

That is what I would expect.

The thinking here is that this isn't a case of "Cascading" persistence [where ebean is determining the "depth" and ordering based on the "depth"] but instead a case where both Contact and ContactGroup are explicitly saved as "Top Level" via database.save()

So when we have:

...
database.save(contact);
...
database.save(contactGroup);

The contact was saved before the contactGroup ... and these are both "Top Level" ... and so the ordering is then down to the order in which those are saved, which is contact first.

Does that make sense?

rPraml · 2024-10-08T08:23:56Z

Hello Rob, unfortunately the places in our code are a bit spread out. One part of the code creates the contacts, the other the contact groups. A third then takes care of mapping both.
Imagine there are 3 CSV files (contacts.csv, groups.csv and mapping.csv). I tried to model this in a unit test.

ebean-test/src/test/java/org/tests/batchinsert/TestBatchInsertFlush.java

rPraml · 2024-10-08T08:27:24Z

ebean-test/src/test/java/org/tests/batchinsert/TestBatchInsertFlush.java

+      mappingCsv.forEach((contactId, groupId) -> {
+        Contact contact = createdContacts.get(contactId);
+        contact.setGroup(DB.reference(ContactGroup.class, groupId));
+        DB.save(contact);


The last code part does some post-processing on some of the objects (the objects may not have been saved yet due to ebean batching).

Yes, there are several ways to solve this issue:

do a DB.saveAll(createdContacts.value())

disable batching

do a flush

write a better import-code 😉

I'm unsure if ebean can do much here. It would have to know or adjust the save order, but this may raise new problems.

> I'm unsure, if ebean can do much here

I know it's an example, but to me it's pretty clear that import code should first materialise and save the [ContactGroup] beans that will later be referenced by other beans [Contact].

Ebean gives the application exact control over this. With that control, this import can be done as batch inserts of ContactGroup followed by batch inserts of Contact with no actual updates required [optimal handling of this from the database perspective].
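
To make the "referenced beans first" ordering concrete, here is a minimal sketch in plain Java. It does not call ebean: `OrderedImportSketch` and `plan` are hypothetical names, the three maps stand in for the CSV files from the example, and the returned strings only describe the statements an import would issue. In real code the two phases would presumably be one batch insert of the groups followed by one batch insert of the contacts.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Toy model only: plans an import so that referenced rows are inserted first. */
final class OrderedImportSketch {

  /**
   * Returns the statements in execution order: all groups, then all contacts.
   * Each contact's group is resolved before its first save, so no updates are needed.
   */
  static List<String> plan(Map<Integer, String> contactsCsv,
                           Map<Integer, String> groupsCsv,
                           Map<Integer, Integer> mappingCsv) {
    List<String> statements = new ArrayList<>();
    // Phase 1: the beans that other beans will reference.
    groupsCsv.forEach((id, name) -> statements.add("insert group " + id));
    // Phase 2: the referencing beans, with the foreign key already set.
    contactsCsv.forEach((id, name) -> {
      Integer groupId = mappingCsv.get(id); // null if the contact has no group
      statements.add("insert contact " + id + " group=" + groupId);
    });
    return statements;
  }

  public static void main(String[] args) {
    Map<Integer, String> contacts = new LinkedHashMap<>();
    contacts.put(1, "Ann");
    contacts.put(2, "Bob");
    Map<Integer, String> groups = new LinkedHashMap<>();
    groups.put(10, "Staff");
    Map<Integer, Integer> mapping = new LinkedHashMap<>();
    mapping.put(1, 10);

    plan(contacts, groups, mapping).forEach(System.out::println);
  }
}
```

The plan contains inserts only. Reading the mapping before the contacts are saved is what removes the need for a later update per contact.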

You are going to have a hard time convincing me that ebean should change behaviour here.

What we want is for ebean to give developers the ability to control exactly how this works because at certain levels of scale we absolutely need that control. Trying to get "fancy" here would not end well.

> do a DB.saveAll(createdContacts.value())

So, do that and rely on ebean cascade persisting from Contact to ContactGroup [and then ebean will determine the ordering because we are now using cascade persist and ebean will get it right]. Yes, a good option.

> disable batching

At some scale people regret this. I say invest the time and think about ordering and stay with jdbc batch [and get orders of magnitude performance benefit from that investment]. I'd personally never go without batching here.

> do a flush

If we really must. Doing so implies that we'll see extra updates that the app would avoid if the ordering were correct. So generally not optimal.

> write a better import-code

IMO this is actually the best answer. Actually spend some time thinking about the ordering and design import processing with that in mind, looking to use JDBC batch and avoiding extra updates when possible.

rPraml · 2024-10-08T10:07:19Z

> You are going to have a hard time convincing me that ebean should change behaviour here.

Everything is fine, we just noticed this "regression" because with ebean 14 some unit tests in our application failed because batching was active after insertAll.

We have the situation here that many people work on our code base and (unfortunately) often produce suboptimal code, as in this case.

Fortunately, we also have a lot of unit tests, so we can find most of these errors quickly.

If you say it "works as planned", then I also have good arguments in our company that maybe some parts of the code need to be revised.

ebean-test/src/test/java/org/tests/batchinsert/TestBatchInsertFlush.java

rPraml · 2024-10-08T11:17:50Z

Stripped down the testcase

rob-bygrave · 2024-10-08T21:43:13Z

> Stripped down the testcase

Beautiful, love it, thanks !!

I'm happy with this PR. As this is a "Regression Bug" I think this should go into version 14.7.0 (not patch version).

So, wondering if there is anything that should go ahead into a 14.6.1 release or if the next release is 14.7.0 with this change. I suspect we are not going to have a 14.6.1 at this stage.

rbygrave assigned rbygrave and nPraml Oct 8, 2024

rbygrave added bug regression labels Oct 8, 2024

rbygrave changed the title ~~Bug: Batch ordering~~ Bug: Batch ordering - Regression introduced in version 13.26.1, insertAll() leaves batchMode=true enabled ... when it should not Oct 8, 2024

rPraml reviewed Oct 8, 2024

rPraml force-pushed the batch-ordering-bug branch from 3286276 to bf6c7e2 Compare October 8, 2024 11:13

rPraml added 2 commits October 8, 2024 13:16

Created test for batch escalation

a8418a3

Fix: after DB.inserAll, transaction may stay in batchmode

8b4804b

rPraml force-pushed the batch-ordering-bug branch from bf6c7e2 to 8b4804b Compare October 8, 2024 11:17

rbygrave added this to the 14.6.1 milestone Oct 10, 2024

rbygrave merged commit a2ee8b6 into ebean-orm:master Oct 10, 2024
1 check passed
