Feature / Add columnar data support in main model #712

Jerry-Jinfeng-Guo · 2024-09-05T13:46:20Z

input
output
update
integration test
~~data id field inference~~

Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

TonyXiang8787 · 2024-09-05T14:32:04Z

@Jerry-Jinfeng-Guo @mgovers @figueroa1395, from the user's perspective, the user would definitely like to provide a columnar batch dataset in a way that the id is not provided for a certain component. In that case, it should be inferred that the elements where attributes are to be updated via columnar buffer are in the exact same sequence of the input data. This is a realistic use-case and will be appreciated by the user, to save the additional step to just assign the exactly the same id as in the input data. The following Python code should work:

model = PowerGridModel(input_data=input_data)
result = model.calculate_power_flow(
    update_data={
        "sym_load": {"p_specified": np.random.randn(n_step, n_sym_load)}
    }
)

In the main core, we need to have special treatment in is_update_independent to make this work:

is_update_independent should be per component instead of the whole dataset. So we can allow individual sequence for each component.
For a certain component, if the buffer is row-based, we use the same logic as is now.
For a certain component, if the buffer is columnar, we do the following:
1. If id attribute buffer is provided, we look at id to judge if the component is independent or not. We do not need to create proxy stuff which is waste of time. Just directly look at id buffer.
2. If id attribute buffer is not provided:
  1. If the buffer is not uniform, or the buffer is uniform but elements_per_scenario is not the same as the number of elements in the input data (in the model). An error should be raised.
  2. If the above check passes, we assume the component buffer is independent. And we generate a sequence from 0 to n_comp for this component. This will be consumed by the update function so the update function does not do id lookup.

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

mgovers · 2024-09-05T16:41:17Z

@Jerry-Jinfeng-Guo @mgovers @figueroa1395, from the user's perspective, the user would definitely like to provide a columnar batch dataset in a way that the id is not provided for a certain component. In that case, it should be inferred that the elements where attributes are to be updated via columnar buffer are in the exact same sequence of the input data. This is a realistic use-case and will be appreciated by the user, to save the additional step to just assign the exactly the same id as in the input data. The following Python code should work:
model = PowerGridModel(input_data=input_data)
result = model.calculate_power_flow(
    update_data={
        "sym_load": {"p_specified": np.random.randn(n_step, n_sym_load)}
    }
)
In the main core, we need to have special treatment in is_update_independent to make this work:

is_update_independent should be per component instead of the whole dataset. So we can allow individual sequence for each component.

For a certain component, if the buffer is row-based, we use the same logic as is now.

For a certain component, if the buffer is columnar, we do the following:

If id attribute buffer is provided, we look at id to judge if the component is independent or not. We do not need to create proxy stuff which is waste of time. Just directly look at id buffer.

If id attribute buffer is not provided:

If the buffer is not uniform, or the buffer is uniform but elements_per_scenario is not the same as the number of elements in the input data (in the model). An error should be raised.

If the above check passes, we assume the component buffer is independent. And we generate a sequence from 0 to n_comp for this component. This will be consumed by the update function so the update function does not do id lookup.

I do think it's very valuable, but I also believe that it's a separate feature request and definitely out of scope of this PR

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

TonyXiang8787 · 2024-09-05T19:06:34Z

@Jerry-Jinfeng-Guo @mgovers @figueroa1395, from the user's perspective, the user would definitely like to provide a columnar batch dataset in a way that the id is not provided for a certain component. In that case,
...

I do think it's very valuable, but I also believe that it's a separate feature request and definitely out of scope of this PR

I agree to put this as a separate PR. But this feature needs to be part of the release 1.10.0, including its documentation and examples. I have now edited in #548 as step 7 of the task.

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

…lumnar-data Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

power_grid_model_c/power_grid_model/include/power_grid_model/main_core/input.hpp

Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

power_grid_model_c/power_grid_model/include/power_grid_model/main_model_impl.hpp

figueroa1395

Looks good to me, but I have a final question to get the full picture better:

Was the introduction of all iterator_like stuff necessary because the columnar stuff was implemented using boost iterator stuff? And since this boost iterator stuff doesn't fully support all the std::iterator concepts, then you had to mock it? And that is the reason why extending the use of columnar in main model was not trivial?

power_grid_model_c/power_grid_model/include/power_grid_model/common/grouped_index_vector.hpp

power_grid_model_c/power_grid_model/include/power_grid_model/common/iterator_like_concepts.hpp

power_grid_model_c/power_grid_model/include/power_grid_model/main_core/output.hpp

Jerry-Jinfeng-Guo

I think the subsequent changes make sense to me. I can not approve this PR since I created it. Could someone else give the approval if it looks good to you as well? @TonyXiang8787 @figueroa1395 @nitbharambe

mgovers · 2024-09-10T08:10:53Z

Looks good to me, but I have a final question to get the full picture better:

Was the introduction of all iterator_like stuff necessary because the columnar stuff was implemented using boost iterator stuff? And since this boost iterator stuff doesn't fully support all the std::iterator concepts, then you had to mock it? And that is the reason why extending the use of columnar in main model was not trivial?

yes indeed. The fundamental reason is that std::iterator requires you to return an exact reference, but a view only returns something that acts like a reference but is not necessarily one.

Jerry-Jinfeng-Guo · 2024-09-10T08:11:14Z

Was the introduction of all iterator_like stuff necessary because the columnar stuff was implemented using boost iterator stuff?

Since @mgovers made that change, I could only answer from my understanding. Making it typename ForwardIterator is technically working but provides much less control for us than ideal (the real forward iterator). Having one temporary one in place made it easier to switch once in the future said functionality is fully supported by boost.

Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

figueroa1395 · 2024-09-10T08:20:47Z

Since @mgovers made that change, I could only answer from my understanding. Making it typename ForwardIterator is technically working but provides much less control for us than ideal (the real forward iterator). Having one temporary one in place made it easier to switch once in the future said functionality is fully supported by boost.

But would it work though? I think not, because the boost iterator stuff doesn't yet provide range access as required in the newly introduced concepts. Specifically this:

yes indeed. The fundamental reason is that std::iterator requires you to return an exact reference, but a view only returns something that acts like a reference but is not necessarily one.

Hence triggering the changes from std::convertible_to to detail::assignable_to

But indeed this change would be easy to change once boost does.

I think the subsequent changes make sense to me. I can not approve this PR since I created it. Could someone else give the approval if it looks good to you as well? @TonyXiang8787 @figueroa1395 @nitbharambe

I will give approval. Feel free to resolve remaining open conversations.

mgovers · 2024-09-10T08:26:49Z

But would it work though? I think not, because the boost iterator stuff doesn't yet provide range access as required in the newly introduced concepts.

std::views::iota coming to the rescue for the IdxCount issue

sonarqubecloud · 2024-09-10T08:47:32Z

Quality Gate passed

Issues
3 New issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

mgovers and others added 2 commits September 5, 2024 13:30

add columnar input data support to main model

3fcbb47

Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

added columnar support for output

9393615

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

Jerry-Jinfeng-Guo self-assigned this Sep 5, 2024

Jerry-Jinfeng-Guo added the feature New feature or request label Sep 5, 2024

Jerry-Jinfeng-Guo mentioned this pull request Sep 5, 2024

Feature / Columnar data C API #711

Merged

6 tasks

Jerry-Jinfeng-Guo marked this pull request as draft September 5, 2024 13:55

Jerry-Jinfeng-Guo added 2 commits September 5, 2024 16:44

fix clang tidy complains

1fd5c36

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

[skip ci] added update data, not yet working

c497100

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

[skip ci] is_columnar needs update

3193790

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

Jerry-Jinfeng-Guo changed the title ~~Add columnar data support in main model~~ Feature / Add columnar data support in main model Sep 6, 2024

Jerry-Jinfeng-Guo and others added 4 commits September 6, 2024 10:40

Merge branch 'main' into feature/main-model-columnar-data

3607fb2

[skip ci] commit before hand over: issue marked in comment

9bd4524

Signed-off-by: Jerry Guo <Jerry.Jinfeng.Guo@alliander.com>

Merge remote-tracking branch 'origin/main' into feature/main-model-co…

f2a773a

…lumnar-data Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

add support for columnar update data

d92180d

Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

mgovers reviewed Sep 9, 2024

View reviewed changes

power_grid_model_c/power_grid_model/include/power_grid_model/main_core/input.hpp Outdated Show resolved Hide resolved

Merge branch 'main' into feature/main-model-columnar-data

92ff68d

mgovers marked this pull request as ready for review September 9, 2024 13:26

resolve sonar-cloud + clang-tidy

60a7be3

Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

mgovers enabled auto-merge September 10, 2024 06:02

Jerry-Jinfeng-Guo commented Sep 10, 2024

View reviewed changes

power_grid_model_c/power_grid_model/include/power_grid_model/main_model_impl.hpp Outdated Show resolved Hide resolved

figueroa1395 reviewed Sep 10, 2024

View reviewed changes

Jerry-Jinfeng-Guo commented Sep 10, 2024

View reviewed changes

minor cleanup

8e9effa

Signed-off-by: Martijn Govers <Martijn.Govers@Alliander.com>

figueroa1395 approved these changes Sep 10, 2024

View reviewed changes

mgovers added this pull request to the merge queue Sep 10, 2024

Merged via the queue into main with commit a05b88a Sep 10, 2024
26 checks passed

mgovers deleted the feature/main-model-columnar-data branch September 10, 2024 09:59

mgovers mentioned this pull request Nov 5, 2024

[Release] v1.10.0 #803

Closed

27 tasks

figueroa1395 mentioned this pull request Nov 7, 2024

[FEATURE] Add support columnar data buffer to save memory usage #548

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature / Add columnar data support in main model #712

Feature / Add columnar data support in main model #712

Uh oh!

Jerry-Jinfeng-Guo commented Sep 5, 2024 •

edited by mgovers

Loading

Uh oh!

TonyXiang8787 commented Sep 5, 2024

Uh oh!

mgovers commented Sep 5, 2024

Uh oh!

TonyXiang8787 commented Sep 5, 2024

Uh oh!

Uh oh!

Uh oh!

figueroa1395 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Jerry-Jinfeng-Guo left a comment

Uh oh!

mgovers commented Sep 10, 2024

Uh oh!

Jerry-Jinfeng-Guo commented Sep 10, 2024

Uh oh!

figueroa1395 commented Sep 10, 2024 •

edited

Loading

Uh oh!

mgovers commented Sep 10, 2024

Uh oh!

sonarqubecloud bot commented Sep 10, 2024

Uh oh!

Uh oh!

Uh oh!

Feature / Add columnar data support in main model #712

Feature / Add columnar data support in main model #712

Uh oh!

Conversation

Jerry-Jinfeng-Guo commented Sep 5, 2024 • edited by mgovers Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

TonyXiang8787 commented Sep 5, 2024

Uh oh!

mgovers commented Sep 5, 2024

Uh oh!

TonyXiang8787 commented Sep 5, 2024

Uh oh!

Uh oh!

Uh oh!

figueroa1395 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Jerry-Jinfeng-Guo left a comment

Choose a reason for hiding this comment

Uh oh!

mgovers commented Sep 10, 2024

Uh oh!

Jerry-Jinfeng-Guo commented Sep 10, 2024

Uh oh!

figueroa1395 commented Sep 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mgovers commented Sep 10, 2024

Uh oh!

sonarqubecloud bot commented Sep 10, 2024

Quality Gate passed

Uh oh!

Uh oh!

Uh oh!

Jerry-Jinfeng-Guo commented Sep 5, 2024 •

edited by mgovers

Loading

figueroa1395 commented Sep 10, 2024 •

edited

Loading