You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
In #4898 a new mechanism was added to RFile to compute bulk import load plans as the RFile is written. This new mechanism was implemented using completely new code that examines each key value written. There may be existing code in RFile that could be leveraged for this computation that may reduce the amount of work done per key value written.
Describe the solution you'd like
Determine if this code could be modified to help compute the load plan leveraging its tracking of first and last keys. Ideally this change would minimize the overall work done per key value when writing to a RFile.
Describe alternatives you've considered
It may be best to not make any changes at for this issue, its needs investigation.
The following are some reasons that maybe no changes should be made for this issue.
The performance impact of the code that does per key examination added in Offers new ways of computing bulk load plans #4898 is negligible compared to other parts of the rfile code write pipeline. Optimizing something that is not taking much time will not really speed up the overall write pipeline. Need to optimize the slowest parts to see measurable improvement.
The existing code is not well suited for the new task.
There too many existing layers of abstraction that would need to be broken to make the change.
Only want to make this change if it shows a measurable performance improvement and does not add tech debt to the code.
The text was updated successfully, but these errors were encountered:
Is your feature request related to a problem? Please describe.
In #4898 a new mechanism was added to RFile to compute bulk import load plans as the RFile is written. This new mechanism was implemented using completely new code that examines each key value written. There may be existing code in RFile that could be leveraged for this computation that may reduce the amount of work done per key value written.
Describe the solution you'd like
Determine if this code could be modified to help compute the load plan leveraging its tracking of first and last keys. Ideally this change would minimize the overall work done per key value when writing to a RFile.
Describe alternatives you've considered
It may be best to not make any changes at for this issue, its needs investigation.
The following are some reasons that maybe no changes should be made for this issue.
Only want to make this change if it shows a measurable performance improvement and does not add tech debt to the code.
The text was updated successfully, but these errors were encountered: