Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: more perf by offloading mork work to cores #3189

Merged
merged 2 commits into from
Sep 27, 2024
Merged

fix: more perf by offloading mork work to cores #3189

merged 2 commits into from
Sep 27, 2024

Conversation

tobymao
Copy link
Contributor

@tobymao tobymao commented Sep 26, 2024

No description provided.


model._data_hash = data_hash
model._metadata_hash = metadata_hash
model.update_schema(mapping_schema)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We wouldn't have to do this twice, if we were updating the schema prior to submitting a task like we used to

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we need to do it here on the computed mapping because it was done in a different process. we do it in the process so that we can leverage cores

two hot spots:

model.update_schema was expensive because we had to generate sql for every column

model._data_hash/_metadata_hash was expensive when calling context.snapshots
@tobymao tobymao enabled auto-merge (squash) September 27, 2024 04:47
@tobymao tobymao merged commit c07bead into main Sep 27, 2024
23 checks passed
@tobymao tobymao deleted the more_perf branch September 27, 2024 04:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants