Only initialize counterfitted_GLOVE_embedding when needed, massively decreasing ram usage #609

duesenfranz · 2022-02-09T18:30:22Z

What does this PR do?

Summary

By only instanciating the counterfitted_GLOVE_embedding when
necessary, the ram usage decreases by at least two gigabytes and the startup time decreases massively.

Changes

WordSwapEmbedding, WordEmbeddingDistance and ThoughtVector only initialize WordEmbedding.counterfitted_GLOVE_embedding upon initialization, not upon python parsing the files containing the class definitions. Initializing counterfitted_GLOVE_embedding means
- downloading large chunks of data on initial use
- loading a lot of data into ram, no matter whether it is ever used.

Checklist

[ x ] The title of your pull request should be a summary of its contribution.
[ x ] Please write detailed description of what parts have been newly added and what parts have been modified. Please also explain why certain changes were made.
[ x ] If your pull request addresses an issue, please mention the issue number in the pull request description to make sure they are linked (and people consulting the issue know you are working on it)
[ x ] To indicate a work in progress please mark it as a draft on Github.
[ x ] Make sure existing tests pass.
[ x ] Add relevant tests. No quality testing = no merge.
[ x ] All public methods must have informative docstrings that work nicely with sphinx. For new modules/files, please add/modify the appropriate .rst file in TextAttack/docs/apidoc.'

By only instanciating the `counterfitted_GLOVE_embedding` when necessary, the startup time gets cut by two thirds, while the ram usage decreases by at least two gigabytes.

qiyanjun · 2022-02-24T21:22:22Z

This looks fine. @cogeid mind to help me confirm?

cogeid · 2022-02-25T02:36:02Z

This looks fine. @cogeid mind to help me confirm?

yep! This can be merged once it passes all pytests.

duesenfranz · 2022-03-01T13:05:22Z

@cogeid thanks for pushing this forward! :) Is there anything else I should do to get this merged?

cogeid · 2022-03-01T21:57:29Z

@cogeid thanks for pushing this forward! :) Is there anything else I should do to get this merged?

Thank you for your contribution! It will be merged this week.

qiyanjun · 2022-03-13T19:52:39Z

@srujanjoshi mind to help me check one more on your mac, profiling the RAM size

Speed up startup, massively decrease ram usage

613362e

By only instanciating the `counterfitted_GLOVE_embedding` when necessary, the startup time gets cut by two thirds, while the ram usage decreases by at least two gigabytes.

format issue

92374d3

cogeid approved these changes Mar 1, 2022

View reviewed changes

qiyanjun merged commit 4c91157 into QData:master Mar 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only initialize counterfitted_GLOVE_embedding when needed, massively decreasing ram usage #609

Only initialize counterfitted_GLOVE_embedding when needed, massively decreasing ram usage #609

duesenfranz commented Feb 9, 2022 •

edited

Loading

qiyanjun commented Feb 24, 2022

cogeid commented Feb 25, 2022

duesenfranz commented Mar 1, 2022

cogeid commented Mar 1, 2022

qiyanjun commented Mar 13, 2022

Only initialize counterfitted_GLOVE_embedding when needed, massively decreasing ram usage #609

Only initialize counterfitted_GLOVE_embedding when needed, massively decreasing ram usage #609

Conversation

duesenfranz commented Feb 9, 2022 • edited Loading

What does this PR do?

Summary

Changes

Checklist

qiyanjun commented Feb 24, 2022

cogeid commented Feb 25, 2022

duesenfranz commented Mar 1, 2022

cogeid commented Mar 1, 2022

qiyanjun commented Mar 13, 2022

duesenfranz commented Feb 9, 2022 •

edited

Loading