This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

[LIGHT] Who Am I - Preventing doubleloading model #4742

Merged
merged 2 commits into from
Aug 15, 2022

Conversation

@JackUrb JackUrb commented Aug 12, 2022

Patch description
Made a small update to the PACER model plugin so that it can be added to other agents (like the ExpandedAttentionDecoderAndPacerAgent) without loading the base model twice. The plugin now checks whether the reranking model has already been initialized in some other form and, if so, reuses it.
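
For readers unfamiliar with the pattern, here is a minimal sketch of the "check before loading" idea described above. It is not the actual diff from this PR: `DummyReranker`, `PacerLikeMixin`, `CombinedAgent`, and the `reranker` attribute are all hypothetical names chosen for illustration, and the real plugin may detect an existing model differently.

```python
class DummyReranker:
    """Hypothetical stand-in for an expensive reranking model."""

    # Counts how many times a model has been "loaded" (constructed).
    load_count = 0

    def __init__(self):
        DummyReranker.load_count += 1


class PacerLikeMixin:
    """Hypothetical plugin that needs a reranker but may share an existing one."""

    def init_reranker(self):
        # Assumption: an already-initialized model lives on `self.reranker`.
        # Only load a new model if nothing has set one up yet.
        if getattr(self, "reranker", None) is None:
            self.reranker = DummyReranker()
        return self.reranker


class CombinedAgent(PacerLikeMixin):
    """Hypothetical agent that sets up the reranker before the plugin runs."""

    def __init__(self):
        # Another component initializes the reranker first.
        self.reranker = DummyReranker()
        # The plugin's init reuses that instance instead of loading again.
        self.init_reranker()


agent = CombinedAgent()
print(DummyReranker.load_count)
```

With the guard in place the model is constructed once per agent; without it, `init_reranker` would build a second copy and hold the extra memory.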

Testing steps
Before this change, the agent was consuming 1.2 GB more GPU memory than expected, and the logs showed the reranker model being loaded twice.

Model works correctly (as tested in the LIGHT game) after this change.

Other information

@klshuster klshuster left a comment

thanks for upstreaming this!

@JackUrb JackUrb merged commit c17f8ed into main Aug 15, 2022
@JackUrb JackUrb deleted the pacer-no-doubleload branch August 15, 2022 20:36