This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

Added shared embedding option to director model. #4763

Merged
merged 2 commits from shared_head_director into main
Sep 12, 2022

Conversation

leox1v
Contributor

@leox1v leox1v commented Aug 22, 2022

Added an option (with flag --director-use-shared-embedding) to share the generation and classification heads of the director. Only two additional parameters are introduced to scale the logits before applying the sigmoid (for the classification).

# Excerpt from the director model's __init__:
super().__init__(opt, dictionary, **kwargs)

vocabulary_size = len(dictionary)

decoder_output_dim = self.decoder.out_dim
self.classifier_heads = nn.Linear(decoder_output_dim, vocabulary_size)
self.use_shared_embedding = use_shared_embedding
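To illustrate the idea described above, here is a minimal sketch (not the PR's actual code) of a classifier head that reuses the generation head's weights and learns only two new scalar parameters to rescale the logits before the sigmoid. The class and parameter names (`SharedEmbeddingClassifierHead`, `scale`, `shift`) are assumptions for illustration.

```python
import torch
import torch.nn as nn

class SharedEmbeddingClassifierHead(nn.Module):
    """Hypothetical sketch of a shared-embedding classifier head.

    Instead of a separate |V| x d nn.Linear classifier, the decoder's
    generation head is reused; the only new parameters are a scalar
    scale and shift applied to the logits before the sigmoid.
    """

    def __init__(self, generation_head: nn.Linear):
        super().__init__()
        # Reuse the generation head's weights; no new projection matrix.
        self.generation_head = generation_head
        # The only two additional parameters.
        self.scale = nn.Parameter(torch.ones(1))
        self.shift = nn.Parameter(torch.zeros(1))

    def forward(self, decoder_output: torch.Tensor) -> torch.Tensor:
        logits = self.generation_head(decoder_output)
        # Per-token classification probabilities over the vocabulary.
        return torch.sigmoid(self.scale * logits + self.shift)

# Usage with a toy decoder output of shape (batch, seq, dim).
gen_head = nn.Linear(16, 100)  # d=16, |V|=100
head = SharedEmbeddingClassifierHead(gen_head)
probs = head(torch.randn(2, 5, 16))
assert probs.shape == (2, 5, 100)
```

This keeps the classifier's parameter count at two scalars regardless of vocabulary size, which is the saving the PR description points to.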
Contributor


If `use_shared_embedding` is passed in from opt, then you could make use of `self.use_shared_embedding = opt.get('use_shared_embedding', False)`?

Contributor Author


Thanks for the comment. I've changed it.

@leox1v leox1v merged commit 885eb21 into main Sep 12, 2022
@leox1v leox1v deleted the shared_head_director branch September 12, 2022 18:21

4 participants