Skip to content

eos_special_treatment_bug_fix#1

Open
JoachimSchaeffer wants to merge 2 commits intoApolloResearch:masterfrom
JoachimSchaeffer:eos_token_bugfix
Open

eos_special_treatment_bug_fix#1
JoachimSchaeffer wants to merge 2 commits intoApolloResearch:masterfrom
JoachimSchaeffer:eos_token_bugfix

Conversation

@JoachimSchaeffer
Copy link

Previously, the special treatment of the BOS token was not undone for ln2 and lnf.

Now, this is properly undone and a final check is performed after the removal process is completed.

Copy link

@stefan-apollo stefan-apollo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We discussed this in person back when Joachim noticed the bug, just leaving the comments for posteriority. We've fixed this bug (and another bug) in the follow-up code at https://github.com/submarat/removing-layer-norm.

Note: This bug could theoretically make the removal process work less well. However, the bug is not affecting the evals (all LNs removed) and apparently did not have any detrimental effect on performance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants