This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

dumping embeddings for closed-set experiments #4

Open
joocjun opened this issue Feb 21, 2023 · 0 comments

joocjun commented Feb 21, 2023

Hello!

I've been trying to dump the embeddings for the closed-set experiments. Unfortunately, dumping only works for the enwiki-0 corpus, which has a 0_valid file; the other corpora fail during the dumping process. The failure seems to occur while collating the 'is_valid' attribute in the datamodule, where dimension errors are raised. If I'm not mistaken, this line is the source of the trouble:

is_valid = [i for i, _id in enumerate(input_ids) if _id not in [0, 2]]

Each instance yields a different number of valid indices, so the resulting lists have non-uniform lengths, and this seems to be what causes the dimension error during collation.
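To illustrate the suspected failure mode, here is a minimal sketch in plain Python. It is not taken from the repository: the helper names (`valid_positions`, `valid_mask`, `pad_valid_positions`) and the toy `batch_input_ids` are hypothetical, and it assumes the collate step needs every per-instance list in a batch to have the same length. It shows that the quoted line produces ragged lists, and two possible ways to make them rectangular (a fixed-length boolean mask, or padding the index lists with a fill value).

```python
from typing import List, Tuple

# Ids excluded by the line quoted in this issue; what they denote
# depends on the tokenizer and is not assumed here.
EXCLUDED_IDS = (0, 2)


def valid_positions(input_ids: List[int]) -> List[int]:
    """Same logic as the quoted line: indices whose id is not 0 or 2."""
    return [i for i, _id in enumerate(input_ids) if _id not in EXCLUDED_IDS]


def valid_mask(input_ids: List[int]) -> List[bool]:
    """Alternative: a boolean mask with the same length as input_ids."""
    return [_id not in EXCLUDED_IDS for _id in input_ids]


def pad_valid_positions(
    batch: List[List[int]], fill: int = -1
) -> Tuple[List[List[int]], List[List[bool]]]:
    """Pad each index list to the longest one and return (padded, mask)."""
    max_len = max((len(row) for row in batch), default=0)
    padded = [row + [fill] * (max_len - len(row)) for row in batch]
    mask = [[True] * len(row) + [False] * (max_len - len(row)) for row in batch]
    return padded, mask


# Toy batch: both rows have length 6, but different numbers of non-excluded ids.
batch_input_ids = [
    [5, 6, 7, 2, 0, 0],
    [8, 9, 2, 0, 0, 0],
]

# Ragged result: the rows have lengths 3 and 2, so they cannot be stacked.
is_valid = [valid_positions(ids) for ids in batch_input_ids]

# Option 1: boolean masks, uniform whenever input_ids are uniform.
masks = [valid_mask(ids) for ids in batch_input_ids]

# Option 2: pad the index lists and keep a mask marking the real entries.
padded, pad_mask = pad_valid_positions(is_valid)
```

With either option, downstream code would have to ignore the masked-out or `fill` entries; whether that fits the repository's dumping code is something only the maintainers can confirm.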

At your earliest convenience, could you check whether this is a genuine error that needs to be fixed?

Thank you!
