Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lack of documents which are truncated without entity cutoff. #48

Open
GabrielKP opened this issue Mar 15, 2022 · 0 comments
Open

Lack of documents which are truncated without entity cutoff. #48

GabrielKP opened this issue Mar 15, 2022 · 0 comments
Labels
bug Something isn't working

Comments

@GabrielKP
Copy link
Collaborator

Currently it seems that when loading datasets all documents which are truncated, also are excluded because their entities supposedly have been truncated as well.
This means, no document which has all its entities and still has been truncated exists. This seems highly unlikely.

Is there a bug in the code?
Or could it be that for max_seq_len == 128 there indeed is no tacred example in which the entities are preserved but the text is truncated.

@GabrielKP GabrielKP added the bug Something isn't working label Mar 15, 2022
@GabrielKP GabrielKP changed the title Lack of documents which are truncated, but without entity cutoff. Lack of documents which are truncated without entity cutoff. Mar 15, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant