Lack of documents which are truncated without entity cutoff. #48

GabrielKP · 2022-03-15T14:55:21Z

Currently, when datasets are loaded, every document that gets truncated also seems to be excluded, supposedly because its entities were truncated as well.
This means there is no document that was truncated but still retains all of its entities, which seems highly unlikely.

Is there a bug in the code?
Or could it be that for `max_seq_len == 128` there is indeed no TACRED example in which the entities are preserved but the text is truncated?
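
To make the three cases concrete, here is a minimal sketch of the check I would expect to see. It is not the repository's actual loading code: the function name `truncation_status`, the `(start, end)` span format with an exclusive end index, and the assumption that the length limit applies directly to the token list (ignoring special tokens and subword splitting) are all hypothetical.

```python
from collections import Counter
from typing import List, Tuple


def truncation_status(
    tokens: List[str],
    entity_spans: List[Tuple[int, int]],
    max_seq_len: int,
) -> str:
    """Classify one example by how truncation affects it.

    Hypothetical helper: spans are (start, end) token indices, end exclusive,
    and truncation is assumed to keep only the first max_seq_len tokens.
    """
    if len(tokens) <= max_seq_len:
        return "not_truncated"
    # The text is cut, but every entity ends inside the kept window.
    if all(end <= max_seq_len for _, end in entity_spans):
        return "truncated_entities_kept"
    # At least one entity extends past the kept window.
    return "truncated_entity_cut"


# Toy data, not real TACRED examples.
examples = [
    (["tok"] * 10, [(0, 1), (3, 5)]),
    (["tok"] * 200, [(0, 1), (3, 5)]),
    (["tok"] * 200, [(0, 1), (127, 129)]),
]
counts = Counter(truncation_status(t, s, 128) for t, s in examples)
```

Running a count like this over the loaded dataset would show whether the `truncated_entities_kept` bucket is really empty for `max_seq_len == 128`, or whether those documents are being dropped by mistake.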

GabrielKP added the bug (Something isn't working) label Mar 15, 2022

GabrielKP changed the title ~~Lack of documents which are truncated, but without entity cutoff.~~ Lack of documents which are truncated without entity cutoff. Mar 15, 2022
