Vb/document entity annotation al 5106 #987

vbrodsky · 2023-03-12T22:00:41Z

Story: https://labelbox.atlassian.net/browse/AL-5106
Adding support for DocumentEntity proper annotation type

review-notebook-app · 2023-03-12T22:00:45Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

labelbox/data/annotation_types/ner/document_entity.py

tests/integration/annotation_import/conftest.py

whistler · 2023-03-13T15:00:56Z

tests/integration/annotation_import/conftest.py

+@pytest.fixture
+def configured_project_pdf_entity(client, ontology, rand_gen,
+                                  dataset_pdf_entity):
+    project = client.create_project(name=rand_gen(str),


Couldn't we reuse configured_project_without_data_rows fixture here? Also, we should prefer QueueMode.Batch over QueueMode.Dataset, QueueMode.Dataset is an older paradigm.

will look into both

this is done

tests/integration/conftest.py

whistler · 2023-03-13T23:03:43Z

examples/annotation_import/pdf.ipynb

Interesting that we're getting such a large diff. Maybe some of our commits don't have databooks running, perhaps we should add databooks as a CI job too. No action needed for now, I've made a note for myself.

also maybe the reason is that I ran my notebook via a jupyter server this time, and not VSC? Perhaps we should all standardize our notebook runtime

yes, thats a good idea, I've noticed using jupyter creates huge diffs compared to vscode

It may also be because someone working on this notebook did not have the pre-commit hooks installed.

totally possible as Andrea has updated this notebook

labelbox/data/annotation_types/ner/document_entity.py

whistler · 2023-03-14T13:25:50Z

Should we also have NDJSON serialization tests similar to other examples under tests/data/serialization/ndjson?

vbrodsky · 2023-03-14T16:11:55Z

Should we also have NDJSON serialization tests similar to other examples under tests/data/serialization/ndjson?

good idea

labelbox/data/annotation_types/ner/document_entity.py

whistler · 2023-03-14T13:17:01Z

labelbox/data/serialization/ndjson/objects.py

+    def to_common(self) -> DocumentEntity:
+        return TextEntity(name=self.name, text_selections=self.text_selections)
+
+        return obj.from_common(annotation.value, subclasses, annotation.name,


Realized that there are two returns here, would the second return ever be called?

whistler · 2023-03-14T13:18:11Z

labelbox/data/serialization/ndjson/objects.py

+    text_selections: List[DocumentTextSelection]
+
+    def to_common(self) -> DocumentEntity:
+        return TextEntity(name=self.name, text_selections=self.text_selections)


Should TextEntity be DocumentEntity since that's the return type?

whistler · 2023-03-14T16:22:45Z

examples/annotation_import/pdf.ipynb

yes, thats a good idea, I've noticed using jupyter creates huge diffs compared to vscode

Add DocumentEntity to tests Added integration test

whistler

Great work! Thanks for the updates!

vbrodsky force-pushed the VB/document-entity-annotation_AL-5106 branch 2 times, most recently from 482fa99 to 8200151 Compare March 12, 2023 22:03

vbrodsky requested review from kkim-labelbox and whistler March 12, 2023 22:04

mnoszczak reviewed Mar 12, 2023

View reviewed changes

labelbox/data/annotation_types/ner/document_entity.py Show resolved Hide resolved

vbrodsky force-pushed the VB/document-entity-annotation_AL-5106 branch from 6d2f6cc to 5a2c7ca Compare March 12, 2023 23:15

whistler reviewed Mar 13, 2023

View reviewed changes

vbrodsky force-pushed the VB/document-entity-annotation_AL-5106 branch 4 times, most recently from e5279df to cef16d0 Compare March 13, 2023 21:09

whistler reviewed Mar 13, 2023

View reviewed changes

whistler reviewed Mar 14, 2023

View reviewed changes

vbrodsky force-pushed the VB/document-entity-annotation_AL-5106 branch 2 times, most recently from b031fce to 3ed0e4e Compare March 14, 2023 19:07

vbrodsky added 7 commits March 14, 2023 12:11

Add DocumentEntity class

6b6b1cb

Add NDDocumentEntity class

53381d0

Update tests

d53098b

Add DocumentEntity to tests Added integration test

Refactor ner classes each in a separate file

9d1138c

Formatting

1ed861c

PR refactor test

cde2f2a

PR Get rid of camelcase

0c7be25

vbrodsky force-pushed the VB/document-entity-annotation_AL-5106 branch from 3ed0e4e to 59b060c Compare March 14, 2023 19:11

PR: add ndson test

0b4590f

vbrodsky force-pushed the VB/document-entity-annotation_AL-5106 branch from 59b060c to 0b4590f Compare March 14, 2023 19:36

whistler approved these changes Mar 14, 2023

View reviewed changes

vbrodsky merged commit 3be870d into develop Mar 14, 2023

vbrodsky deleted the VB/document-entity-annotation_AL-5106 branch March 14, 2023 20:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vb/document entity annotation al 5106 #987

Vb/document entity annotation al 5106 #987

vbrodsky commented Mar 12, 2023

review-notebook-app bot commented Mar 12, 2023

whistler Mar 13, 2023

vbrodsky Mar 13, 2023

vbrodsky Mar 14, 2023

whistler Mar 13, 2023

vbrodsky Mar 14, 2023

whistler Mar 14, 2023

kkim-labelbox Mar 14, 2023

vbrodsky Mar 14, 2023 •

edited

Loading

whistler commented Mar 14, 2023

vbrodsky commented Mar 14, 2023

whistler Mar 14, 2023

whistler Mar 14, 2023

vbrodsky Mar 14, 2023

whistler Mar 14, 2023

whistler left a comment

Vb/document entity annotation al 5106 #987

Vb/document entity annotation al 5106 #987

Conversation

vbrodsky commented Mar 12, 2023

review-notebook-app bot commented Mar 12, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vbrodsky Mar 14, 2023 • edited Loading

Choose a reason for hiding this comment

whistler commented Mar 14, 2023

vbrodsky commented Mar 14, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

whistler left a comment

Choose a reason for hiding this comment

vbrodsky Mar 14, 2023 •

edited

Loading