Prefilter graphs in the uploader #3497

davidsoergel · 2020-04-08T18:01:11Z

There is no point in uploading GraphDef data that won't be displayed anyway. In particular, the TensorBoard frontend filters out node attributes larger that 1024 bytes, since it has no good way to present those. So we may as well filter those out before upload to TensorBoard.dev, so as not to waste bandwidth, storage, and read-time processing.

caisq · 2020-04-09T16:41:03Z

tensorboard/dataclass_compat.py



-def migrate_event(event):
+def migrate_event(event, experimental_filter_graph=False):


Please add doc string for the new kwarg.

Oops, done.

caisq · 2020-04-09T16:45:11Z

tensorboard/backend/process_graph.py

      ValueError: If `large_attrs_key is None` while `limit_attr_size != None`.
      ValueError: If `limit_attr_size` is defined, but <= 0.
    """
+    # TODO(@davidsoergel): detect whether a graph has been filtered already


To make this TODO item clearer, I believe you can say in addition: "if it is already filtered, return immediately".

caisq · 2020-04-09T16:49:41Z

tensorboard/dataclass_compat_test.py


+    def test_graph_def_experimental_filter_graph(self):
+        # Create a `GraphDef` and write it to disk as an event.
+        logdir = self.get_temp_dir()


Question about testing strategy: is it really necessary to create a logdir and a writer? If your intention here is to simply test self._migrate_event() (and ultimately dataclass_compat.migrate_event()), you can just construct a Event, pass it to that method and check the result, right?

Oh good point. I had pasted that from the test above but here the logdir part is superfluous. (There too, maybe, but that's out of scope). Thanks!

caisq

LGTM. Thanks for reducing the size of the uploaded graphs.

Since #3497, we parse GraphDefs in dataclass_compat.py during upload. If a graph is corrupt, that parsing fails. Here we catch the resulting exception, issue a warning, and continue (omitting the graph). This also updates tests to use valid GraphDefs where appropriate, as opposed to bytes(1024), which apparently produces inconsistent results with different proto parsers (e.g., OSS vs. Google internal).

There is no point in uploading GraphDef data that won't be displayed anyway. In particular, the TensorBoard frontend filters out node attributes larger that 1024 bytes, since it has no good way to present those. So we may as well filter those out before upload to TensorBoard.dev, so as not to waste bandwidth, storage, and read-time processing.

Since tensorflow#3497, we parse GraphDefs in dataclass_compat.py during upload. If a graph is corrupt, that parsing fails. Here we catch the resulting exception, issue a warning, and continue (omitting the graph). This also updates tests to use valid GraphDefs where appropriate, as opposed to bytes(1024), which apparently produces inconsistent results with different proto parsers (e.g., OSS vs. Google internal).

There is no point in uploading GraphDef data that won't be displayed anyway. In particular, the TensorBoard frontend filters out node attributes larger that 1024 bytes, since it has no good way to present those. So we may as well filter those out before upload to TensorBoard.dev, so as not to waste bandwidth, storage, and read-time processing.

Since #3497, we parse GraphDefs in dataclass_compat.py during upload. If a graph is corrupt, that parsing fails. Here we catch the resulting exception, issue a warning, and continue (omitting the graph). This also updates tests to use valid GraphDefs where appropriate, as opposed to bytes(1024), which apparently produces inconsistent results with different proto parsers (e.g., OSS vs. Google internal).

Prefilter graphs in the uploader (stopgap proposal)

dd8e8dc

googlebot added the cla: yes label Apr 8, 2020

davidsoergel added 3 commits April 9, 2020 12:11

Add test and cleanup

bbe4a03

Remove redundant asserts

69dd0b6

Clarify test

2940eb9

davidsoergel requested a review from caisq April 9, 2020 16:25

Actually trigger the graph filtering in the uploader

ef7a10b

caisq reviewed Apr 9, 2020

View reviewed changes

Simplify test, and other reviewer comments

95da811

davidsoergel requested a review from caisq April 9, 2020 20:19

caisq approved these changes Apr 9, 2020

View reviewed changes

davidsoergel changed the title ~~Prefilter graphs in the uploader (stopgap proposal)~~ Prefilter graphs in the uploader Apr 10, 2020

davidsoergel merged commit 1397867 into master Apr 10, 2020

davidsoergel deleted the prefilter-graphs branch April 10, 2020 12:43

davidsoergel mentioned this pull request Apr 10, 2020

Detect and skip corrupt GraphDefs #3503

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Prefilter graphs in the uploader #3497

Prefilter graphs in the uploader #3497

Uh oh!

davidsoergel commented Apr 8, 2020

Uh oh!

caisq Apr 9, 2020

Uh oh!

davidsoergel Apr 9, 2020

Uh oh!

caisq Apr 9, 2020

Uh oh!

davidsoergel Apr 9, 2020

Uh oh!

caisq Apr 9, 2020

Uh oh!

davidsoergel Apr 9, 2020

Uh oh!

caisq left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants



		def migrate_event(event):
		def migrate_event(event, experimental_filter_graph=False):

Prefilter graphs in the uploader #3497

Prefilter graphs in the uploader #3497

Uh oh!

Conversation

davidsoergel commented Apr 8, 2020

Uh oh!

caisq Apr 9, 2020

Choose a reason for hiding this comment

Uh oh!

davidsoergel Apr 9, 2020

Choose a reason for hiding this comment

Uh oh!

caisq Apr 9, 2020

Choose a reason for hiding this comment

Uh oh!

davidsoergel Apr 9, 2020

Choose a reason for hiding this comment

Uh oh!

caisq Apr 9, 2020

Choose a reason for hiding this comment

Uh oh!

davidsoergel Apr 9, 2020

Choose a reason for hiding this comment

Uh oh!

caisq left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants