SHS-NG M4.3: Port StorageTab to the new backend. #46
Conversation
squito left a comment
I haven't looked at StreamBlocks before, and I have to say I'm totally perplexed even by the current implementation. It looks to me like it's just every block on the executors, and nothing to do with streaming at all (https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/storage/BlockStatusListener.scala#L94). I will look at that more. But what you have here seems consistent with that anyway.
I think my previous comment about executor memory remaining with multiple RDDs still applies here: you're only updating it for the one RDD that received an update, but the remaining memory is actually changing for all RDDs cached on this executor.
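To illustrate the point, here is a self-contained sketch (not code from this patch; `LiveExecutor`, `LiveRDD`, and the field names are made-up stand-ins): when any block on an executor changes, the remaining memory has to be recomputed for every RDD distribution on that executor, not only for the RDD that was updated.

```scala
object ExecMemorySketch {
  import scala.collection.mutable

  // Illustrative stand-ins for the live entities tracked by the listener.
  class LiveExecutor(val executorId: String, var maxMemory: Long, var memoryUsed: Long)
  class LiveRDDDistribution(val executorId: String, var memoryUsed: Long, var memoryRemaining: Long)
  class LiveRDD(val id: Int) {
    val distributions = mutable.HashMap[String, LiveRDDDistribution]()
  }

  val liveRDDs = mutable.HashMap[Int, LiveRDD]()

  // After any block update on `exec`, refresh the remaining memory for every
  // RDD that has a distribution on that executor, not just the updated RDD.
  def updateMemoryRemaining(exec: LiveExecutor): Unit = {
    val remaining = exec.maxMemory - exec.memoryUsed
    liveRDDs.values.foreach { rdd =>
      rdd.distributions.get(exec.executorId).foreach { dist =>
        dist.memoryRemaining = remaining
      }
    }
  }
}
```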
Isn't this whole if/else just liveUpdate(rdd)?
The update call needs an extra argument here...
So I don't forget: repeating the comment about adding a test for multiple RDDs on one executor.
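For reference, a rough sketch of what the body of such a test case could look like. The event classes below are existing Spark types; the `listener` under test and the assertions are left as commented placeholders, since that API belongs to the patch itself.

```scala
import org.apache.spark.scheduler.SparkListenerBlockUpdated
import org.apache.spark.storage._

// One executor, two different RDDs cached on it.
val bm = BlockManagerId("1", "localhost", 12345)

def cached(rddId: Int, split: Int, mem: Long): SparkListenerBlockUpdated = {
  SparkListenerBlockUpdated(
    BlockUpdatedInfo(bm, RDDBlockId(rddId, split), StorageLevel.MEMORY_ONLY, mem, 0L))
}

// listener.onBlockUpdated(cached(rddId = 1, split = 0, mem = 100L))
// listener.onBlockUpdated(cached(rddId = 2, split = 0, mem = 200L))
// Then assert that the executor's memory accounting reflects both RDDs, and
// that the remaining memory was refreshed for both (see the earlier comment).
```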
This required adding information about StreamBlockId to the store, which is not available yet via the API. So an internal type was added until there's a need to expose that information in the API.

The UI only lists RDDs that have cached partitions, and that information wasn't being correctly captured in the listener, so that's also fixed, along with some minor (internal) API adjustments so that the UI can get the correct data.

Because of the way partitions are cached, some optimizations w.r.t. how often the data is flushed to the store could not be applied to this code; because of that, some different ways to make the code more performant were added to the data structures tracking RDD blocks, with the goal of avoiding expensive copies when lots of blocks are being updated.
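As an illustration of the first point, a minimal sketch of what a store-only type for stream block data could look like; the class and field names here are assumptions, not necessarily what the patch uses. The idea is that the data is written to the KVStore but not yet surfaced through the public REST API.

```scala
import org.apache.spark.util.kvstore.KVIndex

// Internal-only representation of a stream block; not exposed via the API.
private[spark] class StreamBlockData(
    val name: String,
    val executorId: String,
    val hostPort: String,
    val storageLevel: String,
    val useMemory: Boolean,
    val useDisk: Boolean,
    val deserialized: Boolean,
    val memSize: Long,
    val diskSize: Long) {

  // Composite key: a stream block is unique per (block name, executor).
  @KVIndex
  def key: Array[String] = Array(name, executorId)
}
```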
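And a sketch of the kind of restructuring the last paragraph alludes to, using a hypothetical per-partition tracker (the names are illustrative): keep block state in mutable maps so each block update is a cheap in-place mutation, and only build immutable views when the RDD is flushed to the store.

```scala
import scala.collection.mutable

// Tracks which executors hold a given RDD partition and how much memory each uses.
class RDDPartitionTracker(val blockName: String) {
  private val executors = mutable.HashMap[String, Long]()

  // O(1) mutation per block update; no copies of the whole partition list.
  def update(executorId: String, memSize: Long): Unit = {
    if (memSize > 0) {
      executors(executorId) = memSize
    } else {
      executors.remove(executorId)
    }
  }

  // Built only when writing to the store, not on every block update.
  def executorList: Seq[String] = executors.keys.toSeq.sorted
}
```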