[WIP] Measurement for SPARK-16929. #17112
Test build #73654 has finished for PR 17112 at commit
The added unit test "Measurement for SPARK-16929." is the measurement. I measure the time spent getting the median duration at line 957 of TaskSetManager.scala.
if
if
As we can see, new algorithm( |
Put |
Test build #73889 has finished for PR 17112 at commit
* SPARK-16929: (178 commits)
  mod
  Refine test.
  scheduleAtFixedRate -> scheduleWithFixedDelay
  Change back to scheduleAtFixedRate
  Change some comment and unit tests.
  scheduleAtFixedRate -> scheduleWithFixedDelay
  Get rid of 'remove' and fix doc in MedianHeap
  [SPARK-16929] Improve performance when check speculatable tasks.
  [SPARK-19891][SS] Await Batch Lock notified on stream execution exit
  [SPARK-19008][SQL] Improve performance of Dataset.map by eliminating boxing/unboxing
  [SPARK-19886] Fix reportDataLoss if statement in SS KafkaSource
  [SPARK-19611][SQL] Introduce configurable table schema inference
  [SPARK-12334][SQL][PYSPARK] Support read from multiple input paths for orc file in DataFrameReader.orc
  [SPARK-19861][SS] watermark should not be a negative time.
  [SPARK-19715][STRUCTURED STREAMING] Option to Strip Paths in FileSource
  [SPARK-19793] Use clock.getTimeMillis when mark task as finished in TaskSetManager.
  [SPARK-19757][CORE] DriverEndpoint#makeOffers race against CoarseGrainedSchedulerBackend#killExecutors
  [SPARK-19561][SQL] add int case handling for TimestampType
  [SPARK-19763][SQL] qualified external datasource table location stored in catalog
  [SPARK-19859][SS][FOLLOW-UP] The new watermark should override the old one.
  ...
61b96ff to cfc7e33
Test build #74765 has finished for PR 17112 at commit
What changes were proposed in this pull request?
This PR is not intended for merging. It is a measurement for #16867, which stores successful taskIds in successfulTaskIdsSet, a TreeSet, so getting the median duration in checkSpeculatableTasks takes O(n/2) time.
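
To make the O(n/2) cost concrete, here is a minimal sketch in Java, not the code from #16867 or from this PR. It assumes the durations of successful tasks are kept in a sorted set and that the median is found by iterating from the smallest element to the middle one. For comparison it also includes a generic two-heap median, a common technique that reads the median in O(1). The commit list mentions a MedianHeap, but its implementation is not shown here, so TwoHeapMedian should not be read as that class. All names below (MedianSketch, newDurationSet, medianByWalk, TwoHeapMedian) are hypothetical.

```java
import java.util.Collections;
import java.util.Comparator;
import java.util.Iterator;
import java.util.PriorityQueue;
import java.util.TreeSet;

class MedianSketch {

    // Hypothetical stand-in for a sorted set of successful task durations.
    // Each element is {durationMs, taskId}; the taskId breaks ties so that
    // tasks with equal durations are not collapsed into one entry.
    static TreeSet<long[]> newDurationSet() {
        return new TreeSet<>(
            Comparator.<long[]>comparingLong(e -> e[0]).thenComparingLong(e -> e[1]));
    }

    // Median by walking the sorted set from its smallest element to the
    // element at index n/2: roughly n/2 iterator steps on every call.
    static long medianByWalk(TreeSet<long[]> durations) {
        if (durations.isEmpty()) {
            throw new IllegalStateException("no durations recorded");
        }
        Iterator<long[]> it = durations.iterator();
        long[] current = it.next();
        for (int i = 0; i < durations.size() / 2; i++) {
            current = it.next();
        }
        return current[0];
    }

    // Generic two-heap median (assumption: illustrative only).
    // 'lower' is a max-heap holding the smaller half, 'upper' is a min-heap
    // holding the larger half; 'upper' has the same size as 'lower' or one more.
    static final class TwoHeapMedian {
        private final PriorityQueue<Long> lower =
            new PriorityQueue<>(Collections.reverseOrder());
        private final PriorityQueue<Long> upper = new PriorityQueue<>();

        // O(log n) per insertion.
        void insert(long duration) {
            if (upper.isEmpty() || duration >= upper.peek()) {
                upper.add(duration);
            } else {
                lower.add(duration);
            }
            // Rebalance so the size invariant above holds again.
            if (upper.size() > lower.size() + 1) {
                lower.add(upper.poll());
            } else if (lower.size() > upper.size()) {
                upper.add(lower.poll());
            }
        }

        // O(1): the smallest element of the larger half is the element at
        // sorted index n/2, the same element medianByWalk returns.
        long median() {
            if (upper.isEmpty()) {
                throw new IllegalStateException("no durations recorded");
            }
            return upper.peek();
        }
    }
}
```

Both variants return the element at sorted index n/2 (the upper median when n is even), so they can be compared directly. The difference is where the cost falls: the sorted-set walk pays about n/2 steps each time the median is read, while the two-heap structure pays O(log n) on each insert and reads the median in constant time.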