Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Profiling tool support for collection and analysis #2590

Merged
merged 35 commits into from
Jun 4, 2021

Conversation

tgravescs
Copy link
Collaborator

Adds in support for more profiling. printing rapids jar, printing sql plane metrics, then everything under Analysis - job and stage metrics and sql metrics aggregated and a check for shuffle skew.

Most of these were already in the code so it was just modifying a bit and then adding tests.

@tgravescs tgravescs added the feature request New feature or request label Jun 4, 2021
@tgravescs tgravescs added this to the May 24 - Jun 4 milestone Jun 4, 2021
@tgravescs tgravescs self-assigned this Jun 4, 2021
@tgravescs
Copy link
Collaborator Author

I'll upmerge to #2588 once its merged

@tgravescs
Copy link
Collaborator Author

build

@tgravescs
Copy link
Collaborator Author

build

@tgravescs tgravescs merged commit e65e826 into NVIDIA:branch-21.06 Jun 4, 2021
@tgravescs tgravescs deleted the profilingToolImp branch June 4, 2021 23:03
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
* Qualification tool

Signed-off-by: Thomas Graves <tgraves@apache.org>

* remove unused func

* Add missing files

* Add checks for format option

* cast columsn to string to write to text

* Revert "Add checks for format option"

This reverts commit 6f5271c.

* cleanup

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* update output dir

* formating

* Update help messages

* update app name

* cleanup

* put test functions back

* fix typo

* add printSQLPlanMetrics and printRapidsJar

* use opt

* Add Analysis

* format output

* more tests

Signed-off-by: Thomas Graves <tgraves@apache.org>

* tests working

* test rearrange utils

* move test file

* move test file right location

* add Analysis Suite

* update test analysis

* add
rapids-4-spark-tools/src/test/resources/ProfilingExpectations/rapids_join_eventlog_jobandstagemetrics_expectation.csv

* add more tests

* more tests

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* remove unneeded expectation file

* Add more analysis tests

* comment

* cleanup

* fix logging include
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
* Qualification tool

Signed-off-by: Thomas Graves <tgraves@apache.org>

* remove unused func

* Add missing files

* Add checks for format option

* cast columsn to string to write to text

* Revert "Add checks for format option"

This reverts commit 6f5271c.

* cleanup

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* update output dir

* formating

* Update help messages

* update app name

* cleanup

* put test functions back

* fix typo

* add printSQLPlanMetrics and printRapidsJar

* use opt

* Add Analysis

* format output

* more tests

Signed-off-by: Thomas Graves <tgraves@apache.org>

* tests working

* test rearrange utils

* move test file

* move test file right location

* add Analysis Suite

* update test analysis

* add
rapids-4-spark-tools/src/test/resources/ProfilingExpectations/rapids_join_eventlog_jobandstagemetrics_expectation.csv

* add more tests

* more tests

Signed-off-by: Thomas Graves <tgraves@nvidia.com>

* remove unneeded expectation file

* Add more analysis tests

* comment

* cleanup

* fix logging include
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants