Skip to content

Conversation

@zuston
Copy link
Member

@zuston zuston commented Sep 22, 2022

What changes were proposed in this pull request?

Set the default disk capacity to the total space

Why are the changes needed?

When shuffle-servers are colocated with NodeManagers, the disk capacity is hard to set. Because when it's equal with the total space, it will cause the most shuffle-servers startup fail due to free-space checking. If not, it will waste some disk resources.

And the default disk capacity is 1TB, it's not out-of-box value. So this PR is to set the default value of 'rss.server.disk.capacity' to the disk total space.

Does this PR introduce any user-facing change?

No

How was this patch tested?

@zuston
Copy link
Member Author

zuston commented Sep 22, 2022

This problem occurred in our internal uniffle cluster. PTAL @jerqi

@jerqi
Copy link
Contributor

jerqi commented Sep 22, 2022

Should we modify the document at the same time?

@zuston
Copy link
Member Author

zuston commented Sep 22, 2022

Should we modify the document at the same time?

Yes. If this direction is OK, I will go ahead to add more tests and doc.

@jerqi
Copy link
Contributor

jerqi commented Sep 22, 2022

It's ok for me.

@codecov-commenter
Copy link

codecov-commenter commented Sep 22, 2022

Codecov Report

Merging #237 (93fbbf6) into master (f63cebb) will decrease coverage by 0.02%.
The diff coverage is 37.50%.

@@             Coverage Diff              @@
##             master     #237      +/-   ##
============================================
- Coverage     59.16%   59.14%   -0.03%     
  Complexity     1332     1332              
============================================
  Files           160      160              
  Lines          8738     8740       +2     
  Branches        820      821       +1     
============================================
- Hits           5170     5169       -1     
- Misses         3303     3305       +2     
- Partials        265      266       +1     
Impacted Files Coverage Δ
...rg/apache/uniffle/storage/common/LocalStorage.java 42.75% <16.66%> (-0.91%) ⬇️
...a/org/apache/uniffle/server/ShuffleServerConf.java 99.18% <100.00%> (-0.01%) ⬇️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@zuston zuston closed this Sep 22, 2022
@zuston zuston reopened this Sep 22, 2022
@zuston zuston requested a review from jerqi September 23, 2022 10:03
Copy link
Contributor

@jerqi jerqi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks @zuston

@jerqi jerqi merged commit f0fbdc4 into apache:master Sep 24, 2022
@zuston
Copy link
Member Author

zuston commented Sep 24, 2022

Thanks @jerqi

@zuston zuston deleted the disk branch September 24, 2022 07:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants