[Improvement] Optimize retry logic in `ShuffleServerGrpcClient#sendShuffleData`

### Code of Conduct

- [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)


### Search before asking

- [X] I have searched in the [issues](https://github.com/apache/incubator-uniffle/issues?q=is%3Aissue) and found no similar issues.


### What would you like to be improved?

Now in `org.apache.uniffle.client.impl.grpc.ShuffleServerGrpcClient#sendShuffleData`, it will retry to send to one shuffle server for a long time and fail after reach `rss.client.send.check.timeout.ms`. Exception as follows:

`Timeout: Task[2852_0] failed because 200 blocks can't be sent to shuffle server in 600000 ms.`

This will cause that client will not send data to other servers.

### How should we improve?

1. Don't retry in `requirePreAllocation` and just retry in upper level
2. Set the default value of `rss.client.retry.max` to a smaller value, such as 10.

### Are you willing to submit PR?

- [X] Yes I am willing to submit a PR!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Improvement] Optimize retry logic in `ShuffleServerGrpcClient#sendShuffleData` #339

Code of Conduct

Search before asking

What would you like to be improved?

How should we improve?

Are you willing to submit PR?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Improvement] Optimize retry logic in ShuffleServerGrpcClient#sendShuffleData #339

Description

Code of Conduct

Search before asking

What would you like to be improved?

How should we improve?

Are you willing to submit PR?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

[Improvement] Optimize retry logic in `ShuffleServerGrpcClient#sendShuffleData` #339