Skip to content

Commit

Permalink
Change default max-concurrent-requests value to 1024 (#748)
Browse files Browse the repository at this point in the history
  • Loading branch information
ajtejankar authored Jan 29, 2025
1 parent 6580d24 commit ee786c9
Show file tree
Hide file tree
Showing 3 changed files with 3 additions and 3 deletions.
2 changes: 1 addition & 1 deletion docs/reference/launcher.md
Original file line number Diff line number Diff line change
Expand Up @@ -78,7 +78,7 @@ Options:
The maximum amount of concurrent requests for this particular deployment. Having a low limit will refuse clients requests instead of having them wait for too long and is usually good to handle backpressure correctly

[env: MAX_CONCURRENT_REQUESTS=]
[default: 128]
[default: 1024]

--max-best-of <MAX_BEST_OF>
This is the maximum allowed value for clients to set `best_of`. Best of makes `n` generations at the same time, and return the best in terms of overall log probability over the entire generated sequence
Expand Down
2 changes: 1 addition & 1 deletion launcher/src/main.rs
Original file line number Diff line number Diff line change
Expand Up @@ -397,7 +397,7 @@ struct Args {
/// The maximum amount of concurrent requests for this particular deployment.
/// Having a low limit will refuse clients requests instead of having them
/// wait for too long and is usually good to handle backpressure correctly.
#[clap(default_value = "128", long, env)]
#[clap(default_value = "1024", long, env)]
max_concurrent_requests: usize,

/// This is the maximum allowed value for clients to set `best_of`.
Expand Down
2 changes: 1 addition & 1 deletion router/src/main.rs
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ use tracing_subscriber::{EnvFilter, Layer};
#[derive(Parser, Debug)]
#[clap(author, version, about, long_about = None)]
struct Args {
#[clap(default_value = "128", long, env)]
#[clap(default_value = "1024", long, env)]
max_concurrent_requests: usize,
#[clap(default_value = "2", long, env)]
max_best_of: usize,
Expand Down

0 comments on commit ee786c9

Please sign in to comment.