Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add rate limiting configuration for LLM providers #276

Open
wants to merge 6 commits into
base: main
Choose a base branch
from

Conversation

hoangnb24
Copy link
Contributor

  • Introduce rate limit parameters (requests/sec and max bucket size) for all LLM providers
  • Update webui.py to include new rate limit inputs in UI and function signatures
  • Modify utils.py to create InMemoryRateLimiter for each LLM provider
  • Add rate limiter configuration to all supported LLM models

hoangnb24 and others added 2 commits February 12, 2025 11:29
- Introduce rate limit parameters (requests/sec and max bucket size) for all LLM providers
- Update webui.py to include new rate limit inputs in UI and function signatures
- Modify utils.py to create InMemoryRateLimiter for each LLM provider
- Add rate limiter configuration to all supported LLM models
Add rate limiting configuration for LLM providers
@CLAassistant
Copy link

CLAassistant commented Feb 12, 2025

CLA assistant check
All committers have signed the CLA.

hoangnb24 and others added 4 commits February 12, 2025 12:00
- Move rate limiter creation to a single location at the beginning of the function
- Update parameter names for specific providers (e.g., Mistral's `model` instead of `model_name`)
- Fix Google provider's API key parameter to `google_api_key`
- Remove redundant configuration options and simplify model initialization
Refactor rate limiter initialization in LLM provider configuration
Conflict files:
- webui.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants