feat: Add GPT OSS 20B and 120B #145
Conversation
Force-pushed from 40a1762 to bedab1b, then from bedab1b to a728f91.
```python
limit = MODEL_CONCURRENT_RATE_LIMIT.get(
    chat_request.model, MODEL_CONCURRENT_RATE_LIMIT.get("default", 50)
)
```
This change is the most relevant. If MODEL_CONCURRENT_RATE_LIMIT has no entry for the requested model, the lookup falls back to the "default" entry, which should work for any model, and to 50 if even that entry is missing. This prevents a failure state in most cases.
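As a minimal sketch of why this matters (the model name and per-model limit below are made up for illustration, not taken from the PR): a direct dictionary index raises KeyError for any model missing from the map, while the nested `.get()` calls always resolve to a usable limit.

```python
MODEL_CONCURRENT_RATE_LIMIT = {
    "default": 50,
    "openai/gpt-oss-20b": 10,  # hypothetical per-model limit
}

requested_model = "some/unlisted-model"  # stands in for chat_request.model

# Direct indexing would raise KeyError here:
#   limit = MODEL_CONCURRENT_RATE_LIMIT[requested_model]

# The fallback lookup never raises: it uses the "default" entry,
# or 50 if even "default" is absent from the map.
limit = MODEL_CONCURRENT_RATE_LIMIT.get(
    requested_model, MODEL_CONCURRENT_RATE_LIMIT.get("default", 50)
)
print(limit)  # -> 50
```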
Pull Request Overview
This PR adds support for GPT OSS 20B and 120B models by creating their docker compose configurations, while also implementing defensive programming fixes for model validation and rate limiting.
- Adds docker compose files for GPT OSS 20B and 120B model deployments
- Implements null/empty string validation for model IDs in the state management (see the sketch after this list)
- Replaces exception-based rate limiting with default fallback logic
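The state-management change itself is not shown in this excerpt; the guard presumably looks something like the following sketch, where the function name and error type are assumptions rather than code from the PR:

```python
# Hypothetical sketch of the model_id guard described above; the real
# function in nilai-api/src/nilai_api/state.py may be named differently.
def get_model_state(model_id: str | None):
    if not model_id:  # rejects both None and the empty string ""
        raise ValueError("model_id must be a non-empty string")
    # ... continue with the normal lookup against the application state ...
    return model_id
```

Rejecting None and "" up front keeps a malformed request from propagating deeper into the state lookup.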
Reviewed Changes
Copilot reviewed 8 out of 9 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| nilai-api/src/nilai_api/state.py | Adds null/empty validation for model_id parameter |
| nilai-api/src/nilai_api/routers/private.py | Replaces KeyError exception with default fallback for rate limits |
| nilai-api/src/nilai_api/config/config.yaml | Adds rate limit configuration for the new GPT OSS 20B model and a default entry (see the sketch after this table) |
| docker/vllm.Dockerfile | Updates base image to custom jcabrero/vllm version |
| docker/compose/docker-compose.gpt-20b-gpu.yml | New docker compose configuration for GPT OSS 20B |
| docker/compose/docker-compose.gpt-120b-gpu.yml | New docker compose configuration for GPT OSS 120B |
| .env.sample | Adds BRAVE_SEARCH_API environment variable |
| .env.ci | Adds BRAVE_SEARCH_API environment variable for CI |
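The YAML added to config.yaml is not shown in this excerpt. Assuming a PyYAML-style load and purely illustrative key names, model names, and limits (none of these are taken from the PR), the per-model map might be consumed roughly like this:

```python
# Hedged sketch: loading a per-model concurrency map from YAML.
# The key name, model names, and limits below are assumptions for illustration.
import yaml  # PyYAML

config_text = """
model_concurrent_rate_limit:
  default: 50
  openai/gpt-oss-20b: 10
"""

config = yaml.safe_load(config_text)
MODEL_CONCURRENT_RATE_LIMIT = config["model_concurrent_rate_limit"]

# Unknown models resolve to the default limit rather than raising.
print(MODEL_CONCURRENT_RATE_LIMIT.get(
    "unknown-model", MODEL_CONCURRENT_RATE_LIMIT["default"]))  # -> 50
```

Keeping a "default" key in the same map is what lets the rate-limiting code above avoid special-casing newly added models.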
Force-pushed from fd7c6eb to f7a02cd.
This PR adds docker compose files for GPT OSS 20B and 120B. It also includes two small fixes: null/empty validation of model IDs in the state management, and a default fallback for per-model concurrent rate limits instead of raising an exception.