feat(provider): add Anthropic max_tokens and thinking budget override settings by ding113 · Pull Request #689 · ding113/claude-code-hub

ding113 · 2026-01-31T09:13:32Z

Summary

Add two new provider-level settings for claude and claude-auth type providers that allow overriding Anthropic API request parameters:

anthropicMaxTokensPreference: Override max_tokens field (range: 1-64,000 or "inherit")
anthropicThinkingBudgetPreference: Override thinking.budget_tokens field (range: 1,024-32,000 or "inherit")

Key Behaviors

When thinking budget is set, thinking.type is automatically forced to "enabled"
Runtime clamping: if budget_tokens >= max_tokens, it's clamped to max_tokens - 1
UI shows these fields only for claude/claude-auth provider types
Includes "Max Out (32000)" quick-fill button for convenience
Values stored as JSON strings ("inherit" or numeric like "32000")

Changes

Core Logic

src/lib/anthropic/provider-overrides.ts - Override logic with audit trail support

Tests

tests/unit/proxy/anthropic-provider-overrides.test.ts - 36 unit tests covering all scenarios

Database

drizzle/0060_bored_gertrude_yorkes.sql - Migration adding two varchar(20) columns

Type System & Repository

src/types/provider.ts - Added types to Provider, ProviderDisplay, Create/Update interfaces
src/drizzle/schema.ts - Added database columns
src/repository/provider.ts + transformers.ts - Field mappings

UI & Forms

routing-section.tsx - New UI section with toggle, input, validation
provider-form-context.tsx + provider-form-types.ts - Form state management
provider-form/index.tsx - Form submission handling

i18n

Added translations for 5 languages (en, zh-CN, zh-TW, ja, ru)

Proxy Integration

src/app/v1/_lib/proxy/forwarder.ts - Integrated into proxy pipeline after Codex overrides

Verification

TypeScript typecheck passing
36 new unit tests passing
Full test suite (1786 tests) passing
Production build successful

Greptile Overview

Greptile Summary

Adds two provider-level override settings for Anthropic claude/claude-auth providers that allow administrators to force-override max_tokens (1-64,000) and thinking.budget_tokens (1,024-32,000) API parameters.

Key Changes

Core Override Logic: provider-overrides.ts implements override application with runtime clamping (budget < max_tokens), minimum budget validation (>= 1024), and comprehensive audit trail support
Thinking Budget Rectifier: New reactive error handler in thinking-budget-rectifier.ts that detects budget_tokens < 1024 API errors and automatically sets maximum values (32000/64000) with one retry attempt
Database Schema: Migration 0060 adds two varchar(20) columns to providers table; migration 0061 adds enable_thinking_budget_rectifier boolean to system_settings
Type System: Added AnthropicMaxTokensPreference and AnthropicThinkingBudgetPreference types throughout Provider interfaces (Provider, ProviderDisplay, Create/Update)
Validation: Comprehensive Zod schemas with regex validation, range checks, and cross-field validation ensuring budget < max_tokens
UI Implementation: New section in routing-section.tsx with toggle controls, numeric inputs, and "Max Out (32000)" quick-fill button, conditionally rendered for claude/claude-auth providers only
Proxy Integration: Integrated into forwarder.ts pipeline after Codex overrides, with proper special settings persistence and audit trail tracking
i18n: Complete translations for 5 languages (en, zh-CN, zh-TW, ja, ru)

Test Coverage

36 unit tests for provider-overrides.ts covering all scenarios including clamping, minimum validation, audit tracking
Additional tests for thinking-budget-rectifier.ts covering error detection and rectification
All 1786 tests passing per PR description

Notable Patterns

Values stored as JSON strings ("inherit" or numeric like "32000") to support UI flexibility
When thinking budget is set, thinking.type automatically forced to "enabled"
Runtime clamping prevents invalid API requests where budget >= max_tokens
Graceful handling when clamping would violate 1024 minimum (skips thinking override entirely)

Confidence Score: 5/5

This PR is safe to merge with very low risk - excellent test coverage, well-structured implementation, and comprehensive validation
The implementation demonstrates exceptional quality with 36 comprehensive unit tests, proper edge case handling (clamping, minimum validation), complete type safety, robust validation schemas, and clean separation of concerns. The audit trail support and special settings integration follow existing patterns consistently.
No files require special attention - all critical files have strong test coverage and follow established architectural patterns

Important Files Changed

Filename	Overview
src/lib/anthropic/provider-overrides.ts	Core override logic with proper clamping, minimum budget validation, and comprehensive audit trail support
tests/unit/proxy/anthropic-provider-overrides.test.ts	Comprehensive test coverage with 36 tests covering all edge cases, clamping logic, and audit functionality
drizzle/0060_bored_gertrude_yorkes.sql	Adds two varchar(20) columns for anthropic preferences, includes unrelated timezone default change
src/drizzle/schema.ts	Schema definitions for new provider and system settings columns with proper comments
src/lib/validation/schemas.ts	Comprehensive validation with regex, range checks (1-64000, 1024-32000), and cross-field validation ensuring budget < max_tokens
src/app/v1/_lib/proxy/forwarder.ts	Integrates anthropic overrides into proxy pipeline with proper audit trail and special settings persistence
src/app/v1/_lib/proxy/thinking-budget-rectifier.ts	Reactive rectifier for budget_tokens < 1024 API errors, sets max values (32000/64000)

Sequence Diagram

sequenceDiagram
    participant Client
    participant ProxyForwarder
    participant ProviderOverrides
    participant Validation
    participant ThinkingRectifier
    participant AnthropicAPI

    Client->>ProxyForwarder: POST /v1/messages
    Note over ProxyForwarder: Load provider config
    
    ProxyForwarder->>Validation: Validate request
    Validation-->>ProxyForwarder: Valid
    
    alt Provider is claude/claude-auth
        ProxyForwarder->>ProviderOverrides: applyAnthropicProviderOverridesWithAudit()
        Note over ProviderOverrides: Check preference settings
        
        alt Preferences set (not "inherit")
            ProviderOverrides->>ProviderOverrides: Override parameters
            Note over ProviderOverrides: Clamp budget to limit minus one
            Note over ProviderOverrides: Skip if budget below minimum
            ProviderOverrides->>ProviderOverrides: Build audit trail
        end
        
        ProviderOverrides-->>ProxyForwarder: Modified request with audit
        ProxyForwarder->>ProxyForwarder: Store audit in special settings
    end
    
    ProxyForwarder->>AnthropicAPI: Forward request
    
    alt Error: budget validation failure
        AnthropicAPI-->>ProxyForwarder: 400 error (budget constraint)
        ProxyForwarder->>ThinkingRectifier: detectThinkingBudgetRectifierTrigger()
        ThinkingRectifier-->>ProxyForwarder: Trigger detected
        
        alt First retry
            ProxyForwarder->>ThinkingRectifier: rectifyThinkingBudget()
            Note over ThinkingRectifier: Set budget to maximum
            Note over ThinkingRectifier: Increase limit if needed
            ThinkingRectifier-->>ProxyForwarder: Rectified request
            ProxyForwarder->>ProxyForwarder: Store rectifier audit
            ProxyForwarder->>AnthropicAPI: Retry with rectified request
        end
    end
    
    AnthropicAPI-->>ProxyForwarder: Success response
    ProxyForwarder-->>Client: Return response

… settings Add two new provider-level settings for claude/claude-auth type providers: - anthropicMaxTokensPreference: Override request max_tokens (1-64000) - anthropicThinkingBudgetPreference: Override thinking.budget_tokens (1024-32000) Key behaviors: - When thinking budget is set, thinking.type is forced to 'enabled' - Runtime clamping: budget_tokens clamped to max_tokens-1 if needed - UI shows these fields only for claude/claude-auth providers - Values stored as JSON strings ('inherit' or numeric like '32000') Changes: - Core override logic with audit trail (src/lib/anthropic/provider-overrides.ts) - 36 unit tests covering all scenarios - Database migration for new varchar columns - Type system updates (Provider, ProviderDisplay, Create/Update interfaces) - Form UI with validation and 'Max Out' quick-fill button - i18n translations for 5 languages (en, zh-CN, zh-TW, ja, ru) - Proxy pipeline integration after Codex overrides

coderabbitai · 2026-01-31T09:13:52Z

📝 Walkthrough

Walkthrough

为 Anthropic（Claude）提供商增加参数级覆写与思考预算整流：数据库迁移、类型与验证、后端覆写与审计、代理整流与重试、前端表单与多语言文案、快照与测试等按迁移顺序的变更。

Changes

Cohort / File(s)	Summary
数据库迁移与快照 `drizzle/0060_bored_gertrude_yorkes.sql`, `drizzle/meta/0060_snapshot.json`, `drizzle/0061_exotic_trauma.sql`, `drizzle/meta/0061_snapshot.json`, `drizzle/meta/_journal.json`	为 `providers` 表新增 `anthropic_max_tokens_preference` 与 `anthropic_thinking_budget_preference` 字段，移除 `notification_target_bindings.schedule_timezone` 的默认约束，向 `system_settings` 添加 `enable_thinking_budget_rectifier` 布尔列并更新快照与 journal。
前端多语言与配置文案 `messages/en/settings/providers/form/sections.json`, `messages/ja/.../sections.json`, `messages/ru/.../sections.json`, `messages/zh-CN/.../sections.json`, `messages/zh-TW/.../sections.json`, `messages/*/settings/config.json`	添加多语言文案条目：provider 表单中的 `anthropicOverrides`（maxTokens、thinkingBudget）和系统设置说明键 `enableThinkingBudgetRectifier` 及其描述。
前端表单状态与组件 `src/app/[locale]/settings/providers/_components/forms/.../provider-form-context.tsx`, `.../provider-form-types.ts`, `.../index.tsx`, `.../sections/routing-section.tsx`	Provider 表单状态加入两个 Anthropic preference 字段及 reducer action，提交数据包含这两项，并新增“Anthropic Overrides” UI 区块（继承/自定义、数值输入、最大化按钮等）。
类型、验证与转换 `src/types/provider.ts`, `src/lib/validation/schemas.ts`, `src/repository/_shared/transformers.ts`, `src/types/system-config.ts`, `src/types/special-settings.ts`	新增 Anthropic preference 类型与 Provider/ProviderDisplay 字段，扩展 Create/Update Provider 验证（范围校验与跨字段 budget < max_tokens 校验），在转换器与系统类型中映射/添加 `enableThinkingBudgetRectifier` 与 thinking_budget_rectifier 特殊设置类型。
后端持久化与仓库 `src/repository/provider.ts`, `src/repository/system-config.ts`, `src/drizzle/schema.ts`	在 provider 的创建/查询/更新流程中持久化并返回新的 preference 字段；系统配置仓库支持读写 `enableThinkingBudgetRectifier` 并提供默认值。
Anthropic 覆写与审计逻辑 `src/lib/anthropic/provider-overrides.ts`, `src/lib/utils/special-settings.ts`	新增模块实现 Anthropic 覆写（修改 max_tokens、thinking.budget_tokens、归一化与 clamping），并提供带审计的变更函数；将 thinking_budget_rectifier 纳入 special-setting key 构建以支持去重与持久化。
代理/转发与重试整流 `src/app/v1/_lib/proxy/forwarder.ts`, `src/app/v1/_lib/proxy/thinking-budget-rectifier.ts`	在转发流程中集成 Anthropic 覆写与可审计持久化，新增思考预算整流器：检测触发、就地修复请求（设置 thinking.type / budget / max_tokens）、记录 before/after、单次重试与错误标记逻辑。
系统设置缓存与动作 `src/lib/config/system-settings-cache.ts`, `src/actions/system-config.ts`, `src/actions/providers.ts`	将 enableThinkingBudgetRectifier 纳入默认缓存与系统设置保存/更新流程；getProviders 动作返回包含新的 provider preference 字段。
UI 系统设置页面 `src/app/[locale]/settings/config/_components/system-settings-form.tsx`, `src/app/[locale]/settings/config/page.tsx`	系统设置表单增加 “Enable Thinking Budget Rectifier” 开关并将其状态读写到保存/返回流程。
测试覆盖 `tests/unit/proxy/anthropic-provider-overrides.test.ts`, `src/app/v1/_lib/proxy/thinking-budget-rectifier.test.ts`	新增全面单元测试：Anthropic 覆写（包含审计、边界与 clamping 场景）以及 Thinking Budget Rectifier 的检测与修复行为测试。

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

feat: thinking signature rectifier #576: 与本 PR 在代理 forwarder、整流/重试逻辑及 system_settings 列添加上有直接重叠修改。
feat: 统一请求特殊设置命中的展示方式 #574: 修改 special-settings 构建逻辑，与本 PR 对 buildSettingKey/特殊设置去重的更改存在代码级交集。
feat: 审计并展示供应商级参数覆写（specialSettings） #557: 在 special-settings 审计与持久化流（provider 参数覆写审计）上与本 PR 有高度相关的实现重叠。

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 46.43% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	PR标题准确概括了主要变更：为Anthropic提供商添加max_tokens和thinking budget覆盖设置。标题简洁、清晰，准确反映了核心功能。
Description check	✅ Passed	PR描述准确关联了本次变更内容，详细解释了新增Anthropic提供商覆写设置的功能，包括核心行为、更改范围和验证状态。

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/anthropic-provider-token-overrides

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist · 2026-01-31T09:14:00Z

Summary of Changes

Hello @ding113, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the control over Anthropic API requests by allowing administrators to define global override preferences for max_tokens and thinking.budget_tokens at the provider level. This provides greater flexibility in managing resource consumption and model behavior for Claude models, ensuring that API calls adhere to desired constraints even if client requests vary.

Highlights

New Anthropic Provider Settings: Introduced two new provider-level settings: anthropicMaxTokensPreference to override max_tokens (range: 1-64,000) and anthropicThinkingBudgetPreference to override thinking.budget_tokens (range: 1,024-32,000). These settings are applicable only to claude and claude-auth type providers.
Automatic thinking.type Enforcement: When anthropicThinkingBudgetPreference is set, the thinking.type parameter in the Anthropic API request is automatically forced to 'enabled' to ensure the budget is respected.
Runtime Clamping for Thinking Budget: Implemented runtime clamping logic where if thinking.budget_tokens is set to be greater than or equal to max_tokens, it will be automatically reduced to max_tokens - 1 to prevent invalid API requests.
UI and Localization Support: Added dedicated UI fields for these new settings, visible only for Claude-type providers, including a 'Max Out (32000)' quick-fill button for convenience. Full localization support has been added for 5 languages.
Robust Validation and Audit Trail: New validation schemas ensure correct numeric ranges and logical consistency (e.g., thinking budget < max tokens). An audit trail is generated for each override application, detailing before and after values for transparency.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces provider-level override settings for Anthropic's max_tokens and thinking.budget_tokens, which is a valuable feature for fine-tuning provider behavior. The implementation is comprehensive, covering database schema changes, backend logic with auditing, extensive unit tests, and a well-designed UI. Overall, the code quality is high.

I have identified a high-severity issue in the runtime clamping logic that could lead to invalid API requests and a medium-severity issue with the validation logic for updates. Addressing these points will enhance the feature's robustness and prevent potential runtime errors.

gemini-code-assist · 2026-01-31T09:16:05Z

src/lib/anthropic/provider-overrides.ts

+  if (thinkingBudget !== null) {
+    ensureCloned();
+    const existingThinking = isPlainObject(output.thinking) ? output.thinking : {};
+    let budgetTokens = thinkingBudget;
+    const currentMaxTokens = typeof output.max_tokens === "number" ? output.max_tokens : null;
+    if (currentMaxTokens !== null && budgetTokens >= currentMaxTokens) {
+      budgetTokens = currentMaxTokens - 1;
+    }
+    const nextThinking: Record<string, unknown> = {
+      ...existingThinking,
+      type: "enabled",
+      budget_tokens: budgetTokens,
+    };
+    output.thinking = nextThinking;
+  }


The current clamping logic for thinking.budget_tokens can result in an invalid value being sent to the Anthropic API. According to the documentation and your own validation schema, thinking.budget_tokens must be at least 1024. However, if max_tokens is less than 1025 (e.g., 1000 from a client request) and a thinkingBudget override is active (e.g., 1024), the clamping logic budgetTokens = currentMaxTokens - 1 will set budget_tokens to 999. This is an invalid value and will cause the API request to fail.

To prevent sending invalid requests, I suggest modifying the logic to only apply the thinking override if the final clamped value is valid (i.e., >= 1024).

ensureCloned(); const existingThinking = isPlainObject(output.thinking) ? output.thinking : {}; let budgetTokens = thinkingBudget; const currentMaxTokens = typeof output.max_tokens === "number" ? output.max_tokens : null; if (currentMaxTokens !== null && budgetTokens >= currentMaxTokens) { budgetTokens = currentMaxTokens - 1; } // Per Anthropic docs, budget_tokens must be at least 1024. // Only apply the override if the final value is valid to avoid sending a request that is guaranteed to fail. if (budgetTokens >= 1024) { const nextThinking: Record<string, unknown> = { ...existingThinking, type: "enabled", budget_tokens: budgetTokens, }; output.thinking = nextThinking; }

gemini-code-assist · 2026-01-31T09:16:05Z

src/lib/validation/schemas.ts

+  .superRefine((data, ctx) => {
+    const maxTokens = data.anthropic_max_tokens_preference;
+    const budget = data.anthropic_thinking_budget_preference;
+    if (maxTokens && maxTokens !== "inherit" && budget && budget !== "inherit") {
+      const maxTokensNum = Number.parseInt(maxTokens, 10);
+      const budgetNum = Number.parseInt(budget, 10);
+      if (budgetNum >= maxTokensNum) {
+        ctx.addIssue({
+          code: z.ZodIssueCode.custom,
+          message: "thinking.budget_tokens must be less than max_tokens",
+          path: ["anthropic_thinking_budget_preference"],
+        });
+      }
+    }
+  });


The superRefine validation on UpdateProviderSchema to ensure thinking.budget_tokens is less than max_tokens is not fully effective. Since UpdateProviderSchema fields are optional, this check only runs if both anthropic_max_tokens_preference and anthropic_thinking_budget_preference are included in the same update request. If a user updates only one of these fields, the validation will not trigger, potentially allowing an invalid state to be saved (e.g., setting a budget that is higher than the already-persisted max_tokens).

While the runtime clamping provides a safeguard, it's better to prevent such configurations at the form submission level. I recommend moving this cross-field validation logic into the editProvider server action, where you can fetch the existing provider data, merge it with the update payload, and then perform the validation on the complete entity before saving.

greptile-apps

_{5 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2026-01-31T09:18:11Z

drizzle/0060_bored_gertrude_yorkes.sql

@@ -0,0 +1,3 @@
+ALTER TABLE "notification_target_bindings" ALTER COLUMN "schedule_timezone" DROP DEFAULT;--> statement-breakpoint


Unrelated change to notification_target_bindings.schedule_timezone included in migration. Check if this was intentional or should be in a separate migration.

Prompt To Fix With AI

This is a comment left during a code review. Path: drizzle/0060_bored_gertrude_yorkes.sql Line: 1:1 Comment: Unrelated change to `notification_target_bindings.schedule_timezone` included in migration. Check if this was intentional or should be in a separate migration. How can I resolve this? If you propose a fix, please make it concise.

coderabbitai

Actionable comments posted: 2

🤖 Fix all issues with AI agents

In
`@src/app/`[locale]/settings/providers/_components/forms/provider-form/sections/routing-section.tsx:
- Around line 549-550: The hardcoded visible strings in the SelectItem
components (e.g., the "Custom" label for value="custom" in routing-section.tsx
and the similar items at the other mentioned lines) must be replaced with i18n
keys and translations; update the JSX in the RoutingSection component (the
SelectItem children) to call the project's translation helper (e.g.,
useTranslations/t function) with a new key like
settings.providers.routing.custom instead of the literal "Custom", and add
corresponding entries for zh-CN, zh-TW, en, ja, ru in the locale
JSON/translation files; ensure you update both occurrences referenced (around
value="custom" and the other SelectItem at lines ~614-615) so all user-facing
options use the new i18n keys.

In `@src/lib/anthropic/provider-overrides.ts`:
- Around line 61-75: The clamp logic for budgetTokens can push it below
Anthropic's minimum (1024); after computing budgetTokens (in the block that uses
normalizeNumericPreference/provider.anthropicThinkingBudgetPreference and
currentMaxTokens), add a check: if budgetTokens < 1024 then do not set
output.thinking (i.e., skip the ensureCloned()/assignment to
nextThinking/output.thinking) so the thinking override is not applied; otherwise
proceed to build nextThinking and assign it as currently done.

🧹 Nitpick comments (4)

src/types/provider.ts (1)

33-37: 明确 null 与 "inherit" 的语义差异

类型允许 null 与 "inherit"，但未说明 null 的含义；建议补充注释或统一语义，避免 API/DB/UI 行为含糊。

Based on learnings In TypeScript interfaces, explicitly document and enforce distinct meanings for null and undefined. Example: for numeric limits like limitTotalUsd, use 'number | null | undefined' when null signifies explicitly unlimited (e.g., matches DB schema or special UI logic) and undefined signifies 'inherit default'. This pattern should be consistently reflected in type definitions across related fields to preserve semantic clarity between database constraints and UI behavior.

Also applies to: 137-139
src/lib/validation/schemas.ts (3)
28-56: 建议将魔法数字提取为常量

Anthropic 相关的限制值（1, 64000, 1024, 32000）直接硬编码在 schema 中。参考 PROVIDER_LIMITS 的模式，建议将这些限制提取到 @/lib/constants/provider.constants.ts 中，便于维护和复用。
♻️ 建议的重构方案

在 provider.constants.ts 中添加：
export const ANTHROPIC_LIMITS = {
  MAX_TOKENS: { MIN: 1, MAX: 64000 },
  THINKING_BUDGET: { MIN: 1024, MAX: 32000 },
} as const;
然后在 schema 中引用这些常量，保持与其他限制配置的一致性。
34-41: 错误消息语言不一致

新增的 Anthropic schema 使用英文错误消息，而文件中其他验证消息（如 Line 62 的 "用户名不能为空"）使用中文。建议保持一致性，统一使用中文或在 UI 层通过 i18n 映射。

764-779: 补充 UI 提示，说明两个字段的约束关系和可能的自动调整行为

当前验证逻辑确实存在边界情况：如果仅通过 API 部分更新其中一个字段，schema 无法验证新值与已存在值的组合。不过实际上，UI 表单始终同时提交两个字段（lines 210-211），所以 schema 的 superRefine 验证能正常工作。

建议在 UI 层补充说明：

在 thinking budget 字段的 Tooltip（目前在 line 581-582 处为空）中添加提示文字，说明 budget < max_tokens 的约束

说明如果提交的值不满足约束会发生什么（是否自动调整）

这样用户在直接调用 API 进行部分更新时，也能了解到这个隐含约束。

...app/[locale]/settings/providers/_components/forms/provider-form/sections/routing-section.tsx

src/lib/anthropic/provider-overrides.ts

github-actions · 2026-01-31T09:27:07Z

src/lib/anthropic/provider-overrides.ts

+    let budgetTokens = thinkingBudget;
+    const currentMaxTokens = typeof output.max_tokens === "number" ? output.max_tokens : null;
+    if (currentMaxTokens !== null && budgetTokens >= currentMaxTokens) {
+      budgetTokens = currentMaxTokens - 1;


[HIGH] [LOGIC-BUG] thinking.budget_tokens can be clamped below 1024

File: src/lib/anthropic/provider-overrides.ts:68

Why this is a problem: applyAnthropicProviderOverrides clamps budget_tokens to max_tokens - 1. If the client request has max_tokens <= 1024, this produces budget_tokens < 1024 while also forcing thinking.type = "enabled", which is very likely to create an invalid Anthropic request (and break requests when this override is enabled). This also violates the intended minimum enforced in validation (thinking.budget_tokens >= 1024).

Concrete example: max_tokens = 100, provider override thinkingBudget = 10240 => budget_tokens becomes 99.

Suggested fix:

const MIN_THINKING_BUDGET_TOKENS = 1024; // ... const currentMaxTokens = typeof output.max_tokens === "number" ? output.max_tokens : null; if (currentMaxTokens !== null && budgetTokens >= currentMaxTokens) { budgetTokens = currentMaxTokens - 1; if (budgetTokens < MIN_THINKING_BUDGET_TOKENS) { // max_tokens too small to safely enable thinking; skip the thinking override return output; } }

Add a unit test covering the max_tokens <= 1024 clamping case to lock this behavior.

github-actions · 2026-01-31T09:27:20Z

...app/[locale]/settings/providers/_components/forms/provider-form/sections/routing-section.tsx

+                      <SelectItem value="inherit">
+                        {t("sections.routing.anthropicOverrides.maxTokens.options.inherit")}
+                      </SelectItem>
+                      <SelectItem value="custom">Custom</SelectItem>


[MEDIUM] [STANDARD-VIOLATION] Hardcoded UI label "Custom" bypasses i18n

File: src/app/[locale]/settings/providers/_components/forms/provider-form/sections/routing-section.tsx:549 (also at :614)

Why this is a problem: Project rule: "i18n Required - All user-facing strings must use i18n (5 languages supported). Never hardcode display text". The literal Custom is rendered in the provider form, so it will never be translated.

Suggested fix:

<SelectItem value="custom"> {t("sections.routing.anthropicOverrides.maxTokens.options.custom")} </SelectItem>

Apply the same change for thinkingBudget, and add the corresponding options.custom keys to messages/*/settings/providers/form/sections.json.

github-actions

Code Review Summary

No significant issues identified in this PR.

PR Size: XL

Lines changed: 4,585 (4,404 additions + 181 deletions)
Files changed: 21

Note on size: While this is an XL PR, the changes are well-organized and follow a clear pattern. The bulk comes from comprehensive test coverage (36 tests) and i18n translations (5 languages × multiple keys). The core logic is concise and focused.

Review Coverage

Logic and correctness - Clean
Security (OWASP Top 10) - Clean
Error handling - Clean (proper logging in all catch blocks)
Type safety - Clean (comprehensive Zod schemas with cross-field validation)
Documentation accuracy - Good (clear comments explaining behavior)
Test coverage - Excellent (36 unit tests, 1786 total tests passing)
Code clarity - Good (follows existing patterns, clear separation of concerns)

Key Strengths

Comprehensive Testing: 36 unit tests covering all scenarios including edge cases (clamping logic, provider type filtering, audit trail)
Layered Validation: Three-tier validation approach prevents invalid configurations:
- UI validation (HTML min/max)
- Schema validation (prevents misconfigured providers)
- Runtime clamping (handles dynamic edge cases)
Proper Error Handling: All async operations have proper .catch() blocks with contextual logging
Type Safety: Strong Zod schemas with cross-field validation ensuring budget_tokens < max_tokens
Immutability: Core logic properly clones request object before modification
i18n Compliance: All user-facing strings properly internationalized for 5 languages

Migration Note

The migration 0060_bored_gertrude_yorkes.sql includes ALTER TABLE "notification_target_bindings" ALTER COLUMN "schedule_timezone" DROP DEFAULT which fixes a schema drift issue (previous migration snapshot had a default value that wasn't in the schema definition). This is a legitimate schema sync fix generated by Drizzle.

Automated review by Claude AI

github-actions

Code Review Summary

This PR introduces provider-level Anthropic parameter overrides and touches DB migrations, validation, proxy forwarding, admin UI, and i18n. Given the XL size (4585 lines across 21 files), the change-set is difficult to validate as a single unit; consider splitting to reduce merge risk and ease regression review.

PR Size: XL

Lines changed: 4585
Files changed: 21
Suggested split (XL):
- PR 1: Core override logic + proxy integration + unit tests (src/lib/anthropic/*, src/app/v1/_lib/proxy/forwarder.ts, tests/unit/proxy/*)
- PR 2: Provider schema/repo/types + migrations (src/lib/validation/schemas.ts, src/types/provider.ts, src/repository/*, src/drizzle/schema.ts, drizzle/*)
- PR 3: Admin UI + i18n (src/app/[locale]/settings/providers/**, messages/*)

Issues Found

Category	High	Medium
Logic/Bugs	1	0
Security	0	0
Error Handling	0	0
Types	0	0
Comments/Docs	0	1
Tests	0	1
Simplification	0	0

Critical Issues (Must Fix)

(none)

High Priority Issues (Should Fix)

[HIGH] [LOGIC-BUG] thinking.budget_tokens can be clamped below 1024 when client max_tokens <= 1024, which likely produces invalid Anthropic requests (see src/lib/anthropic/provider-overrides.ts:68).
[TEST-EDGE-CASE] Add coverage for the max_tokens <= 1024 clamping/skip behavior in tests/unit/proxy/anthropic-provider-overrides.test.ts.
[MEDIUM] [STANDARD-VIOLATION] Hardcoded UI label Custom bypasses i18n (see src/app/[locale]/settings/providers/_components/forms/provider-form/sections/routing-section.tsx:549).

Review Coverage

Automated review by Codex AI

github-actions

Labeled PR #689 as size/XL.
Posted inline review comments on:
- src/lib/anthropic/provider-overrides.ts:68 — potential invalid thinking.budget_tokens when clamped below 1024.
- src/app/[locale]/settings/providers/_components/forms/provider-form/sections/routing-section.tsx:549 — hardcoded UI label Custom bypasses i18n (also occurs at :614).
Submitted the required “Code Review Summary” review comment on PR #689.

…rrors Add reactive rectifier that automatically fixes Anthropic API errors when thinking.budget_tokens is below the minimum threshold of 1024. Trigger: Error message 'thinking.enabled.budget_tokens: Input should be greater than or equal to 1024' Rectification: - Set thinking.type = 'enabled' - Set thinking.budget_tokens = 32000 (maximum) - Set max_tokens = 64000 if missing or < 32001 Features: - System setting toggle: enableThinkingBudgetRectifier (default: true) - Follows same pattern as thinking-signature-rectifier - Retries once on same provider after rectification - Full audit trail via ThinkingBudgetRectifierSpecialSetting Changes: - Core rectifier logic with detect/rectify functions (22 unit tests) - Forwarder integration in retry loop (section 2.6) - System settings full chain (schema/types/cache/repo/actions/validation) - UI toggle in system settings form - i18n translations for 5 languages (en, zh-CN, zh-TW, ja, ru) - Database migration for new column

- Fix budget_tokens clamping below API minimum (1024) - Skip thinking override when clamped value would be invalid - Replace hardcoded "Custom" with i18n keys (5 languages) - Add unit tests for 1024 boundary behavior

…errides feat(provider): add Anthropic max_tokens and thinking budget override settings

github-project-automation bot added this to Claude Code Hub Roadmap Jan 31, 2026

github-project-automation bot moved this to Backlog in Claude Code Hub Roadmap Jan 31, 2026

github-actions bot added enhancement New feature or request area:provider area:Anthropic area:i18n area:UI size/XL Extra Large PR (> 1000 lines) labels Jan 31, 2026

gemini-code-assist bot reviewed Jan 31, 2026

View reviewed changes

greptile-apps bot reviewed Jan 31, 2026

View reviewed changes

coderabbitai bot requested changes Jan 31, 2026

View reviewed changes

...app/[locale]/settings/providers/_components/forms/provider-form/sections/routing-section.tsx Outdated Show resolved Hide resolved

src/lib/anthropic/provider-overrides.ts Show resolved Hide resolved

github-actions bot reviewed Jan 31, 2026

View reviewed changes

ding113 added 2 commits January 31, 2026 17:47

fix(provider): address bugbot review comments for Anthropic overrides

8485de5

- Fix budget_tokens clamping below API minimum (1024) - Skip thinking override when clamped value would be invalid - Replace hardcoded "Custom" with i18n keys (5 languages) - Add unit tests for 1024 boundary behavior

coderabbitai bot approved these changes Jan 31, 2026

View reviewed changes

ding113 merged commit ab59d3c into dev Feb 2, 2026
9 of 11 checks passed

github-project-automation bot moved this from Backlog to Done in Claude Code Hub Roadmap Feb 2, 2026

github-actions bot pushed a commit that referenced this pull request Feb 2, 2026

Merge pull request #689 from ding113/feat/anthropic-provider-token-ov…

9a59a9c

…errides feat(provider): add Anthropic max_tokens and thinking budget override settings

This was referenced Feb 3, 2026

release v0.5.3 #712

Merged

feat(gemini): add Google Search web access preference for Gemini providers #721

Merged

release v0.5.4 #731

Merged

coderabbitai bot mentioned this pull request Feb 10, 2026

feat(anthropic): add adaptive thinking override support #758

Merged

9 tasks

		@@ -0,0 +1,3 @@
		ALTER TABLE "notification_target_bindings" ALTER COLUMN "schedule_timezone" DROP DEFAULT;--> statement-breakpoint

Uh oh!

Conversation

ding113 commented Jan 31, 2026 • edited by greptile-apps bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key Behaviors

Changes

Core Logic

Tests

Database

Type System & Repository

UI & Forms

i18n

Proxy Integration

Verification

Greptile Overview

Greptile Summary

Key Changes

Test Coverage

Notable Patterns

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

coderabbitai bot commented Jan 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Uh oh!

gemini-code-assist bot commented Jan 31, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Jan 31, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Jan 31, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Jan 31, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions bot Jan 31, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot Jan 31, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Code Review Summary

PR Size: XL

Review Coverage

Key Strengths

Migration Note

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Code Review Summary

PR Size: XL

Issues Found

Critical Issues (Must Fix)

High Priority Issues (Should Fix)

Review Coverage

Uh oh!

ding113 commented Jan 31, 2026 •

edited by greptile-apps bot

Loading

coderabbitai bot commented Jan 31, 2026 •

edited

Loading