-
-
Notifications
You must be signed in to change notification settings - Fork 181
Closed
Labels
enhancementNew feature or requestNew feature or request
Milestone
Description
Anthropic 的缓存时间分为 5 min 和 1 hour,二者缓存写入和读取费率略有不同。
默认情况下,客户端会请求 5min 缓存。而对于长对话需求,1h 缓存更有可能命中,即可以降低缓存的创建开销。
差异化计费主要依赖响应体message_start中的usage字段实现。
event: message_start
data: {"type":"message_start","message":{"model":"claude-sonnet-4-5-20250929","id":"msg_013EzTQuQxsySLNwQeLiDHfH","type":"message","role":"assistant","content":[],"stop_reason":null,"stop_sequence":null,"usage":{"input_tokens":0,"cache_creation_input_tokens":797,"cache_read_input_tokens":118215,"cache_creation":{"ephemeral_5m_input_tokens":0,"ephemeral_1h_input_tokens":797},"output_tokens":24,"service_tier":"standard"}} }
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request
Projects
Status
Done