What do the x-ratelimit-remaining-tokens and x-ratelimit-remaining-requests headers mean? #138918

honeming · 2024-09-17T16:14:44Z

Topic Area: Question

I just sent my first request to the gpt-4o-2024-08-06 model. The rate limits listed in the documentation are:

Rate limits	Value
Requests per minute	10
Requests per day	50
Tokens per request	8000 in, 4000 out

The request succeeded and I got the following headers:

key	value
x-ratelimit-remaining-tokens	19999342
x-ratelimit-remaining-requests	199998

I think the headers have something to do with the rate limits, but I don’t know exactly what they mean.

Answered by rajbos

rajbos · 2024-09-18T19:21:34Z

The rate limits refer to the overview in the documentation:

You have 12k tokens (in+out) per request and you can make up to 150 requests a day (with the Free / Copilot Individual level). That amounts to 1.8 million tokens a day, which is a lot lower than the number in your headers, so I am not sure how that is calculated.

Perhaps they have temporarily higher limits during the beta? My numbers are lower, by the way. I used a PAT (personal access token) here:

key	value
x-ratelimit-remaining-tokens	1997968
x-ratelimit-remaining-requests	19998
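
As a quick check of the arithmetic in this answer, here is a minimal Python sketch that multiplies the per-request token limit by the 150 requests a day quoted above and compares the result with the two `x-ratelimit-remaining-tokens` values reported in this thread. The variable names are illustrative only, and the comparison assumes nothing about what window the header actually covers.

```python
# Documented per-request limit from the question: 8000 tokens in, 4000 out.
tokens_per_request = 8_000 + 4_000

# Requests per day as quoted in this answer (Free / Copilot Individual level).
requests_per_day = 150

daily_token_budget = tokens_per_request * requests_per_day
print(daily_token_budget)  # 1800000

# x-ratelimit-remaining-tokens values reported in this thread.
reported_remaining_tokens = {
    "honeming": 19_999_342,
    "rajbos": 1_997_968,
}

for user, remaining in reported_remaining_tokens.items():
    ratio = remaining / daily_token_budget
    print(f"{user}: {remaining:,} remaining, {ratio:.1f}x the documented daily budget")
```

Both reported values exceed the 1.8 million figure, which is why the documented limits alone do not explain the headers.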

I noticed this info in the response as well:

"usage": {
  "completion_tokens": 7,
  "prompt_tokens": 24,
  "total_tokens": 31
}
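
For anyone who wants to log these values on each call, the following is a rough Python sketch, not an official client. It pulls the `x-ratelimit-remaining-*` headers out of a header mapping and checks that the `usage` counts add up. The helper name `remaining_limits` is hypothetical, the sample values are copied from this answer, and the code does not establish which time window the counters refer to.

```python
from typing import Dict, Mapping


def remaining_limits(headers: Mapping[str, str]) -> Dict[str, int]:
    """Collect x-ratelimit-remaining-* headers as integers, keyed by suffix.

    Hypothetical helper for illustration; it only parses what is present.
    """
    prefix = "x-ratelimit-remaining-"
    result: Dict[str, int] = {}
    for name, value in headers.items():
        lowered = name.lower()  # HTTP header names are case-insensitive
        if lowered.startswith(prefix):
            result[lowered[len(prefix):]] = int(value)
    return result


# Sample values copied from the answer above.
headers = {
    "x-ratelimit-remaining-tokens": "1997968",
    "x-ratelimit-remaining-requests": "19998",
}
usage = {"completion_tokens": 7, "prompt_tokens": 24, "total_tokens": 31}

limits = remaining_limits(headers)
print(limits)

# The usage block is internally consistent: prompt + completion == total.
usage_is_consistent = (
    usage["prompt_tokens"] + usage["completion_tokens"] == usage["total_tokens"]
)
print(usage_is_consistent)
```

Logging `limits` next to `usage` over several requests would show how much each counter drops per call, which is one way to work out empirically what the headers track.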

1 reply

rajbos Sep 18, 2024

I get numbers of the same magnitude when using the GITHUB_TOKEN in a Codespace.
