The rate limits are listed in the overview in the documentation: you get 12k tokens (input + output) per request, and you can make up to 150 requests a day (on the Free / Copilot Individual level). That amounts to at most 1.8 million tokens a day, which is a lot lower than the number you have, so I am not sure how yours is calculated. Perhaps they have temporarily higher limits during the beta? I get the lower numbers, by the way. I've used a PAT here:
I noticed this info in the response as well:
"usage": {
  "completion_tokens": 7,
  "prompt_tokens": 24,
  "total_tokens": 31
}
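To make the arithmetic above concrete, here is a minimal Python sketch. It only uses the figures quoted in this reply (12k tokens per request, 150 requests a day) and the `usage` values from the response. The constant and function names are made up for illustration, and the numbers may not match the limits on your tier.

```python
# Figures quoted in the reply above (Free / Copilot Individual level).
# These are assumptions taken from the thread, not values read from the API.
TOKENS_PER_REQUEST = 12_000  # input + output tokens combined
REQUESTS_PER_DAY = 150

# Upper bound on tokens per day if every request used its full allowance.
daily_token_ceiling = TOKENS_PER_REQUEST * REQUESTS_PER_DAY

# The "usage" object from the response, copied into a plain dict.
usage = {"completion_tokens": 7, "prompt_tokens": 24, "total_tokens": 31}


def share_of_request_limit(usage: dict, limit: int = TOKENS_PER_REQUEST) -> float:
    """Return the fraction of the per-request token allowance that was used."""
    return usage["total_tokens"] / limit


# total_tokens is the sum of prompt and completion tokens.
tokens_add_up = (
    usage["prompt_tokens"] + usage["completion_tokens"] == usage["total_tokens"]
)

print(daily_token_ceiling)  # 1800000
print(tokens_add_up)        # True
print(f"{share_of_request_limit(usage):.4%} of the per-request allowance")
```

So a tiny request like the one above counts as one of the 150 daily requests even though it uses only a sliver of the 12k tokens it could have used.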
Question
I just sent my first request to the gpt-4o-2024-08-06 model. The rate limits from the documentation are:
The request succeeded and I got the following headers:
I think the headers have something to do with the rate limits, but I don’t know exactly what they mean.