Important
- Billing for premium requests began on June 18, 2025, for all paid Copilot plans. Request counters were reset to zero on that date for paid plans only.
- Premium request counters reset on the 1st of each month. See Monitoring your Copilot usage and entitlements.
- Certain requests may experience rate limits to accommodate high demand. Rate limits restrict the number of requests that can be made within a specific time period.
What is a request?
A request is any interaction where you ask Copilot to do something for you—whether it’s generating code, answering a question, or helping you through an extension. Each time you send a prompt in a chat window or trigger a response from Copilot, you’re making a request.
What are premium requests?
Some Copilot features use more advanced processing power and count as premium requests. The number of premium requests a feature consumes can vary depending on the feature and the AI model used.
Premium features
The following Copilot features can use premium requests:
How do request allowances work per plan?
If you use Copilot Free, your plan comes with up to 2,000 code completion requests and up to 50 premium requests per month. All chat interactions count as premium requests.
If you're on a paid plan, you get unlimited code completions and unlimited chat interactions using the included models (GPT-4.1 and GPT-4o). Rate limiting is in place to accommodate high demand. See Rate limits for GitHub Copilot.
Paid plans also receive a monthly allowance of premium requests, which can be used for advanced chat interactions, code completions using premium models, and other premium features. For an overview of the number of premium requests included in each plan, see Plans for GitHub Copilot.
What happens to unused requests at the end of the month?
Unused requests from one month do not carry over to the next month.
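As a rough illustration of the monthly cycle, the following Python sketch computes the next reset date and the balance at the start of a month. The function names are hypothetical and are not part of any GitHub API. The sketch also ignores time zones, which this article does not specify.

```python
from datetime import date


def next_reset(today: date) -> date:
    """Return the 1st of the month following `today` (hypothetical helper)."""
    if today.month == 12:
        return date(today.year + 1, 1, 1)
    return date(today.year, today.month + 1, 1)


def starting_balance(monthly_allowance: int, unused_last_month: int) -> int:
    """Balance at the start of a month, assuming no carry-over."""
    # unused_last_month is deliberately ignored: unused requests are dropped.
    return monthly_allowance
```

For example, with the 50 premium requests of Copilot Free, `starting_balance(50, 20)` is still 50: the 20 unused requests from the previous month are not added.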
What if I run out of premium requests?
Note
Additional premium requests are not available to:
- Users on Copilot Free. To access more premium requests, upgrade to a paid plan.
- Users who subscribe, or have subscribed, to Copilot Pro or Copilot Pro+ through GitHub Mobile on iOS or Android.
If you're on a paid plan and use all of your premium requests, you can still use Copilot with one of the included models for the rest of the month. This is subject to change. Response times for the included models may vary during periods of high usage. Requests to the included models may be subject to rate limiting. See Rate limits for GitHub Copilot.
If you need more premium requests beyond your monthly allowance, you can:
- Set a spending limit for additional premium requests. See Preventing overspending.
- Upgrade to a higher plan.
These actions can be taken by organization owners, billing managers, and personal account users.
Important
By default, all budgets are set to zero, so premium requests over the allowance are rejected unless a budget has been created. Additional premium requests beyond your plan's included amount are billed at $0.04 USD per request.
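To make the overage arithmetic concrete, here is a minimal Python sketch based on the $0.04 USD rate stated above. The function names and the sample allowance of 300 are hypothetical. Whether fractional premium requests are billed fractionally or rounded is not stated in this article, so the sketch simply multiplies.

```python
from decimal import Decimal

PRICE_PER_REQUEST_USD = Decimal("0.04")  # rate stated in this article


def overage_cost(used, allowance) -> Decimal:
    """Estimated cost of premium requests beyond the allowance (hypothetical helper).

    Pass ints or strings such as "312.5"; floats would lose precision.
    """
    extra = max(Decimal(used) - Decimal(allowance), Decimal(0))
    return extra * PRICE_PER_REQUEST_USD


def requests_covered(budget_usd) -> Decimal:
    """Number of additional premium requests a budget can pay for."""
    # A zero budget (the default) covers no additional requests.
    return Decimal(budget_usd) / PRICE_PER_REQUEST_USD
```

Under these assumptions, using 350 premium requests against a hypothetical allowance of 300 gives `overage_cost(350, 300)`, which is 50 × $0.04 = $2.00 USD, and a $10 budget would cover 250 additional requests.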
Model multipliers
The available models vary depending on your Copilot plan. See Plans for GitHub Copilot.
Note
The models included with Copilot plans are subject to change.
Each model has a premium request multiplier, based on its complexity and resource usage. If you are on a paid Copilot plan, your premium request allowance is deducted according to this multiplier.
GPT-4.1 and GPT-4o are the included models, and do not consume any premium requests if you are on a paid plan.
If you use Copilot Free, you have access to a limited number of models, and each model consumes one premium request when used. For example, if you make a request using the o3-mini model, your interaction consumes one premium request, not 0.33 premium requests.
Model | Multiplier for paid plans | Multiplier for Copilot Free |
---|---|---|
GPT-4.1 | 0 | 1 |
GPT-4o | 0 | 1 |
GPT-4.5 | 50 | Not applicable |
Claude Sonnet 3.5 | 1 | 1 |
Claude Sonnet 3.7 | 1 | Not applicable |
Claude Sonnet 3.7 Thinking | 1.25 | Not applicable |
Claude Sonnet 4 | 1 | Not applicable |
Claude Opus 4 | 10 | Not applicable |
Gemini 2.0 Flash | 0.25 | 1 |
Gemini 2.5 Pro | 1 | Not applicable |
o1 | 10 | Not applicable |
o3 | 1 | Not applicable |
o3-mini | 0.33 | 1 |
o4-mini | 0.33 | Not applicable |
Examples of premium request usage
Premium request usage is based on the model’s multiplier and the feature you’re using. For example:
- Using GPT-4.5 in Copilot Chat: With a 50× multiplier, one interaction counts as 50 premium requests.
- Using GPT-4.1 on Copilot Free: Each interaction counts as 1 premium request.
- Using GPT-4.1 on a paid plan: No premium requests are consumed.
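The examples above can be reproduced with a small lookup, sketched here in Python. The multipliers are copied from the table in this article and are subject to change. The dictionary layout and the `premium_requests` function are hypothetical illustrations, not a GitHub API, and the sketch does not model any rounding that billing may apply.

```python
from decimal import Decimal

# model: (multiplier on paid plans, multiplier on Copilot Free)
# None means the model is listed as "Not applicable" for Copilot Free.
MULTIPLIERS = {
    "GPT-4.1": ("0", "1"),
    "GPT-4o": ("0", "1"),
    "GPT-4.5": ("50", None),
    "Claude Sonnet 3.5": ("1", "1"),
    "Claude Sonnet 3.7": ("1", None),
    "Claude Sonnet 3.7 Thinking": ("1.25", None),
    "Claude Sonnet 4": ("1", None),
    "Claude Opus 4": ("10", None),
    "Gemini 2.0 Flash": ("0.25", "1"),
    "Gemini 2.5 Pro": ("1", None),
    "o1": ("10", None),
    "o3": ("1", None),
    "o3-mini": ("0.33", "1"),
    "o4-mini": ("0.33", None),
}


def premium_requests(model: str, interactions: int = 1, free_plan: bool = False) -> Decimal:
    """Premium requests consumed by `interactions` uses of `model`."""
    paid, free = MULTIPLIERS[model]
    if free_plan:
        if free is None:
            raise ValueError(f"{model} is not available on Copilot Free")
        return Decimal(free) * interactions
    return Decimal(paid) * interactions
```

With this sketch, `premium_requests("GPT-4.5")` gives 50, `premium_requests("GPT-4.1", free_plan=True)` gives 1, and `premium_requests("GPT-4.1")` gives 0, matching the three examples. Three o3-mini interactions on a paid plan come to 0.99 premium requests.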