Understanding and managing requests in Copilot

Learn about requests in Copilot, including premium requests, how they work, and how to manage your usage effectively.

Important

  • Billing for premium requests began on June 18, 2025 for all paid Copilot plans. Request counters were reset to zero for paid plans only.
  • Premium request counters reset on the 1st of each month. See Monitoring your Copilot usage and entitlements.
  • Certain requests may experience rate limits to accommodate high demand. Rate limits restrict the number of requests that can be made within a specific time period.

What is a request?

A request is any interaction where you ask Copilot to do something for you—whether it’s generating code, answering a question, or helping you through an extension. Each time you send a prompt in a chat window or trigger a response from Copilot, you’re making a request.

What are premium requests?

Some Copilot features use more advanced processing power and count as premium requests. The number of premium requests a feature consumes can vary depending on the feature and the AI model used.

Premium features

The following Copilot features can use premium requests:

How do request allowances work per plan?

If you use Copilot Free, your plan comes with up to 2,000 code completion requests and up to 50 premium requests per month. All chat interactions count as premium requests.

If you're on a paid plan, you get unlimited code completions and unlimited chat interactions using the included models (GPT-4.1 and GPT-4o). Rate limiting is in place to accommodate high demand. See Rate limits for GitHub Copilot.

Paid plans also receive a monthly allowance of premium requests, which can be used for advanced chat interactions, code completions using premium models, and other premium features. For an overview of the number of premium requests included in each plan, see Plans for GitHub Copilot.

What happens to unused requests at the end of the month?

Unused requests do not carry over to the following month.

What if I run out of premium requests?

Note

Additional premium requests are not available to:

  • Users on Copilot Free. To access more premium requests, upgrade to a paid plan.
  • Users who subscribe, or have subscribed, to Copilot Pro or Copilot Pro+ through GitHub Mobile on iOS or Android.

If you're on a paid plan and use all of your premium requests, you can still use Copilot with one of the included models for the rest of the month. This is subject to change. Response times for the included models may vary during periods of high usage. Requests to the included models may be subject to rate limiting. See Rate limits for GitHub Copilot.

If you need more premium requests beyond your monthly allowance, you can:

  • Set a spending limit for additional premium requests. See Preventing overspending.
  • Upgrade to a higher plan.

These actions can be taken by organization owners, billing managers, and personal account users.

Important

By default, all budgets are set to zero, and premium requests over the allowance are rejected unless a budget has been created. Additional premium requests beyond your plan’s included amount are billed at $0.04 USD per request.
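
The budget rule and the per-request price above can be worked through with a little arithmetic. The following is a minimal sketch, not GitHub's billing logic: the function name `settle_overage`, the whole-number request counts, and the idea that a budget acts as a hard cap in USD are all assumptions made for illustration. The allowance of 300 in the usage note is also a made-up figure.

```python
from decimal import Decimal

# Price per additional premium request, in USD (taken from the text above).
PRICE_PER_EXTRA_REQUEST = Decimal("0.04")


def settle_overage(used: int, allowance: int, budget_usd: Decimal = Decimal("0")):
    """Return (billed, rejected, cost_usd) for one month of usage.

    Assumption: the budget is a hard cap in USD, and requests that the
    budget cannot cover are rejected. The default budget is zero.
    """
    extra = max(0, used - allowance)
    # Number of extra requests the budget can pay for.
    affordable = int(budget_usd // PRICE_PER_EXTRA_REQUEST)
    billed = min(extra, affordable)
    rejected = extra - billed
    return billed, rejected, billed * PRICE_PER_EXTRA_REQUEST


# Hypothetical allowance of 300 with 350 requests attempted.
print(settle_overage(350, 300))                   # default budget of zero
print(settle_overage(350, 300, Decimal("1.00")))  # a 1.00 USD budget
```

With the default zero budget, every request past the allowance is rejected and nothing is billed. With a budget in place, requests are billed until the budget is used up.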

Model multipliers

The available models vary depending on your Copilot plan. See Plans for GitHub Copilot.

Note

The models included with Copilot plans are subject to change.

Each model has a premium request multiplier, based on its complexity and resource usage. If you are on a paid Copilot plan, your premium request allowance is deducted according to this multiplier.

GPT-4.1 and GPT-4o are the included models, and do not consume any premium requests if you are on a paid plan.

If you use Copilot Free, you have access to a limited number of models, and each model will consume one premium request when used. For example, if you make a request using the o3-mini model, your interaction will consume one premium request, not 0.33 premium requests.

Model                      | Multiplier for paid plans | Multiplier for Copilot Free
---------------------------|---------------------------|----------------------------
GPT-4.1                    | 0                         | 1
GPT-4o                     | 0                         | 1
GPT-4.5                    | 50                        | Not applicable
Claude Sonnet 3.5          | 1                         | 1
Claude Sonnet 3.7          | 1                         | Not applicable
Claude Sonnet 3.7 Thinking | 1.25                      | Not applicable
Claude Sonnet 4            | 1                         | Not applicable
Claude Opus 4              | 10                        | Not applicable
Gemini 2.0 Flash           | 0.25                      | 1
Gemini 2.5 Pro             | 1                         | Not applicable
o1                         | 10                        | Not applicable
o3                         | 1                         | Not applicable
o3-mini                    | 0.33                      | 1
o4-mini                    | 0.33                      | Not applicable

Examples of premium request usage

Premium request usage is based on the model’s multiplier and the feature you’re using. For example:

  • Using GPT-4.5 in Copilot Chat: With a 50× multiplier, one interaction counts as 50 premium requests.
  • Using GPT-4.1 on Copilot Free: Each interaction counts as 1 premium request.
  • Using GPT-4.1 on a paid plan: No premium requests are consumed.
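
The examples above can be reproduced with a small lookup. This is an illustrative sketch only: the dictionary holds a subset of the multipliers from the table in this article, and the names `MULTIPLIERS` and `premium_requests_used` are hypothetical, not part of any Copilot API. It also assumes usage is simply the multiplier times the number of interactions.

```python
# (paid multiplier, Copilot Free multiplier); None means "Not applicable".
# Values are a subset of the multiplier table in this article.
MULTIPLIERS = {
    "GPT-4.1": (0, 1),
    "GPT-4o": (0, 1),
    "GPT-4.5": (50, None),
    "Claude Sonnet 3.7 Thinking": (1.25, None),
    "Gemini 2.0 Flash": (0.25, 1),
    "o3-mini": (0.33, 1),
}


def premium_requests_used(model: str, paid_plan: bool, interactions: int = 1) -> float:
    """Premium requests consumed by `interactions` prompts to `model`."""
    paid, free = MULTIPLIERS[model]
    multiplier = paid if paid_plan else free
    if multiplier is None:
        raise ValueError(f"{model} is not available on this plan")
    return multiplier * interactions


print(premium_requests_used("GPT-4.5", paid_plan=True))   # GPT-4.5 in Copilot Chat
print(premium_requests_used("GPT-4.1", paid_plan=False))  # GPT-4.1 on Copilot Free
print(premium_requests_used("GPT-4.1", paid_plan=True))   # GPT-4.1 on a paid plan
```

On Copilot Free the lookup returns one premium request per interaction for every available model, which matches the o3-mini example earlier in this section.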

Footnotes

  1. Copilot coding agent uses a fixed multiplier of 1 for the premium requests it uses, and may use multiple premium requests in response to one user prompt.

  2. Agent mode uses one premium request per user prompt, multiplied by the model's rate.