Rate limits
Rate limiting is a mechanism used to control the number of requests a user or application can make in a given time period. GitHub uses rate limits to ensure everyone has fair access to GitHub Copilot and to protect against abuse.
GitHub enforces rate limits for several reasons.
- Capacity: There is a limited amount of computing power available to serve all Copilot users. Rate limiting helps prevent the system from being overloaded.
- High usage: Popular features and models may receive bursts of requests. Rate limits ensure no single user or group can monopolize these resources.
- Fairness: Rate limits ensure that all users have equitable access to Copilot.
- Abuse mitigation: Without rate limits, malicious actors could exploit Copilot, leading to degraded service for everyone or even denial of service.
Additional usage limits
Your Copilot plan includes an allowance of AI credits. If you exhaust these included credits, you can continue working by either upgrading your plan or setting a budget for Additional usage to access more AI credits. If you exceed the allowed additional usage, you can pay for the additional usage you've already consumed to unlock more and continue.
For more information, see Usage-based billing for individuals.
What to do if you hit a limit
If you receive a limit error when using Copilot, you should:
- Wait and try again. Rate limits are temporary. Often, waiting a short period and trying again resolves the issue.
- Check your usage. If you’re making frequent or automated requests (for example, rapid-fire completions or large-scale usage), consider adjusting your usage pattern.
- Upgrade your plan. If you are on an individual Copilot plan, upgrading your plan will allow for additional usage.