sanjay yadav

Posted on May 19

How to Avoid GitHub Token Rate Limiting Issues | Complete Guide for DevOps Teams

#github #cicd #devops #automation

Introduction

Your CI/CD pipeline was working fine until suddenly every build started failing with a GitHub API 403 error.

I faced this exact issue during a production deployment. Everything looked fine—no code changes, no infrastructure issues—but pipelines kept failing.

At first, we thought it was a bug in the pipeline, but nothing pointed to the actual issue.

After debugging for hours, it became clear that the problem was not the code, but GitHub API rate limiting.

If you are facing GitHub API rate limit exceeded errors in CI/CD pipelines, this guide will help you fix them effectively.

Quick Fix (TL;DR for busy DevOps engineers): Use authenticated tokens, reduce unnecessary API calls, implement caching, and switch to GitHub Apps for scalable systems.

What is GitHub API Rate Limiting?

GitHub API rate limiting restricts how many API requests you can make within a specific time window. This ensures fair usage and prevents abuse.

In real-world DevOps workflows, this limit can quickly become a bottleneck.

Why GitHub API Rate Limit Errors Happen in CI/CD

In most DevOps setups, pipelines frequently interact with GitHub APIs:

Fetching repositories
Triggering workflows
Checking build statuses
Managing pull requests

Common causes include:

Frequent API polling
Multiple services making requests at the same time
Unauthenticated API usage

This is where GitHub API rate limit exceeded errors typically occur.

GitHub API Rate Limits Explained (Token Types)

Unauthenticated Requests: 60 requests per hour
Personal Access Token (PAT): 5000 requests per hour
GitHub Actions Token (GITHUB_TOKEN): 1,000 requests per hour per repository

Using authenticated requests significantly increases your available limits.
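
To see how quickly polling eats into these quotas, the arithmetic can be sketched in a few lines. This is a rough illustration only: the helper name `min_poll_interval_seconds` and the `reserve` parameter are assumptions made for this example, not part of any GitHub tooling, and the inputs are the hourly limits listed above.

```python
import math

def min_poll_interval_seconds(limit_per_hour, pollers, reserve=0.2):
    """Smallest safe polling interval (whole seconds) when `pollers` jobs
    share one hourly quota and a `reserve` fraction is kept for other calls.

    Hypothetical helper for illustration; not a GitHub API.
    """
    usable = limit_per_hour * (1 - reserve)
    if usable <= 0 or pollers <= 0:
        raise ValueError("need a positive quota and at least one poller")
    # Together the pollers may spend `usable` requests per 3600 seconds
    return math.ceil(3600 * pollers / usable)

if __name__ == "__main__":
    print(min_poll_interval_seconds(60, 1))     # unauthenticated, one poller
    print(min_poll_interval_seconds(1000, 10))  # Actions token, ten jobs
    print(min_poll_interval_seconds(5000, 10))  # PAT, ten jobs
```

If the interval this suggests is longer than your pipeline can tolerate, that is usually a sign to replace polling with webhooks or to cache results.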

Token Management Strategies

Use GitHub Apps Instead of Personal Tokens
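GitHub Apps provide higher rate limits and better security. Installation tokens start at 5,000 requests per hour and can scale higher for installations with many repositories or users. They are also short-lived and limited to the permissions granted to the app, so they are not tied to one person's account.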

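# .github/workflows/deploy.yml
name: Deploy with GitHub App
on:
  push:
    branches: [main]

jobs:
  deploy:
    runs-on: ubuntu-latest
    steps:
      # Installation tokens expire after one hour, so create one per run.
      # APP_ID and APP_PRIVATE_KEY are placeholder names; secret names
      # cannot start with the GITHUB_ prefix.
      - name: Generate GitHub App token
        id: app-token
        uses: actions/create-github-app-token@v1
        with:
          app-id: ${{ vars.APP_ID }}
          private-key: ${{ secrets.APP_PRIVATE_KEY }}

      - name: Checkout
        uses: actions/checkout@v4
        with:
          token: ${{ steps.app-token.outputs.token }}

      - name: Deploy
        run: |
          # Your deployment logic here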

Implement Token Rotation

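Rotate tokens regularly to limit the damage from a leaked credential. Rotation alone does not raise your quota: all personal access tokens that belong to the same user share one rate limit.

#!/bin/bash
# token-rotation.sh
# Usage: ./token-rotation.sh <old-token> <new-token>
set -euo pipefail
OLD_TOKEN=$1  # not used here; revoke it once the new token is in place
NEW_TOKEN=$2
# Update the secret in the repository.
# Secret names cannot start with GITHUB_, so use a custom name.
gh secret set DEPLOY_TOKEN --body "$NEW_TOKEN"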

Use Repository-Specific Tokens

# Different tokens for different purposes

env:
  DEPLOY_TOKEN: ${{ secrets.DEPLOY_TOKEN }}
  NOTIFY_TOKEN: ${{ secrets.NOTIFY_TOKEN }}
  BACKUP_TOKEN: ${{ secrets.BACKUP_TOKEN }}

Rate Limit Handling in Code

Implement Exponential Backoff

This function retries API calls when rate limits are hit.

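import time
import random
from functools import wraps

import requests

class RateLimitError(Exception):
    """Raised when GitHub reports that a rate limit was hit."""
    def __init__(self, retry_after=None):
        super().__init__("rate limit exceeded")
        self.retry_after = retry_after

def retry_with_backoff(max_retries=3, base_delay=1):
    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(max_retries):
                try:
                    return func(*args, **kwargs)
                except RateLimitError as e:
                    if attempt == max_retries - 1:
                        raise
                    if e.retry_after is not None:
                        # The server told us how long to wait
                        delay = e.retry_after
                    else:
                        # Exponential backoff with up to 1s of jitter
                        delay = base_delay * (2 ** attempt) + random.uniform(0, 1)
                    time.sleep(delay)
        return wrapper
    return decorator

@retry_with_backoff(max_retries=5, base_delay=2)
def make_github_request(url, headers):
    response = requests.get(url, headers=headers)
    # GitHub signals rate limiting with 429, or with 403 and no quota left
    limited = response.status_code == 429 or (
        response.status_code == 403
        and response.headers.get('X-RateLimit-Remaining') == '0'
    )
    if limited:
        retry_after = response.headers.get('Retry-After')
        raise RateLimitError(int(retry_after) if retry_after else None)
    return response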
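
To make the retry timing concrete, here is a small sketch that computes the waits an exponential backoff with jitter produces. `backoff_delays` is an illustrative helper introduced for this example, and the `jitter` argument is injectable only so the schedule can be shown without randomness.

```python
import random

def backoff_delays(max_retries, base_delay, jitter=random.uniform):
    """Return the waits (in seconds) between attempts.

    With max_retries attempts there are max_retries - 1 waits; each wait
    doubles the previous base and adds up to one second of random jitter.
    """
    return [
        base_delay * (2 ** attempt) + jitter(0, 1)
        for attempt in range(max_retries - 1)
    ]

if __name__ == "__main__":
    # Jitter disabled to show the deterministic part of the schedule
    print(backoff_delays(5, 2, jitter=lambda low, high: 0))
```

With `max_retries=5` and `base_delay=2`, the deterministic part of the schedule is 2, 4, 8 and 16 seconds, so the last attempt happens roughly half a minute after the first. The jitter keeps parallel jobs from retrying at the same instant.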

Check Rate Limit Headers

Always monitor rate limit headers in your requests:

from datetime import datetime

def check_rate_limit(response):
    remaining = int(response.headers.get('X-RateLimit-Remaining', 0))
    reset_time = int(response.headers.get('X-RateLimit-Reset', 0))

    if remaining < 100:  # Warning threshold
        print(f"Warning: Only {remaining} requests remaining")
        # X-RateLimit-Reset is a UTC epoch timestamp in seconds
        print(f"Rate limit resets at: {datetime.fromtimestamp(reset_time)}")

    return remaining, reset_time
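
The same headers can also tell a job how long to pause instead of failing. The following is a minimal sketch under stated assumptions: `seconds_until_reset` is a hypothetical helper, and `headers` is assumed to be a mapping with exactly these header names (the headers object from `requests` is case-insensitive, a plain dict is not).

```python
import time

def seconds_until_reset(headers, now=None):
    """Seconds to wait before the quota refills, or 0 if requests remain.

    X-RateLimit-Reset is a UTC epoch timestamp in seconds. `now` can be
    passed explicitly to make the function easy to test.
    """
    now = time.time() if now is None else now
    # A missing header is treated as "quota available"
    remaining = int(headers.get("X-RateLimit-Remaining", 1))
    reset_at = int(headers.get("X-RateLimit-Reset", 0))
    if remaining > 0:
        return 0
    return max(0, reset_at - now)
```

A pipeline step could sleep for this many seconds when the value is small, and fail fast with a clear message when the reset is too far away to be worth waiting for.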

GitHub API Rate Limit Architecture (CI/CD Flow)

CI/CD Pipeline Optimization
Batch API Requests
Combine multiple operations into single requests:

# Instead of multiple individual requests
- name: Get PR details
  env:
    GH_TOKEN: ${{ github.token }}  # gh needs a token inside Actions
  run: |
    # Bad: Multiple API calls
    # gh pr view ${{ github.event.pull_request.number }} --json title
    # gh pr view ${{ github.event.pull_request.number }} --json body
    # gh pr view ${{ github.event.pull_request.number }} --json files

    # Good: Single API call
    gh pr view ${{ github.event.pull_request.number }} --json title,body,files

Cache API Responses

Use GitHub Actions cache to reduce API calls:

- name: Cache API response
  uses: actions/cache@v4
  with:
    path: ~/.cache/github-api
    key: ${{ runner.os }}-api-cache-${{ github.sha }}
    restore-keys: |
      ${{ runner.os }}-api-cache-

- name: Use cached data
  env:
    GH_TOKEN: ${{ github.token }}
  run: |
    mkdir -p ~/.cache/github-api
    if [ -f ~/.cache/github-api/data.json ]; then
      echo "Using cached data"
    else
      echo "Fetching fresh data"
      gh api repos/${{ github.repository }}/commits > ~/.cache/github-api/data.json
    fi
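
Caching can also happen at the HTTP level. The GitHub REST API supports conditional requests, and according to GitHub's documentation a `304 Not Modified` response to a correctly authenticated conditional request does not count against the primary rate limit. The sketch below shows the idea; `ETagCache` and the `fetch` callable are hypothetical names for this example, and in real code `fetch` would wrap something like `requests.get`.

```python
class ETagCache:
    """In-memory cache that revalidates entries with If-None-Match.

    Hypothetical helper for illustration, not part of any library.
    """

    def __init__(self, fetch):
        # fetch(url, headers) -> (status_code, response_headers, body)
        self._fetch = fetch
        self._entries = {}  # url -> (etag, body)

    def get(self, url):
        headers = {}
        cached = self._entries.get(url)
        if cached:
            # Ask the server to reply 304 if nothing changed
            headers["If-None-Match"] = cached[0]
        status, resp_headers, body = self._fetch(url, headers)
        if status == 304 and cached:
            return cached[1]  # unchanged on the server, reuse the stored body
        etag = resp_headers.get("ETag")
        if status == 200 and etag:
            self._entries[url] = (etag, body)
        return body
```

This cache lives only for the lifetime of the process. To reuse it across workflow runs, the stored ETags and bodies would need to be written to a directory that the cache step above restores.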

Optimize Workflow Triggers

Reduce unnecessary workflow runs:

# Only run on specific paths

on:
  push:
    branches: [main]
    paths:
      - 'src/**'
      - 'package.json'
      - '.github/workflows/**'

# Skip workflows for draft PRs
# (a job-level condition; the job name "build" is a placeholder)

jobs:
  build:
    if: github.event.pull_request.draft == false
    runs-on: ubuntu-latest

Monitoring and Alerting

Set Up Rate Limit Monitoring

- name: Monitor rate limits
  env:
    GH_TOKEN: ${{ github.token }}
  run: |
    response=$(gh api rate_limit)
    remaining=$(echo "$response" | jq '.rate.remaining')

    if [ "$remaining" -lt 100 ]; then
      echo "::warning::Rate limit low: $remaining requests remaining"
    fi

Create Rate Limit Dashboard

import requests
from datetime import datetime

def send_alert(message):
    # Placeholder: replace with a call to your Slack/Teams webhook
    print(f"ALERT: {message}")

def monitor_rate_limits(token):
    headers = {'Authorization': f'token {token}'}
    response = requests.get('https://api.github.com/rate_limit', headers=headers)
    response.raise_for_status()

    data = response.json()
    rate = data['rate']

    print(f"Remaining: {rate['remaining']}")
    print(f"Reset time: {datetime.fromtimestamp(rate['reset'])}")

    if rate['remaining'] < 100:
        # Send alert to Slack/Teams
        send_alert(f"GitHub rate limit low: {rate['remaining']} remaining")
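
The `/rate_limit` response also contains a `resources` object with separate buckets such as `core`, `search` and `graphql`, each with its own `limit` and `remaining`. A dashboard might want to flag any bucket that is running low, not only the overall rate. Here is a minimal sketch; `low_resources` is a hypothetical helper and the sample payload is made up for illustration.

```python
def low_resources(payload, threshold=0.1):
    """Return {name: remaining} for every resource whose remaining quota
    is below `threshold`, expressed as a fraction of its limit."""
    low = {}
    for name, info in payload.get("resources", {}).items():
        limit = info.get("limit", 0)
        remaining = info.get("remaining", 0)
        # Skip buckets without a limit to avoid dividing by zero
        if limit and remaining / limit < threshold:
            low[name] = remaining
    return low

if __name__ == "__main__":
    # Made-up numbers in the shape of a /rate_limit response
    sample = {
        "resources": {
            "core": {"limit": 5000, "remaining": 120},
            "search": {"limit": 30, "remaining": 30},
        }
    }
    print(low_resources(sample))
```

Using a fraction instead of a fixed count of 100 keeps the check meaningful for small buckets like search, where the whole limit is well under 100.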

GitHub Token Types Comparison

Choosing the right authentication method directly impacts pipeline stability.

What Happens When Rate Limit is Exceeded?
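
According to GitHub's REST API documentation, an exhausted primary limit is reported as a 403 or 429 response with the `x-ratelimit-remaining` header set to 0, while secondary limits also use 403 or 429 and may include a `retry-after` header. Telling these apart from an ordinary permissions error can be sketched as below. `classify_limit` is a hypothetical helper, the check is a heuristic rather than an official rule, and `headers` is assumed to be a mapping with exactly these header names.

```python
def classify_limit(status_code, headers):
    """Return 'primary', 'secondary', or None for a GitHub API response."""
    if status_code not in (403, 429):
        return None
    if headers.get("X-RateLimit-Remaining") == "0":
        # Hourly quota exhausted: wait until X-RateLimit-Reset
        return "primary"
    if "Retry-After" in headers:
        # Secondary limit: wait the number of seconds in Retry-After
        return "secondary"
    # Some other 403, for example a token without the needed permissions
    return None
```

A pipeline that logs this classification next to the failing request makes it much faster to confirm whether a 403 is really a rate limit problem.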

Best Practices Summary

Use GitHub Apps for higher rate limits
Implement exponential backoff
Monitor API headers
Cache API responses
Batch API requests
Optimize workflows
Rotate tokens regularly

If you are working with cloud and DevOps setups, these guides may help:

https://www.kubeblogs.com/k3s-vs-kubernetes/
https://www.kubeblogs.com/aws-t2-vs-t3-vs-t4g/
https://www.kubeblogs.com/aws-gp2-vs-gp3/
https://www.kubeblogs.com/s3-security-best-practices/

FAQ:

What is GitHub API rate limiting?

GitHub API rate limiting restricts how many API requests you can make within a defined time window.

Why do I get a 403 error in GitHub API?

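You get a 403 (or sometimes 429) error when you exceed your API rate limit. Unauthenticated requests hit this sooner because they are limited to 60 requests per hour. A 403 can also mean the token lacks the required permissions, so check the response message.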

How can I fix GitHub API rate limit exceeded errors?

Use authenticated tokens, reduce unnecessary API calls, and implement caching strategies.

Can GitHub Actions hit rate limits?

Yes, GitHub Actions can hit rate limits, especially in workflows that make frequent API calls.

Conclusion

GitHub API rate limiting is a common issue in DevOps workflows, but it is predictable once you understand how it works.

By using authenticated tokens, reducing API calls, implementing caching, and leveraging GitHub Apps, you can prevent unexpected failures in your CI/CD pipelines.

Handling API limits properly is essential for building stable and scalable automation systems.

DEV Community