Karthikeyan

Posted on Feb 14

DevFlow: AI-Powered Git Workflow Automation with GitHub Copilot CLI

#devchallenge #githubchallenge #cli #githubcopilot

GitHub Copilot CLI Challenge Submission

This is a submission for the GitHub Copilot CLI Challenge

What I Built

I built DevFlow - a CLI tool that automates your entire Git workflow using GitHub Copilot CLI as its AI brain. Think of it as your AI pair programmer that handles the boring parts of Git so you can focus on writing code.

DevFlow does three things really well:

Generates commit messages from your changes using AI (no more "fix stuff" commits)
Creates PR descriptions automatically by analyzing your commits
Names branches semantically from GitHub issues

The tool was born from a simple frustration: I spend way too much time crafting commit messages and PR descriptions. I wanted to automate this, but every existing tool either used generic templates or required me to write the message myself anyway. So I built DevFlow to actually understand my code changes and generate meaningful descriptions.

Install it:

npm install -g git-devflow

Links:

🔗 GitHub: https://github.com/karthik-zoro-96/devflow-cli
📦 npm: https://www.npmjs.com/package/git-devflow

Demo

1. Smart Commit Messages

Make your changes, run devflow commit, and get 3 AI-generated conventional commit messages to choose from. Copilot analyzes your git diff and generates proper feat:/fix:/chore: messages.

2. Automated PR Descriptions

Running devflow pr create analyzes your commits, auto-pushes your branch, and creates a PR with a properly structured description including summary, changes, and testing notes.

3. Branch Naming from Issues

devflow branch create --issue 7 fetches the GitHub issue and generates a semantic branch name like feature/7-add-changelog-generation.

Core Features:

🤖 AI-powered commit message generation
📝 Automatic PR description creation
🌿 Branch naming from GitHub issues or descriptions
⚙️ AI model selection (Sonnet 4.5 / Haiku 4.5 / GPT-4.1)
🔐 Secure token storage with 600 file permissions
🚀 Auto-push branches before PR creation
💰 Smart quota management with automatic free model fallback

My Experience with GitHub Copilot CLI

Building DevFlow taught me that integrating with Copilot CLI isn't just about calling an API - it's about handling edge cases, quota limits, and shell escaping nightmares. Here's what I learned:

The Architecture: Copilot CLI as the Product

Most people use Copilot CLI during development. I used it as the product itself. Every DevFlow command pipes data to Copilot CLI and parses the response:

const diff = await git.diff();
const response = await copilot.generateCommitMessage(diff);
const commits = parseResponse(response);

Simple architecture, complex execution.

Challenge #1: Shell Escaping Hell

My first attempt at calling Copilot CLI looked innocent:

execSync(`copilot -p "${prompt}"`)

This broke immediately. Why? Git diffs contain backticks, quotes, dollar signs, and every special character bash can interpret. The prompt would either fail to parse or, worse, execute unintended commands.

First attempt at fixing: escape everything manually. Failed.

Second attempt: write prompts to temp files. Better, but cluttered with file I/O.

Final solution: Use execFileAsync with an args array instead of shell execution:

const args: string[] = [];
if (model) args.push('--model', model);
args.push('-s', '--prompt', prompt);

await execFileAsync('copilot', args, {
  maxBuffer: 1024 * 1024,
  timeout: 60000,
});

This passes arguments directly to the Copilot binary without any shell interpretation. No escaping needed. Problem solved.

Lesson: When calling external CLIs with user data, never use shell execution. Always use the args array approach.

Challenge #2: SSH Keys and Custom Hosts

I have two GitHub accounts (work and personal) with different SSH keys. My personal SSH config uses a custom host alias:

Host github-personal
  HostName github.com
  IdentityFile ~/.ssh/id_ed25519_personal

My git remote: git@github-personal:user/repo.git

DevFlow's initial regex for parsing remotes: git@github.com:user/repo.git

Result: DevFlow couldn't find my repository!

Every time I tried to create a PR, it failed with "Not a GitHub repository". I spent hours debugging before realizing the regex only matched the standard GitHub hostname.

The fix:

// Handle ANY SSH host, not just github.com
const sshMatch = url.match(/git@([^:]+):(.+?)\/(.+?)(\.git)?$/);
if (sshMatch) {
  return {
    owner: sshMatch[2],
    repo: sshMatch[3]
  };
}

Lesson: Never assume standard configurations. Developers customize everything.

Challenge #3: Quota Management

Here's what nobody tells you about building with Copilot CLI: you'll hit quota limits constantly during development.

Each test run consumed quota. I burned through my monthly allowance in two days of testing. When users hit limits, the tool would crash with cryptic errors.

My solution: Automatic fallback to free models

async execWithQuotaRetry(prompt: string): Promise<string> {
  const model = config.getCopilotModel(); // e.g., claude-sonnet-4.5

  try {
    return await this.execCopilotPrompt(prompt, model);
  } catch (error) {
    // Detect quota errors
    if (this.isQuotaError(error)) {
      console.log('⚠️  Premium request limit reached');
      console.log('   Retrying with gpt-4.1 (free)...');

      // Automatically retry with free model
      return await this.execCopilotPrompt(prompt, 'gpt-4.1');
    }
    throw error;
  }
}

When the premium model fails due to quota, DevFlow automatically retries with GPT-4.1 (free tier). Users get a gentle warning and keep working. No crashes.

I also added model cost visibility:

🤖 Model: claude-sonnet-4.5  Balanced  ·  1 premium request(s) per prompt

Users can see exactly what they're spending before each request.

Lesson: Plan for quota limits from day one. Free model fallbacks aren't nice-to-have, they're essential.

Challenge #4: Parsing AI Responses

Copilot CLI doesn't return structured JSON - it returns natural language text. Parsing this reliably is harder than it sounds.

For commit messages, I asked for this format:

1. feat: add user authentication
2. fix: resolve login bug  
3. chore: update dependencies

What I got back varied wildly:

Sometimes numbered lists
Sometimes markdown lists with -
Sometimes just sentences
Sometimes with bold markdown **

My solution: Multiple parsing strategies

parseCommitMessages(response: string): string[] {
  const lines = response.split('\n');

  for (const line of lines) {
    // Try pattern 1: "1. message"
    let match = line.match(/^\d+[\.)]\s*(.+)$/);
    if (match) messages.push(stripMarkdown(match[1]));

    // Try pattern 2: Direct conventional format
    match = line.match(/^(feat|fix|chore)(\(.+?\))?: .+/i);
    if (match) messages.push(stripMarkdown(line));

    // Try pattern 3: "- message"
    match = line.match(/^[-*]\s+(.+)$/);
    if (match) messages.push(stripMarkdown(match[1]));
  }
}

Lesson: AI output is unpredictable. Always have fallback parsing strategies.

Challenge #5: Silent Failures

Copilot CLI sometimes fails with zero error output. The process just exits with code 1 and nothing in stderr.

This was a nightmare to debug. Users would see "unknown error" with no explanation.

My solution: Read Copilot's log files

private readLatestCopilotLog(): string | undefined {
  const logDir = join(homedir(), '.copilot', 'logs');
  const files = readdirSync(logDir)
    .filter(f => f.startsWith('process-'))
    .sort()
    .reverse();

  const content = readFileSync(join(logDir, files[0]), 'utf-8');
  const errorLines = content
    .split('\n')
    .filter(line => /\[ERROR\]/.test(line));

  // Return the actual error message
  return errorLines[errorLines.length - 1];
}

When Copilot fails silently, DevFlow reads the most recent log file and extracts the actual error. This revealed authentication issues, model availability problems, and network errors that stderr didn't show.

Lesson: When stderr is empty, check log files. They often contain the real error.

What Went Right

Despite all the challenges, some things worked beautifully:

1. Copilot's natural language understanding

I didn't need to fine-tune prompts extensively. "Generate 3 conventional commit messages" worked on the first try. The quality was genuinely impressive.

2. The execFile approach

Once I switched from shell execution to direct args passing, 90% of my bugs disappeared.

3. Fallback strategies

Having rule-based fallbacks meant DevFlow never crashes. When AI fails, it falls back gracefully.

4. Model cost transparency

Showing users exactly what each request costs builds trust and helps them manage quota.

The Stats

4 days to build
15 quota limit errors encountered and fixed
3 complete rewrites of the SSH URL parser
~600 lines of TypeScript
100% uptime with fallbacks (never crashes)

What I'd Do Differently

1. Start with execFile from day one

Don't waste time on shell escaping. Go straight to the args array approach.

2. Build quota monitoring earlier

I added model cost display as an afterthought. Should have been there from the start.

3. Test with weird SSH configs immediately

I only discovered the custom host issue when deploying. Should have tested edge cases earlier.

4. Read Copilot CLI docs twice

I missed useful flags and features because I skimmed the docs. RTFM saves time.

The Magic Moment

Despite all the debugging pain, there was one moment that made it all worth it:

I ran devflow pr create on a messy feature branch with 8 commits. Copilot analyzed them and generated this:

Title: Add comprehensive README with features, setup, and usage guide

Summary:
Complete documentation overhaul transforming the minimal README into 
a comprehensive guide with branding, detailed instructions, and complete 
feature documentation.

Changes:
- Added DevFlow logo, tagline, and GitHub Copilot CLI Challenge badge
- Documented all 5 core features (commits, PRs, branches, config, auto-push)
- Added npm installation command and interactive setup instructions
- Comprehensive examples for all commands
- Explained how DevFlow uses Copilot CLI as its AI engine

It was perfect. Better than I would have written myself. That's when I knew this tool was actually useful.

Final Thoughts

GitHub Copilot CLI is excellent for natural language generation tasks. It's less excellent when you try to pipe complex, special-character-filled text through bash. But with proper error handling, quota management, and fallback strategies, you can build genuinely useful tools on top of it.

The future of CLI tools is AI-powered. DevFlow is just the beginning.

Try DevFlow: npm install -g git-devflow

Built with ❤️ using GitHub Copilot CLI.

DEV Community