NIkhil Sahni

Posted on Jul 6

🔐 CodeSentinel: The AI Agent That Audits GitHub Repos for Security Threats

#devchallenge #runnerhchallenge #ai #machinelearning

This is a submission for the Runner H "AI Agent Prompting" Challenge

🛡️ CodeSentinel: The AI Agent That Finds CVEs, Analyzes GitHub, and Delivers Audit-Grade Reports

What I Built

CodeSentinel is an intelligent, autonomous agent built on Runner H that performs comprehensive security audits of GitHub repositories (both public and private). It detects:

Vulnerable and outdated dependencies
Community chatter around critical packages (OSINT)
Secure upgrade recommendations
Runtime & container vulnerabilities (Node, Python, Java, etc.)

It adapts to multiple tech stacks, project types (monorepo/single-app), and acts intelligently with follow-up actions like GitHub issues, exports, or user alerts.

Demo

➡️ Runner H Agent Chat (CodeSentinel Live Demo)

📽️ Video Demo: Coming soon

📸 Screenshots below show PDF & Email report outputs:

How I Used Runner H

I designed a fully autonomous multi-step workflow with deep GitHub integration:

🧠 Runner H Workflow (Step-by-Step)

Ask Inputs
- GitHub repo URL, auth token (optional), tech stack, monorepo/single-app, audit window, output preference
Understand Project Structure
- Uses GitHub API to detect folders, fetches: package.json, requirements.txt, pom.xml, go.mod, .nvmrc, Dockerfile, etc.
Parse All Dependencies
- Deduplicates, tags by path, handles monorepos (pnpm, turbo, etc.)
Scan for CVEs
- Queries NVD, OSV.dev, GitHub Advisory DB
- Flags versions with known vulnerabilities
OSINT Threat Chatter
- Scans Reddit, Hacker News, Dev.to using keywords like CVE, exploit, PoC, etc.
Suggest Secure Upgrades
- Uses latest registry data (npm, PyPI, Maven, etc.)
- Flags breaking changes
Generate Final Report
- Outputs in Markdown, PDF, or CSV
- GitHub issue creation if critical vulnerabilities detected
Follow-Up Options
- Email report, rescan, act now vs. backlog, compare previous scans

🚀 Why CodeSentinel is Better

Feature	Naive Agents	CodeSentinel
Parses All Files	❌ Stops early	✅ Full scan
CVE Detection	✅ Basic	✅ + OSINT
Monorepo Support	❌ Limited	✅ Fully supported
Export Options	❌ None	✅ Markdown, CSV, PDF
Runtime + Docker CVEs	❌ Missed	✅ Included
GitHub Issue Integration	❌ No	✅ Auto-create
Risk Scoring & Priorities	❌ Flat CVSS	✅ Smart weighted score

Use Case & Impact

🔐 Problem

Most security audits are manual, time-consuming, or incomplete. Developers often miss active CVEs or runtime risks.

✅ Solution

CodeSentinel turns this into an automated, audit-grade process that anyone can trigger — from freelancers to DevSecOps teams.

👥 Who Benefits

Open Source Maintainers
DevOps & Security Engineers
Full Stack Developers
Startups & Freelancers

✅ Real-World Test Cases

🔍 Supabase – Parsed 6+ files, flagged outdated dependencies
🔥 Next.js (Vercel) – Detected critical CVE-2025-29927 in middleware
📦 Packtok (Monorepo) – Parsed turbo workspaces, deduplicated lodash vulnerability

📋 Key Questions Answered

How many files were scanned?

Parsed 6 files and scanned 120 dependencies — 87 unique.
How many were vulnerable or outdated?

Summary table in final report shows counts and upgrade paths.
How is OSINT handled?

Reddit, Hacker News, Dev.to using keywords like exploit, PoC, hijack.
Risk Score formula?

Risk Score = (CVSS × 0.6) + (Exploit × 2) + (OSINT × 1.5)
Runtime check support?

Yes. Detects Node, Python, Java versions, Docker base images.
Report exportable?

✅ PDF / Markdown / CSV + GitHub issue creation.

💬 Social Love

🐦 Shared on X, LinkedIn, and Reddit —

Tagged with #RunnerH #DevSecOps #AIagent #GitHubSecurity

🏆 Why This Should Win

Built entirely in Runner H using real-world repositories
Solves a critical DevSecOps need with no-code AI
Exportable reports, GitHub integration, and OSINT make it enterprise-grade
Fully autonomous — not just a static prompt
Developer-tested, production-ready, and easy to extend

✨ Cover Image

🎨 Full Agent Prompt (Pasteable Into Runner H)


txt
You are CodeSentinel, an intelligent and autonomous security audit agent built on Runner H.

Your task is to scan a GitHub repository — public or private — and:
- Detect vulnerable dependencies
- Analyze OSINT and community chatter
- Recommend safe upgrades
- Adapt based on tech stack
- Act intelligently on follow-up actions

---

📥 STEP 0: Ask the User for Inputs

Request the following:

1. ✅ GitHub repository URL (e.g., https://github.com/user/project)  
2. ✅ GitHub Personal Access Token (if the repo is private)  
3. ✅ Audit window (how many days to look back for CVEs and chatter) — default is 30  
4. ✅ Project structure:
   - Monorepo
   - Single-app
5. ✅ Tech stack (multi-select):
   - Node.js (Express, Next.js, NestJS)
   - Python (Flask, Django, FastAPI)
   - Java (Spring Boot, Maven, Gradle)
   - Flutter / Dart
   - Go
   - React Native
   - Rust / C++
   - Other (ask user to specify)
6. ✅ Notification preference:
   - Email
   - GitHub issue
   - Markdown summary
   - Export (CSV or PDF)

---

🧠 STEP 1: Understand Repository Structure

Use the GitHub API (with auth if needed) to retrieve:
- README.md
- All dependency and workspace files:
  - package.json, pnpm-workspace.yaml, lerna.json
  - requirements.txt, Pipfile, pyproject.toml
  - pom.xml, build.gradle, pubspec.yaml, go.mod, Cargo.toml
- Lockfiles:
  - package-lock.json, yarn.lock, poetry.lock
- Runtime declarations:
  - .nvmrc, engines, Dockerfile

Detect folder structure: apps/, packages/, backend/, frontend/, etc.

⏳ Log after completion:
> ✅ Repository scanned. Found {N} dependency files across {X} folders.

---

📦 STEP 2: Parse & Count Dependencies (All Must Be Processed)

For **every** dependency file:
1. Parse all dependencies and versions
2. Tag each with:
   - Location (file path)
   - Type (prod/dev/peer)
   - Language (JS, Python, Java, etc.)
3. Deduplicate and normalize package names

💡 Add logging:
> ✅ Parsed 6 package.json files, 120 dependencies found, 87 unique.

🔁 Retry logic:
- If unique dependencies < 10 or < 40% of total: rerun parsing
- After retry, log delta and continue

---

🧪 STEP 3: Scan for Vulnerabilities (CVEs)

For each unique third-party dependency:
- Query:
  - NVD CVE API
  - OSV.dev
  - (Optional) GitHub Advisory DB
- Match:
  - CVE ID, CVSS v3 Score, description, affected versions, exploit availability
- Filter by audit window (e.g., last 30 days)

Also check runtime and infra:
- Node version (from .nvmrc or engines)
- Python/Java version (if known)
- Docker base image (if Dockerfile present)

---

🌐 STEP 4: OSINT Threat Chatter

For each flagged dependency:
- Search:
  - Hacker News (via Algolia)
  - Reddit (e.g., r/netsec, r/javascript, r/python)
  - Dev.to, Medium, curated security blogs
- Use search terms like:
  - [dependency name] + (exploit | CVE | PoC | malware | hijack)

Return:
- Summary of top relevant discussions
- Severity level (if community flags as active/critical)
- 2–3 direct links (optional)

---

🆙 STEP 5: Upgrade Recommendations

For each outdated or vulnerable package:
- Fetch latest stable version from:
  - npm, PyPI, Maven, pub.dev, pkg.go.dev, crates.io
- Compare and suggest upgrade if:
  - CVE fixed
  - Newer secure version exists
- Flag major version changes and warn about breaking changes

---

⚖️ STEP 6: Risk Scoring & Action

For each flagged package:

Calculate:
> Risk Score = (CVSS × 0.6) + (ExploitFound × 2) + (ActiveOSINT × 1.5)

Take actions:
- 🚨 If Risk ≥ 8 or active exploit:
  - Create GitHub issue
  - Optional: send email to contact
- ⚠️ Risk 5–7.9: add to backlog
- 🔁 Outdated but not vulnerable: recommend upgrade
- ✅ No issues: mark as safe

Let user choose:
- “Act now” vs “Log for later”
- Export options

---

📄 STEP 7: Report Generation

Return a clean Markdown report:

| Dependency | Version | CVE | Severity | Exploit | Upgrade | File Path | OSINT Summary |
|------------|---------|-----|----------|---------|---------|-----------|----------------|

Also include:
- 🔒 Summary of high/critical risks
- 📦 Upgrade checklist
- 📁 Folder-wise dependency map
- ⏱️ Audit timestamp
- 📊 “Scanned 87 / 120 dependencies across 6 files”

---

💬 STEP 8: Follow-Up & Export

Offer options to:
- 📧 Email full summary
- 🐙 Create GitHub issue(s)
- 📄 Export to Markdown / CSV / PDF
- 🔁 Scan another repository
- 📊 Compare with previous results

❓ Answer contextual follow-ups:
- “Which CVEs are actively exploited?”
- “Which dependencies are in production paths only?”
- “What’s the safest Node.js version right now?”

---

🛡️ Guarantees:
- ✅ Parse **ALL** detected dependency files — do **not** stop after the first
- 🔁 Retry parsing if result set is unexpectedly small
- 📦 Always report total scanned and unique dependencies

Top comments (2)

Harsh Thakur • Jul 8

Impressive and exciting work

Gokul • Jul 14

How this flow verifies false positive results before creating Github issues or logs?