DEV Community: Abdelrahman Farag

We analysed 396 breaking dependency releases. Here's what they have in common.

Abdelrahman Farag — Fri, 22 May 2026 03:01:14 +0000

Renovate opened 23 upgrade PRs in your repo this week. One of them will break your CI. You just don't know which one yet.

That's the problem DepCast solves.

The idea

Most dependency tooling tells you what changed — semver, changelogs, release notes. None of it tells you how risky that change is before you merge.

DepCast computes a Compatibility Risk Score (CRS) for any npm release and rates it SAFE / WAIT / AVOID:

CRS = w₁·V(r) + w₂·E(r) + w₃·D(t) + w₄·H(m)

Signal	What it measures
V(r)	API surface volatility — fraction of exported symbols removed
E(r)	Downstream exposure — normalised weekly download count
D(t)	Observed failures — GitHub issues opened in first 24h post-release
H(m)	Maintainer history — SIR propagation R₀ from prior releases

Try it in 30 seconds

npx depcast-check --package chalk --version 5.0.0

Output:

DepCast CRS Check
-----------------------------------------------
Package:  chalk@5.0.0  (prior: 4.1.2)
-----------------------------------------------
V(r):  0.000  [....................]  API volatility       pattern C
E(r):  0.611  [############........]  Downstream exposure  (439M weekly downloads)
D(t):  0.000  [....................]  Observed failures    (0 issues/24h)
H(m):  0.030  [#...................]  Maintainer history   (R0=1.162)
-----------------------------------------------
CRS:   0.186   SAFE
-----------------------------------------------
Recommendation: Release looks safe. Proceed with publish.

With a GitHub token you get the live D(t) propagation signal:

npx depcast-check \
  --package moment \
  --version 2.0.0 \
  --github-token $GITHUB_TOKEN

Drop it into GitHub Actions

- name: DepCast compatibility risk check
  run: |
    npx depcast-check \
      --package ${{ env.PACKAGE_NAME }} \
      --version ${{ env.PACKAGE_VERSION }} \
      --threshold 0.60 \
      --fail-on avoid \
      --github-token ${{ secrets.GITHUB_TOKEN }}

Exit code 1 blocks the publish. Exit code 0 lets it through. No config files. No setup. One workflow step.

The research behind the score

The weights aren't guessed — they come from an empirical study of 396 confirmed breaking releases across three ecosystems, fitted via logistic regression.

Validation: AUC-ROC = 0.853 on a held-out set of 346 releases (306 breaking + 40 non-breaking controls).

Cross-ecosystem R₀ comparison

We fitted a SIR epidemiological propagation model to every release, measuring how fast a breaking change spreads through the dependency graph.

Ecosystem	n	R₀ median	Pattern C	First issue (median)
npm	306	1.44	62.4%	1.0h
PyPI	25	0.99	24.0%	1.1h
pub.dev	25	10.3	72.0%	1.1h

PyPI is sub-critical (R₀ < 1): breakage is self-limiting. The pip ecosystem's explicit pinning culture absorbs shocks.
npm is moderately super-critical: one breaking release propagates to ~1.4× as many downstream consumers each generation.
pub.dev is massively super-critical (R₀ ≈ 10): Flutter's null-safety migration forced a simultaneous ecosystem-wide update wave, creating a near-perfect propagation storm.

The Pattern C finding

Here's the surprising one: 62.4% of breaking npm releases have V(r) = 0.

That means the API surface looks identical between versions — no exported symbols removed, no type signature changes. Pure semver-based tooling scores these as safe. They are not.

These are Pattern C releases: breaking changes hidden in behaviour, not surface. The only signal that catches them is D(t) — the live issue rate after the release goes out. This is why the propagation signal matters, and why we built the aggregator.

The top AVOID-rated releases in the dataset

Package	Version	CRS	Why
glob	9.0.0	0.631	Complete API surface removed + high issue rate
semver	7.0.0	0.622	718M weekly downloads — highest E(r) in dataset
moment	2.0.0	0.580	D(t)=0.395 — 39.5% Dependabot rejection rate

The Renovate integration

If you use Renovate, you can run DepCast as a consumer-side gate — scoring every upgrade PR Renovate opens before you merge it.

A PR to add DepCast to the Renovate community tools list is currently open: renovatebot/renovate#43563.

The consumer gate composite action is in packages/depcast-consumer/ — it labels PRs with the CRS rating and blocks AVOID-rated upgrades before they reach your main branch.

What's next

depcast-check@1.0.0 is live on npm — install it now.
The full paper (7 pages, IEEE format) is on Zenodo: 10.5281/zenodo.20361569
Submitting to MSR 2027 (Mining Software Repositories).
Phase 5.6: reaching 500 opt-in repos to build a statistically significant live D(t) signal. If you add the consumer gate to your repo, you help make the model better for everyone.

GitHub: github.com/ahafarag/depcast
npm: npm install -g depcast-check
Preprint: doi.org/10.5281/zenodo.20361569

Antigravity 2.0 CLI Through the Eyes of a DevOps Engineer — What Google I/O 2026 Means for Infrastructure Automation

Abdelrahman Farag — Fri, 22 May 2026 02:29:16 +0000

This is a submission for the Google I/O Writing Challenge

Google I/O 2026 was, by any measure, an AI-agents showcase. Gemini 3.5 Flash, Gemini Omni, intelligent eyewear — the keynote was packed. But the announcement that made me sit up straight wasn't a model release or a consumer gadget. It was a developer tool: Antigravity 2.0, and specifically its new CLI.

I'm a Cloud and DevOps engineer. I spend my days inside AWS — writing Ansible playbooks, rotating IAM keys, remediating CVEs across fleets of EC2 instances, debugging Lambda permissions, and wiring up WAF logging compliance. My terminal is my office. So when Google announced a CLI that can orchestrate autonomous coding agents from the command line, integrate into CI/CD pipelines, and share the same agent harness as the full desktop IDE — it landed differently for me than it might for a frontend developer watching the keynote for Flutter updates.

This article is my honest first-look assessment: what Antigravity CLI actually offers, why it matters for infrastructure work, where the gaps are, and whether it changes anything for engineers already deep in the AWS ecosystem.

What Google Actually Shipped

Let me break down what Antigravity 2.0 is, because the branding is doing a lot of heavy lifting.

The original Antigravity launched in late 2025 as an AI-first IDE — essentially a VS Code fork with Gemini baked in. Version 2.0, announced at I/O on May 19, is no longer a single product. It's now a platform with five surfaces:

Antigravity 2.0 Desktop — a standalone app (not the old IDE) built around agent orchestration, with dynamic subagents, scheduled tasks, and voice commands
Antigravity CLI — a terminal-native tool rewritten in Go, designed for automation and CI pipeline integration
Antigravity SDK — programmatic access to the agent harness for custom agent behaviors on your own infrastructure
Managed Agents in the Gemini API — server-side agent execution in isolated Linux environments, billed per-run
Gemini Enterprise Agent Platform — the enterprise deployment path through Google Cloud

The CLI replaces the older Gemini CLI (Google is encouraging migration), and it preserves key capabilities: Agent Skills, Hooks, Subagents, and Extensions (now called plugins).

Underneath it all, the default model is Gemini 3.5 Flash — which scored 76.2% on Terminal-Bench 2.1, 1656 Elo on GDPval-AA (a real-world agentic benchmark), and 83.6% on MCP Atlas for tool-use reliability. Google claims it's 4x faster than comparable frontier models and often at less than half the cost ($1.50 input / $9.00 output per million tokens).

Why This Matters for DevOps — Not Just App Development

Most coverage of Antigravity 2.0 focuses on app development: scaffolding a React app, generating unit tests, vibe-coding a web UI. Fair enough — that's who Google was pitching to in the keynote. But the architectural decisions here have deeper implications for infrastructure engineers.

The CLI Is Built for Pipelines, Not Just Humans

The fact that the CLI shares the same agent harness as the desktop app is the key detail. This means agents running in your terminal have the same capabilities as agents in the GUI — subagent spawning, parallel execution, tool use — but wrapped in a surface designed for scripted automation.

For a DevOps engineer, this immediately suggests use cases:

CVE remediation workflows: Imagine an agent that reads a vulnerability scan report, identifies affected packages across your fleet, generates the patching commands per OS (AL2, AL2023, Ubuntu, RHEL all need different approaches), and stages the Ansible playbook. Today I do this manually, cross-referencing CVE databases, checking package managers, testing in staging. An agentic CLI that can spawn subagents per OS family and work in parallel would compress a multi-day task into hours.
IAM policy auditing: Feed the agent your CloudTrail logs and current IAM policies, ask it to identify over-permissioned roles and generate least-privilege replacements. The parallel subagent architecture means it could analyze multiple roles simultaneously.
Incident investigation: When a production batch job fails — say, a missing file that should have been generated by an upstream process — the agent could trace the execution path across CloudWatch logs, check cron configurations, and identify the parameter mismatch. This is the kind of multi-step, multi-source investigation that eats up entire afternoons.

Scheduled Tasks Turn Agents Into Persistent Automation

Antigravity 2.0 introduces scheduled tasks — agents that run on cron schedules or at fixed times, without manual prompting. This moves AI agents from "interactive assistant" to "background automation," which is exactly the abstraction DevOps engineers already think in.

Imagine scheduling an agent to run nightly that checks your ECR image scan results, compares against your vulnerability thresholds, and opens tickets (or even PRs) for anything that needs attention. That's not science fiction — that's a cron job with an LLM in the loop.

The Honest Assessment: Where I'm Skeptical

Here's where I stop being excited and start being a DevOps engineer.

Trust and Auditability

In my current work, every change to production infrastructure goes through a documented, auditable process. When I remediate a penetration test finding — say, removing a webshell from a Tomcat server — I document exactly what I did, why, and what the before/after state looks like. The output is a formal Word document in a specific security template.

Agentic AI that "independently navigates complex tasks" is exactly the kind of thing that makes security auditors nervous. The fundamental question isn't "can the agent do it?" but "can I prove what the agent did, and why, and that nothing else was touched?" Google mentions "hardened Git policies" and "credential masking," but the audit trail story for agent-executed infrastructure changes is still immature across the industry, not just for Google.

The AWS-Shaped Elephant in the Room

Antigravity is deeply integrated into Google's ecosystem: Google AI Studio, Firebase, Cloud Run, Android Studio. For someone already working in Google Cloud, the idea-to-production flow is cohesive.

But I work in AWS. My infrastructure is EC2, Lambda, S3, RDS, CloudWatch, WAF. My CI/CD runs through GitLab. My configuration management is Ansible via SSM.

The Antigravity CLI is model-agnostic to a degree — it supports Claude Sonnet 4.5 and GPT-OSS alongside Gemini — but the integrations are Google-first. There's no native connection to AWS CloudFormation, no SSM integration, no understanding of IAM policy syntax out of the box.

This isn't a dealbreaker. The SDK surface and plugin architecture could theoretically be extended to AWS tooling. But "could theoretically" and "works today in production" are separated by a lot of engineering effort. For now, the value proposition is strongest for teams already in or willing to move into Google's orbit.

Cost at Scale

The new AI Ultra plan is $100/month with 5x the Pro limits. Ultra Premium is $200/month with 20x limits. Managed Agents bill per run, not per token, and Google warns that long-running agents can get expensive.

For a solo DevOps engineer or small team, the math works if the agent saves even a few hours per week. But for an enterprise running dozens of agents across multiple pipelines? The cost model needs careful evaluation, especially since Managed Agents abstract away the token-level granularity that lets you optimize spend.

What I'd Actually Try First

If I were to integrate Antigravity CLI into my workflow today, I'd start narrow:

Documentation generation: I already spend significant time writing remediation documents — describing findings, listing steps taken, documenting before/after states. An agent that watches my terminal session, understands the context from the CVE or pentest finding, and drafts the compliance document would be immediately valuable and low-risk (I'd review everything before submission).
Policy analysis: Feed it an IAM policy JSON and ask it to identify violations of least-privilege principles, cross-referenced against actual CloudTrail usage. This is read-only analysis, so the blast radius of a mistake is zero.
Runbook generation: Convert my mental models for incident response into structured runbooks. The agent can ask clarifying questions, identify gaps, and produce something that a junior engineer could actually follow.

Notice what's missing: I wouldn't let it execute infrastructure changes unsupervised. Not yet. Maybe not for a long time. The agent is most valuable to me as an accelerator for the cognitive work around infrastructure — the analysis, the documentation, the planning — rather than as an autonomous executor.

The Bigger Picture: Agents Are Coming to Infrastructure Whether We're Ready or Not

Google isn't the only one here. Anthropic has Claude Code. Amazon has their own agentic coding tools. The entire industry is converging on the idea that AI agents should be able to plan, code, test, and deploy software with minimal human intervention.

For DevOps engineers, this creates an interesting tension. Our entire discipline exists because "move fast and break things" doesn't work when the things you break are production databases and customer-facing services. We introduced CI/CD, infrastructure as code, and policy-as-code precisely to add guardrails to the deployment process.

Now the industry wants to put AI agents inside those guardrails. The question isn't whether this will happen — it will. The question is whether the tooling will mature fast enough to earn the trust of the people responsible for keeping systems running.

Antigravity 2.0 is a serious step. The CLI-first surface, the parallel agent architecture, the scheduled task capability, and the enterprise deployment path show that Google is thinking about production workflows, not just demos. But the integration depth outside Google's ecosystem, the audit story, and the cost model all need more time.

My Verdict

Antigravity 2.0 CLI is the most DevOps-relevant announcement from Google I/O 2026. Not because it solves my problems today, but because it's the first time a major platform has shipped an agent orchestration tool that speaks the language of infrastructure automation: terminals, pipelines, scheduled execution, and programmatic SDKs.

If you're already in Google Cloud, this is worth trying immediately. If you're in AWS like me, keep a close eye on the SDK and plugin ecosystem — the architecture is right, even if the integrations aren't there yet.

The era of agentic DevOps is arriving. The engineers who learn to supervise AI agents effectively — knowing when to delegate and when to intervene — will have a significant advantage. Antigravity 2.0 is one of the first tools purpose-built for that transition.

Abdelrahman Farag is an AWS Cloud & DevOps Engineer at Sopra Steria and an MSc candidate in AI Research at UIMP. He works across cloud infrastructure, security remediation, and applied AI.

How I set up a production AWS server in 5 minutes with one bash script

Abdelrahman Farag — Fri, 03 Apr 2026 04:58:18 +0000

Every time I spin up a new EC2 instance I used to spend hours configuring the same things: Docker, Nginx, SSL, firewall. After 18 years as a DevOps engineer I finally automated all of it.
One command:
sudo bash setup.sh yourdomain.com you@email.com

It installs Docker, configures Nginx as reverse proxy, gets a Let's Encrypt SSL certificate with auto-renewal, and locks down the firewall. Works on Ubuntu 20.04 and 22.04.
I packaged it here if you want it: https://faragman31.gumroad.com/l/sgckzk
Happy to answer any questions in the comments.