DEV Community

# agents

Posts

OpenAI Operator scores 43% on hard web tasks. We scored 81%. Here are all 300 runs.

1
Comments
4 min read
Browser-action receipts are not logs

Comments
5 min read
A plugin for Observability + Budget Guardrails built with Hermes Agent

Hermes Agent Challenge Submission: Build With Hermes Agent

2
Comments 1
4 min read
Nexus

Hermes Agent Challenge Submission: Build With Hermes Agent

4
Comments
2 min read
From Blender Demos to Agent Toolchains: Why Terminal Skills Matter

Comments
6 min read
Project Mentor AI: Leveraging Hermes Agent for Autonomous Research, Planning, and Architecture Design

Hermes Agent Challenge Submission: Build With Hermes Agent

1
Comments
3 min read
AI and programming: A double-edged sword

Comments
2 min read
Context engineering is an architecture strategy, not a model swap

Comments
6 min read
How I Stopped My Support Agent From Having Amnesia

Comments
3 min read
We benchmarked an 84% token reduction. Then we open sourced the protocol.

Comments
5 min read
Hermes Agent's Brain: How Its Skills & Memory System Actually Works

Hermes Agent Challenge Submission: Write About Hermes Agent

3
Comments
6 min read
Why your AI agent keeps hallucinating (and how data testing fixes it)

Comments
4 min read
HealthHermes: A Private AI Health Companion That Remembers Everything and Runs on Your Own Machine

Hermes Agent Challenge Submission: Build With Hermes Agent

3
Comments
6 min read
How the itrstats tax assistant works: one query, every layer

Comments
10 min read
One Open Source Project a Day (No. 69): Academic Research Skills - A Full-Pipeline AI Agent Suite for Academic Research

Comments
10 min read