DEV Community: The_resa

Building an AI Agent Skill for Multi-Database Literature Collection

The_resa — Thu, 07 May 2026 01:50:13 +0000

Literature search is one of the most foundational steps in medical research. Before protocol design, evidence synthesis, peer review, or manuscript writing, researchers must first identify the right body of evidence.

A weak literature search process creates downstream problems: missing key studies, incomplete evidence mapping, duplicated screening effort, and poor reproducibility in review workflows. This becomes even more critical in biomedical research, where studies are distributed across multiple databases such as PubMed, CrossRef, OpenAlex, Semantic Scholar, and specialized repositories.

The Multi-Database Literature Collector's core task is simple but essential:

Build a cross-database candidate literature pool for a biomedical topic, clinical question, translational problem, method query, or research-planning need.

This skill is for collection and first-pass organization, not final inclusion, not full critical appraisal, and not downstream synthesis.

Typical use cases

cross-database literature collection
search strategy construction
candidate paper aggregation
first-pass evidence organization before deduplication, screening, layered reading, or review planning.
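As a rough illustration of what a cross-database candidate pool might look like, the sketch below groups hits from several sources by identifier and records which database returned each one. It is not the skill's actual implementation: the `Candidate` class, the `build_pool` function, and the record fields (`title`, `doi`) are hypothetical names chosen for this example, and real deduplication would still happen downstream.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Candidate:
    """One paper in the candidate pool, with every database that returned it."""
    title: str
    doi: str | None
    sources: set[str] = field(default_factory=set)


def normalize_doi(doi: str | None) -> str | None:
    """Lowercase a DOI and strip a resolver prefix so the same paper matches across sources."""
    if not doi:
        return None
    doi = doi.strip().lower()
    for prefix in ("https://doi.org/", "http://doi.org/", "doi:"):
        if doi.startswith(prefix):
            doi = doi[len(prefix):]
    return doi or None


def build_pool(results: dict[str, list[dict]]) -> list[Candidate]:
    """Group per-database hits into one pool, keeping provenance for later steps."""
    pool: dict[str, Candidate] = {}
    for source, records in results.items():
        for record in records:
            doi = normalize_doi(record.get("doi"))
            # Records without a DOI fall back to a lowercased title key (an assumption).
            key = doi or "title:" + record["title"].strip().lower()
            candidate = pool.setdefault(key, Candidate(record["title"], doi))
            candidate.sources.add(source)
    return list(pool.values())
```

Keeping the `sources` set on each candidate preserves where a record came from, which is useful when the pool is later handed to screening or deduplication.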

Watch the Skill Demo

To better understand how this Literature Search Skill works in real workflows, you can watch the demonstration video below.

All content is provided for research purposes only and is not intended for clinical use or medical advice. Any medical text or data shown in this video is for demonstration purposes only.

You can also explore the full skill documentation here:

Explore More AIPOCH Medical Research Skills

AIPOCH provides a curated collection of Medical Research Agent Skills designed for medical research workflows across:

Evidence Insights
Protocol Design
Data Analysis
Academic Writing

Rather than isolated prompts, these skills are built for structured execution and reproducibility.

You can explore the full repository here:

GitHub Repository:
AIPOCH medical-research-skills

Skill Collection:
awesome-medical-research-skills

If you find this repository useful, consider giving it a star! ⭐ It helps more researchers discover Medical Research Agent Skills and supports the continued development of this library.

A Structured Comparison of 2 Disease Mechanism Agent Skills

The_resa — Wed, 29 Apr 2026 02:11:31 +0000

Originally published on AIPOCH

As AI-powered medical research workflows become more common, agent skills are increasingly used to support disease mechanism analysis.

However, even when two skills are designed for the same task, their quality can differ significantly depending on workflow structure, evidence handling, and execution design.

To better understand these differences, I compared two disease mechanism agent skills using MedSkillAudit, a standardized framework for evaluating medical research agent skills.

AIPOCH disease-mechanism-evidence-map

The Agent Skill Description: Systematically maps mechanism evidence for a disease from molecules to pathways, cell types, tissues, biological consequences, and clinical phenotypes. Always use this skill when a user needs a layered mechanism evidence chain rather than a flat summary or immediate gap analysis. Formal literature citations must be real and verifiable.

FreedomAI tooluniverse-multiomic-disease-characterization

The Agent Skill Description: Comprehensive multi-omics disease characterization integrating genomics, transcriptomics, proteomics, pathway, and therapeutic layers for systems-level understanding. Produces a detailed multi-omics report with quantitative confidence scoring, cross-layer gene concordance analysis, biomarker candidates, therapeutic opportunities, and mechanistic hypotheses.

How We Evaluated the Agent Skills

This comparison is not based on opinions or isolated examples.

We used AIPOCH MedSkillAudit, an evaluation framework designed to assess agent skills.

Both agent skills were tested under identical conditions: evaluated according to the standardized settings defined by MedSkillAudit.

Core Capability Section Results Analysis

The Core Capability section of MedSkillAudit is a static quality evaluation of the agent skill itself. At this stage, MedSkillAudit evaluates how well the agent skill is designed.

It covers eight key dimensions:

Functional Suitability
Reliability
Performance & Context
Agent Usability
Human Usability
Security
Maintainability
Agent-Specific Capability

In this comparison, AIPOCH disease-mechanism-evidence-map achieved a Core Capability score of 87/100, while FreedomAI tooluniverse-multiomic-disease-characterization scored 80/100.

Although both skills show strong design quality, the scoring reveals clear differences in how each skill is structured.

AIPOCH performs particularly well in:

Functional Suitability (92% vs 83%)
Reliability (75% vs 67%)
Performance & Context (88% vs 63%)
Agent Usability (88% vs 81%)
Human Usability (88% vs 75%)
Maintainability (92% vs 83%)

Both skills perform equally well in:

Security (100% vs 100%)
Agent-Specific Capability (80% vs 80%)

One of the most noticeable differences in the Core Capability evaluation appears in the Reliability dimension.
For AIPOCH disease-mechanism-evidence-map, the literature verification requirement is strong, but there is one gap: no behavior is defined for the case where no verifiable citations are available at all.
For FreedomAI tooluniverse-multiomic-disease-characterization, rich fallback sections exist, but deterministic handling of ambiguity and section completeness is uneven.

Medical Task Section Results Analysis

The Dynamic Evaluation section of MedSkillAudit measures how an agent skill performs during actual task execution.

In this benchmark:

AIPOCH disease-mechanism-evidence-map passed 24/25
FreedomAI tooluniverse-multiomic-disease-characterization passed 20/25

Final Score Comparison

AIPOCH disease-mechanism-evidence-map scored 86/100
FreedomAI tooluniverse-multiomic-disease-characterization scored 77/100

This 9-point difference suggests that AIPOCH demonstrates stronger overall agent skill quality across both design architecture and execution performance.

In other words, AIPOCH is not only better structured at the skill level, but also performs more consistently when applied to real disease mechanism research tasks.

It is important to note that a higher score does not mean one skill should replace the other. The most suitable choice depends on specific research goals, task requirements, workflow preferences, and other practical considerations.

Use Case: Disease Mechanism Evidence Map

The AIPOCH disease-mechanism-evidence-map skill is designed to systematically map mechanism evidence for a disease from molecules to pathways, cell types, tissues, biological consequences, and clinical phenotypes.

Primary Use Cases:

Rapid understanding of disease mechanism architecture.
Mechanism hypothesis building before study design.
Disease introduction / discussion framework construction.
Mechanism-oriented evidence synthesis before gap analysis.
Mechanism-chain inspection for translational thinking.

If you would like to explore more details of this skill, you can visit the AIPOCH Disease Mechanism Evidence Map Skill Page.

Explore More AIPOCH Medical Research Agent Skills

You can explore more medical research skills in the AIPOCH Agent Skills Collection or access implementation details through the AIPOCH GitHub Repository.

Disclaimer

This AI-assisted article is provided for informational and research purposes only and does not constitute medical advice, clinical guidance, diagnostic recommendations, treatment decisions, publication acceptance recommendations, or formal scientific peer review decisions.

The comparisons and analysis presented in this article are based on standardized evaluation results from MedSkillAudit and are intended as structured references for understanding agent skill quality. They should not replace independent judgment from qualified researchers, reviewers, editors, clinicians, or healthcare professionals.

I Tested 2 Peer Review Agent Skills for Medical Research — Here’s What Happened

The_resa — Tue, 28 Apr 2026 09:49:21 +0000

Originally published on AIPOCH

Peer review is one of the slowest and least standardized parts of academic research. I wanted to test whether open-source AI agent skills could actually help.

In this article, I compare two peer review agent skills:

AIPOCH Peer Review Skill

The Agent Skill Description: Conduct professional peer reviews for papers or theses, providing structured evaluations and improvement suggestions; use when you need a pre-submission assessment, an internal review, or academic quality control.

K-Dense Peer Review Skill

The Agent Skill Description: Structured manuscript/grant review with checklist-based evaluation. Use when writing formal peer reviews with specific criteria methodology assessment, statistical validity, reporting standards compliance (CONSORT/STROBE), and constructive feedback. Best for actual review writing, manuscript revision. For evaluating claims/evidence quality use scientific-critical-thinking; for quantitative scoring frameworks use scholar-evaluation.

How We Evaluated the Agent Skills

This comparison is not based on opinions or isolated examples.

We used AIPOCH MedSkillAudit, an evaluation framework designed to assess agent skills.

Both agent skills were tested under identical conditions: evaluated according to the standardized settings defined by MedSkillAudit.

Core Capability Section Results Analysis

The Core Capability section of MedSkillAudit is a static quality evaluation of the agent skill itself. At this stage, MedSkillAudit evaluates how well the agent skill is designed.

It covers eight key dimensions:

Functional Suitability
Reliability
Performance & Context
Agent Usability
Human Usability
Security
Maintainability
Agent-Specific Capability

In this comparison, AIPOCH Peer Review Skill achieved a Core Capability score of 84/100, while K-Dense Peer Review Skill scored 80/100.

Although both peer review skills show strong design quality, the scoring reveals meaningful differences in how each skill is architected.

AIPOCH performs particularly well in:

Functional Suitability (92% vs 83%)
Reliability (75% vs 67%)
Performance & Context (88% vs 75%)
Agent Usability (88% vs 81%)
Human Usability (100% vs 88%)

K-Dense performs strongly in:

Maintainability (83% vs 75%)

Medical Task Section Results Analysis

The Dynamic Evaluation section of MedSkillAudit measures how an agent skill performs during actual task execution.

In this benchmark:

AIPOCH Peer Review Skill passed 20/20
K-Dense Peer Review Skill passed 17/20

This result shows that AIPOCH demonstrates stronger runtime consistency and broader scenario coverage across real peer review workflows.

Final Score Comparison

AIPOCH Peer Review Skill scored 86/100
K-Dense Peer Review Skill scored 75/100

This 11-point difference suggests that AIPOCH demonstrates stronger overall agent skill quality across both design architecture and execution performance.

In other words, AIPOCH is not only better structured at the skill level, but also performs more effectively when applied to real peer review tasks.

It is important to note that a higher score does not mean one skill should replace the other, and the best choice depends on the reviewer’s goals, manuscript type, and review stage.

Use Case: AIPOCH Peer Review Skill Overview

To help researchers better understand how the AIPOCH Peer Review Skill works, we created a brief demonstration video that showcases its overall workflow and practical academic use cases.

Watch here:

All content is provided for research purposes only and is not intended for clinical use or medical advice. Any medical text or data shown in this video is for demonstration purposes only.

The AIPOCH Peer Review Skill is designed to conduct professional peer reviews for papers or theses, providing structured evaluations and improvement suggestions. It is especially useful when researchers need a pre-submission assessment, an internal review, or academic quality control.

Peer Review Agent Skill Key Features

Its key features include:

Structured end-to-end review workflow: Overall evaluation → methods/results check → issue organization → recommendation.
Major vs. minor issue triage: Separates publication-blocking problems from polish-level improvements.
Actionable revision suggestions: Each issue is paired with concrete steps to fix or strengthen the work.
Recommendation with rationale: Clear accept/revise/reject guidance with reasons and improvement path.
Reusable templates and checklists: Supports consistent formatting and comprehensive coverage (see referenced files).

This makes it particularly valuable for:

Pre-submission manuscript check: Before submitting to a journal/conference to identify major risks and revision priorities.
Internal lab/group review: For advisor or team quality control prior to external dissemination.
Thesis/dissertation evaluation: To assess academic rigor, structure, and defensibility before committee review.
Revision planning after feedback: To translate reviewer/editor comments into an actionable improvement roadmap.
Quality assurance for research outputs: To ensure methods, reporting, and conclusions meet disciplinary standards.

This practical workflow is also one of the key reasons why AIPOCH performs strongly in MedSkillAudit’s Dynamic Evaluation section.

If you would like to explore the detailed MedSkillAudit evaluation of this skill—including Core Capability, Dynamic Evaluation, and the Final Overall Score—you can view the full assessment on the AIPOCH Peer Review Skill Evaluation Report.

Explore more AIPOCH Agent Skills

You can explore more workflow-focused research skills in the AIPOCH Agent Skills Collection or access implementation details through the AIPOCH GitHub Repository.

Disclaimer

This content is provided for informational purposes only and does not constitute medical advice, clinical guidance, publication acceptance recommendations, or formal peer review decisions. MedSkillAudit is designed to evaluate the quality of agent skills, not to replace expert judgment from qualified researchers, reviewers, editors, or healthcare professionals. Users should independently verify all academic, methodological, and clinical conclusions before making research, publication, or medical decisions. Any reliance on this content is at the user’s own discretion and risk.

Stop Using Generic Prompts: 102 Medical Research Agent Skills for Smarter Research Workflows

The_resa — Mon, 27 Apr 2026 03:26:29 +0000

Originally published on AIPOCH

Discover AIPOCH Awesome Med Research Skills — a curated collection of 102 medical research agent skills built for literature review, study design, data analysis, academic writing, and more. Designed for researchers who want structured AI workflows instead of generic prompts.

Today, we’re launching AIPOCH Awesome Med Research Skills — a curated collection of 102 high-quality medical research Agent Skills, each designed with embedded professional research logic.

You can explore the Awesome Med Research Skills here: AIPOCH Open-Source Repository on GitHub

If you find it useful, consider giving it a ⭐ to support the project!

What is Awesome Med Research Skills?

Awesome Med Research Skills is a growing collection of specialized agent skills built specifically for medical research scenarios. It currently includes 102 high-quality skills.

Awesome Med Research Skills aims to help researchers more effectively organize questions, connect evidence, and advance research.

How These Skills Work

Instead of relying on generic prompts, we encode professional medical research logic into these agent skills:

Literature authenticity constraints: Implementing hard rules
Research type identification: We first determine the study type, then execute different logical pathways
Medical-specific prompt logic


Key Features of Awesome Med Research Skills

Modular Skill Architecture for Team Scaling

Skills are composable, replaceable, and extensible, suitable for both individual use and team collaboration
Can be assembled from single-task execution to multi-step workflow pipelines

Built for Real Medical Research Scenarios

Covers real workflows: topic selection, literature search, study design, writing, graphical abstracts, and more.
Not adapted from generic content templates — designed specifically for medical research contexts.

What’s Next

This release includes 102 curated skills, but this is just the beginning. We’re continuing to expand the collection.

Try It

If you’re working in medical research and want a more structured way to use AI agents, try Awesome Med Research Skills here.

Explore More AI Agent Skills

Researchers and AI agents can explore the growing library of medical research agent skills through multiple resources:

Open-Source Repository on GitHub
AIPOCH Medical Research Agent Skills List – Browse all skills organized by category, from Evidence Insights to Academic Writing.
Full Agent Skills Overview – Learn about the purpose, workflow integration, and capabilities of each skill in detail.

Agent Skills Explained: From Prompts to Structured AI Workflows

The_resa — Mon, 20 Apr 2026 02:38:50 +0000

Agent Skills are modular, reusable units of procedural knowledge that allow AI agents to perform specific tasks.

People hear “AI agent” and assume it’s already a complete system. Then they hear “agent skills” and things get fuzzy—are these tools? prompts? plugins?
The short answer: agent skills are structured capabilities that help AI agents do specific tasks.

According to a DataCamp article, agent skills are “portable, self-contained units of domain knowledge and procedural logic” that define how to perform workflows, not just what to know.
Similarly, Spring AI describes them as “modular folders of instructions, scripts, and resources that AI agents can discover and load on demand.”

Instead of giving an AI a long messy instruction every time, you package the instructions into a reusable “mini-workflow” it can run anytime.

So if we strip away the buzzwords, a consistent pattern appears:

👉 Agent Skills = structured workflows + reusable logic + optional code/tools

Why Agent Skills Exist

Prompts are great… until they aren’t.
If you’ve ever written a long prompt and noticed the AI ignored half of it, you’ve already seen the limitation.

As prompts grow larger and more complex, important instructions can get diluted inside the model’s context window, leading to inconsistent behavior.

Agent skills address this by:

Keeping logic modular
Loading only when needed
Separating concerns (instead of one giant prompt blob)

According to Microsoft documentation, skills are:

Advertised to the agent
Loaded only when relevant
Expanded with additional resources as needed

This architectural idea is often called “progressive disclosure.”
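A minimal sketch of progressive disclosure, assuming a skill is represented by a short advertised description plus a loader for its full instructions. The `Skill` class, the `load_body` callable, and the `advertise` helper are hypothetical names for illustration and do not reflect any particular framework's API.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Skill:
    name: str
    description: str               # short text that is always advertised to the agent
    load_body: Callable[[], str]   # fetches the full instructions only on demand
    _body: Optional[str] = None    # cached instructions after the first load

    def body(self) -> str:
        """Load the full instructions the first time they are needed, then reuse them."""
        if self._body is None:
            self._body = self.load_body()
        return self._body


def advertise(skills: List[Skill]) -> List[str]:
    """Return only the lightweight one-line summaries, without loading any skill body."""
    return [f"{s.name}: {s.description}" for s in skills]
```

Only the one-line summaries occupy context up front; the longer body is read when a task actually calls for that skill.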

How Agent Skills Actually Work

Let’s walk through a simplified version of how they function:

The agent receives a task
It evaluates what kind of work is needed
It selects a matching skill
It loads that skill
It executes the workflow
It returns a structured result

The important idea here is selective loading. The AI is not guessing everything from scratch each time—it’s using prebuilt procedures. That’s why agent systems feel more stable than raw prompting.
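The six steps above can be sketched as a small dispatcher. This is a simplified illustration under stated assumptions: real agents typically let the model choose a skill from its description, whereas this example swaps in plain keyword overlap, and the `SKILLS` table and workflow functions are hypothetical.

```python
def summarize(task: str) -> str:
    """Hypothetical workflow for summary-style tasks."""
    return f"summary of: {task}"


def screen(task: str) -> str:
    """Hypothetical workflow for screening-style tasks."""
    return f"screening plan for: {task}"


# Each skill maps to (trigger keywords, workflow). Both entries are made up for this sketch.
SKILLS = {
    "literature-summary": ({"summarize", "summary"}, summarize),
    "study-screening": ({"screen", "screening"}, screen),
}


def run_task(task: str, skills=SKILLS) -> dict:
    # Steps 1-2: receive the task and evaluate what kind of work it needs.
    words = set(task.lower().split())
    # Step 3: select the skill whose keywords overlap the task the most.
    best_name, best_overlap = None, 0
    for name, (keywords, _) in skills.items():
        overlap = len(words & keywords)
        if overlap > best_overlap:
            best_name, best_overlap = name, overlap
    if best_name is None:
        return {"skill": None, "output": None}
    # Steps 4-5: load the selected skill and execute its workflow.
    _, workflow = skills[best_name]
    # Step 6: return a structured result.
    return {"skill": best_name, "output": workflow(task)}
```

Calling `run_task("Summarize this trial report")` selects the hypothetical `literature-summary` skill, while a task with no matching keywords returns a result with `skill` set to `None`.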

Where to Find Agent Skills

If you look around, you’ll notice there isn’t just one place. The ecosystem is still forming, and it’s a bit fragmented, somewhat like the early days of app stores. A lot of skills live on platforms like GitHub, where developers publish reusable workflows for others to try. However, many agent skill collections are broad rather than specialized. They cover many use cases, but not always with deep domain precision.

So what if you need domain-specific agent skills?

AIPOCH for Medical Research Agent Skills

If you’re specifically looking for medical research agent skills, that’s exactly where AIPOCH comes in.
As explained in this introduction to AIPOCH, AIPOCH offers a curated library of Medical Research Agent Skills built around medical research workflows—things like:

evidence insights
protocol design
data analysis
academic writing

You can explore the complete agent skills library here:
👉 Medical Research Skills Github Repo

⭐ If you find this repository useful, consider giving it a star! It helps more researchers discover Medical Research Agent Skills and supports the continued development of this library.

Instead of asking an AI to figure everything out from scratch, you’re using predefined research processes that are already structured and consistent. That makes a difference.

FAQs

Are there agent skills for medical research?

Yes, there are specialized agent skills designed for medical research workflows. AIPOCH provides curated medical research agent skills for tasks such as evidence extraction, protocol design, data analysis, and academic writing. These skills are designed to support more structured and consistent research workflows, which can be helpful when working with complex scientific tasks.

What are AI agent skills?

Agent skills are modular, reusable workflows that allow AI agents to perform specific tasks in a structured and consistent way.

How do agent skills work in practice?

Agent skills work by being selected and loaded by an AI agent when needed. The agent evaluates a task, chooses the relevant skill, executes its predefined workflow, and returns structured results. This makes outputs more reliable compared to raw prompting.

What are some real-world agent skills examples?

Common agent skills examples include data analysis workflows, literature review summarization, content generation, and academic writing support.

Where can I find agent skills or an agent skills hub?

You can find agent skills in open-source repositories (such as GitHub), developer platforms, and curated libraries often referred to as agent skills hubs. These hubs provide reusable workflows for different domains, from general automation to specialized fields like medical research.

Disclaimer
This content is for informational purposes only. It does not constitute medical, clinical, or professional advice. AIPOCH Medical Research Agent Skills are designed to support research workflows. They are not intended to replace professional judgment, clinical decision-making, or peer-reviewed validation. AI-generated outputs may be incomplete or inaccurate and should be independently verified before use. References to third-party tools, platforms, or publications are provided for context only and do not imply endorsement or affiliation.

Hermes Agent: Why does it feel different from other agents

The_resa — Thu, 16 Apr 2026 08:10:07 +0000

Originally published on AIPOCH

If you’ve been browsing GitHub, AI forums, or even a few niche newsletters lately, you’ve probably seen the name Hermes Agent pop up more than once.

What Is Hermes Agent?

According to the official documentation, Hermes Agent is:

“The self-improving AI agent built by Nous Research. The agent with a built-in learning loop — it creates skills from experience, improves them during use, nudges itself to persist knowledge, and builds a deepening model of who you are across sessions.”
(Source: Nous Research Hermes Agent docs)

Let's translate that into something more grounded:

Unlike traditional AI tools, it includes a built-in learning loop that allows it to turn past tasks into reusable skills, refine those skills over time, and retain knowledge across sessions. As it continues to operate, Hermes gradually builds a more personalized understanding of the user, making its behavior more context-aware and efficient.

It keeps memory
It learns by doing
It accumulates context and experience

What's The Core Idea Of Hermes Agent?

The Core Idea of Hermes Agent is "A Self-Improving Loop".

This is where Hermes starts to separate itself.

The official concept is simple but powerful:

When Hermes completes a task, it can convert that solution into a reusable “skill.”
(Source: Hermes documentation + feature overview)

So instead of repeating work, it builds a library of experience.

Think of it like this:

First time → slow, exploratory
Second time → faster
Third time → almost automatic

That loop looks like:

Receive a task
Execute using tools
Save solution as a skill
Reuse skill later

This “skill system” is explicitly documented as a core mechanism of Hermes Agent.
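The loop above can be illustrated with a toy skill library. This is only a conceptual sketch, not how Hermes Agent is implemented: the `SkillLibrary` class, its `handle` method, and the `explore` callback are hypothetical names, and a "skill" here is simply a saved Python callable.

```python
class SkillLibrary:
    """Toy model of the loop: solve once, save the procedure, reuse it next time."""

    def __init__(self):
        self._skills = {}  # task kind -> saved procedure

    def handle(self, kind, payload, explore):
        """Run a task, returning (result, reused)."""
        procedure = self._skills.get(kind)
        reused = procedure is not None
        if not reused:
            # First time: slow, exploratory path that works out how to do the task.
            procedure = explore(kind)
            # Save the solution as a skill so later tasks of this kind skip exploration.
            self._skills[kind] = procedure
        return procedure(payload), reused
```

The second call with the same task kind never invokes `explore`, which is the "second time is faster" behavior described above.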

Why It Feels Different From Other AI Agents

The memory system of Hermes Agent is one of its standout features.
Hermes doesn’t just “remember conversations”—it builds a long-term understanding of your work, preferences, and past tasks.

And yes, that changes everything.

According to the Hermes Agent documentation, Hermes uses structured files like:

MEMORY.md → agent's personal notes
USER.md → user-specific information

These files act as the foundation of long-term knowledge. The agent’s memory files have a character limit.
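To show what a character limit implies for a memory file, here is a small sketch of one possible policy: refuse a new entry when it would push the total past the cap. Both the policy and the `char_limit` value are assumptions made for illustration; the source does not state how Hermes Agent enforces its limit or what the limit is.

```python
def add_entry(entries: list, entry: str, char_limit: int) -> bool:
    """Append an entry only if the total character count stays within the cap."""
    used = sum(len(e) for e in entries)
    if used + len(entry) > char_limit:
        # Over budget: the caller must shorten, merge, or drop something first.
        return False
    entries.append(entry)
    return True
```

A cap like this forces memory to stay curated rather than growing without bound.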

Why Hermes Agent Memory Matters

Hermes remembers:

Preferences
Workflows
Past decisions

So you don’t have to explain things again.

It gets better with use

The more you use Hermes:

The more it learns
The more it adapts
The more efficient it becomes

It enables real automation

Without memory, automation breaks easily.

With memory, Hermes can:

Maintain consistency
Reuse past solutions
Improve task execution

It supports personalization at scale

Hermes Agent + AIPOCH Medical Agent Skills

AIPOCH Medical Agent Skills can work with Hermes Agent.

Hermes Agent brings persistence, memory, and a built-in learning loop—it can execute tasks, refine its approach over time, and retain what it learns across sessions. On the other side, AIPOCH provides Medical Research Agent Skills designed for medical research workflows, like Evidence Insights, Protocol Design, Data Analysis, and Academic Writing.

Hermes gives you a learning agent.
AIPOCH gives it medical research agent skills.

These capabilities are designed to support research workflows, and outputs should always be reviewed before real-world use.

Explore AIPOCH Medical Research Agent Skills

Researchers and AI agents can explore the growing library of medical research agent skills through multiple resources:

Open-Source Repository on GitHub
AIPOCH Medical Research Agent Skills List – Browse all skills organized by category, from Evidence Insights to Academic Writing.
Full Agent Skills Overview – Learn about the purpose, workflow integration, and capabilities of each skill in detail.

⭐ If you find this repository useful, consider giving it a star! It helps more researchers discover Medical Research Agent Skills and supports the continued development of this library.

FAQs About Hermes Agent

What is Hermes Agent in simple terms?

In simple terms, Hermes Agent is an AI agent that learns from experience, improves itself over time, and gets better at understanding you.

Is Hermes Agent open source?

Yes, Hermes Agent is an open-source project developed by Nous Research.

Why does Hermes Agent feel different from other AI agents?

Hermes focuses more on persistent memory and reusable skills.

Does Hermes Agent use its own AI model?

Hermes is an agent framework and can connect to different LLMs.

What are Hermes Agent skills?

Skills are reusable task workflows generated by the agent after completing tasks, allowing future automation. (Source: Hermes documentation)

Disclaimer: This AI-assisted content is for informational purposes only. The content may be incomplete or inaccurate, and should be independently verified. The AI systems and agent workflows discussed (including Hermes Agent and AIPOCH Medical Research Agent Skills) are intended to support research processes, not to replace professional judgment. AI-generated outputs may be incomplete or inaccurate and should be independently reviewed. References to third-party projects do not imply official endorsement or affiliation.

Medical Research Agent Skills: Blind Review Sanitizer

The_resa — Tue, 14 Apr 2026 01:54:30 +0000

You can explore a growing collection of Medical Research Agent Skills on AIPOCH Github.

If you find it useful, consider giving it a ⭐ to support the project!

If you want to explore more about this skill—including Complete Workflow Example, Common Patterns, Quality Checklist, Common Pitfalls, Troubleshooting, References, and more—please visit this page: Blind Review Sanitizer.

What is Blind Review Sanitizer?

Automatically anonymize academic manuscripts for double-blind peer review by removing author identifiers, institutional affiliations, acknowledgments, and excessive self-citations while preserving document formatting and scholarly content integrity.

What Are The Key Capabilities of This Agent Skill?

Author Identity Removal: Automatically detect and redact author names, institutional affiliations, and contact information using pattern matching and customizable rules
Acknowledgment Section Sanitization: Identify and remove or flag acknowledgment sections that may reveal author identity through funding sources or personal thanks
Self-Citation Detection and Neutralization: Identify first-person citations and excessive self-references that could deanonymize the submission
Multi-Format Document Support: Process DOCX, Markdown, and plain text files with format-aware sanitization strategies
Audit Trail Generation: Create detailed logs of all redactions made for verification and transparency
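The pattern-matching and audit-trail ideas above can be sketched in a few lines. This is a simplified illustration, not the skill's actual code: the `sanitize` function, the `EMAIL` pattern, and the placeholder format are assumptions, and the email pattern is deliberately loose.

```python
import re

# Loose email pattern for illustration only; it is not a full address validator.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def sanitize(text: str, author_names: list) -> tuple:
    """Redact emails and known author names; return (clean_text, audit_log)."""
    audit = []

    def redact(pattern, label, current):
        def replace(match):
            # Log every redaction so a human can verify it afterwards.
            audit.append({"type": label, "original": match.group(0)})
            return f"[{label} REDACTED]"
        return pattern.sub(replace, current)

    text = redact(EMAIL, "EMAIL", text)
    for name in author_names:
        # re.escape keeps characters such as "." in a name from acting as regex syntax.
        text = redact(re.compile(re.escape(name), re.IGNORECASE), "AUTHOR", text)
    return text, audit
```

The audit log records what was removed and why, so it can be reviewed during manual verification.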

Limitations and Considerations

Important Limitations:

Not Foolproof: Automated sanitization cannot guarantee complete anonymity. Always perform manual verification.
Context Blindness: Pattern matching may miss context-dependent identifiers or incorrectly flag legitimate content.
Image Processing: This tool processes text only. Images, figures, and embedded objects may contain identifying information not detected.
LaTeX Support: Limited support for LaTeX source files. Consider using LaTeX-specific tools for LaTeX manuscripts.
Language Support: Optimized for English and Chinese. Other languages may have reduced accuracy.

Ethical and Legal Considerations:

Author Consent: Ensure all authors consent to anonymization before submission
Copyright: Anonymization does not change copyright ownership
Data Availability: Some journals require non-anonymized versions for data/code availability statements
Post-Acceptance: Plan for deanonymization process after paper acceptance

Explore AIPOCH Agent Skills

Researchers and AI agents can explore the growing library of medical research agent skills through multiple resources:

Open-Source Repository on GitHub
AIPOCH Medical Research Agent Skills List – Browse all skills organized by category.
Full Agent Skills Overview – Learn about the purpose and capabilities of each skill in detail.

These resources make it easy to explore, validate, and experiment with AIPOCH’s growing library.

AIPOCH Medical Skill Auditor: How We Evaluate Agent Skills

The_resa — Thu, 09 Apr 2026 06:48:57 +0000

You can explore a growing collection of Medical Research Agent Skills on the AIPOCH Github.

If you find it useful, consider giving it a ⭐ to support the project!

What is Medical Skill Auditor?

AIPOCH Medical Skill Auditor is a framework for assessing the quality of AIPOCH's Agent Skills. Its core function is to perform a comprehensive quality check on a Skill before it is released to users.

How does Medical Skill Auditor Work?

Veto Gates

To enforce strict quality control, Skill Auditor is designed with two layers of veto mechanisms. Any failure in these checks may lead to immediate rejection of a skill.

Skill Veto

Take the agent skill “medical-research-literature-reader-pro” as an example:

Operational Stability
Structural Consistency
Result Determinism
System Security

Research Veto

Take the agent skill “medical-research-literature-reader-pro” as an example:

Scientific Integrity
Practice Boundaries
Methodological Ground
Code Usability
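The two veto layers can be sketched as a simple gate. This is a minimal illustration, not the Auditor's actual implementation: the check names come from the lists above, but the article does not say how each check is computed, so the results are passed in as plain booleans. The function names `failed_checks` and `passes_veto_gates` are hypothetical, and the sketch takes the strictest reading, where any single failure rejects the skill.

```python
# Check names are copied from the two veto lists above. How each check is
# evaluated is not described in the article, so results are supplied by the caller.
SKILL_VETO = (
    "Operational Stability",
    "Structural Consistency",
    "Result Determinism",
    "System Security",
)
RESEARCH_VETO = (
    "Scientific Integrity",
    "Practice Boundaries",
    "Methodological Ground",
    "Code Usability",
)


def failed_checks(results: dict[str, bool]) -> list[str]:
    """Return the veto checks that failed; a missing result counts as a failure."""
    return [
        name
        for name in SKILL_VETO + RESEARCH_VETO
        if not results.get(name, False)
    ]


def passes_veto_gates(results: dict[str, bool]) -> bool:
    """Assumption: a single failed check is enough to reject the skill."""
    return not failed_checks(results)
```

Treating a missing result as a failure keeps the gate conservative: a skill cannot pass simply because a check was never run.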

Core Capability

Take the agent skill “medical-research-literature-reader-pro” as an example:

Evaluates a skill’s design and contract against key dimensions such as Functional Suitability, Reliability, Performance & Context, Agent Usability, Human Usability, Security, Agent-Specific and Maintainability.

Medical Task

Take the agent skill “medical-research-literature-reader-pro” as an example:

Assesses actual outputs of a skill with layered criteria.

For skill testing, the AI automatically generates inputs. The number of inputs in specific categories will increase or decrease depending on the complexity of the skill. The following 7 inputs represent the most comprehensive version.

Canonical
Variant A
Edge
Variant B
Stress
Scope Boundary
Adversarial

Skill Complexity Classification

Label	Code/Rank	Definition
Simple	S	Narrow task scope
Moderate	M	Moderate branching or multiple task types
Complex	C	Broad or multi-step specialized skill

Simple (S): 3 inputs

Moderate (M): 5 inputs

Complex (C): 7 inputs
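
The mapping from complexity class to number of test inputs can be written as a small lookup. This is only a sketch of the counts stated above. The article does not say which of the seven categories are used for Simple or Moderate skills, so the code makes no attempt to choose them, and the name `required_input_count` is hypothetical.

```python
# The seven input categories, in the order the article lists them.
INPUT_CATEGORIES = (
    "Canonical",
    "Variant A",
    "Edge",
    "Variant B",
    "Stress",
    "Scope Boundary",
    "Adversarial",
)

# Number of generated inputs per complexity code, as stated in the article.
INPUTS_BY_COMPLEXITY = {"S": 3, "M": 5, "C": 7}


def required_input_count(complexity_code: str) -> int:
    """Return how many test inputs a skill of the given complexity receives."""
    try:
        return INPUTS_BY_COMPLEXITY[complexity_code.upper()]
    except KeyError:
        raise ValueError(f"unknown complexity code: {complexity_code!r}") from None
```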

Final Score

Take the agent skill “medical-research-literature-reader-pro” as an example:

Skill Auditor uses a two-stage scoring system: static evaluation (design quality, 40% of the total) and dynamic evaluation (runtime performance, 60% of the total). The final score is the weighted sum of the two.

Static (40%)
Dynamic (60%)

Final Score = Static Score × 40% + Dynamic Score × 60%
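
As a worked example of the formula, here is a minimal sketch. The function name `final_score` is hypothetical, and the only assumption is that both stage scores are reported on the same scale; the sample values below are made up for illustration and are not real evaluation results.

```python
STATIC_WEIGHT = 0.4   # design quality
DYNAMIC_WEIGHT = 0.6  # runtime performance


def final_score(static_score: float, dynamic_score: float) -> float:
    """Combine the two stage scores using the 40% / 60% weights from the article."""
    return STATIC_WEIGHT * static_score + DYNAMIC_WEIGHT * dynamic_score


# Illustrative values only: a static score of 80 and a dynamic score of 90
# combine to 0.4 * 80 + 0.6 * 90 = 32 + 54 = 86.
example = final_score(80, 90)
```

Because the dynamic stage carries the larger weight, a skill with a strong design but weak runtime behavior scores lower than the reverse case.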

You can view evaluation results for selected AIPOCH skills here.

Feedback and possible future directions

This framework is still under active development. Right now it is applied to only a subset of AIPOCH’s skills, but we’re considering expanding it more broadly.

😎AIPOCH – 450+ Modular Agent Skills for Medical Research

The_resa — Wed, 25 Mar 2026 09:10:00 +0000

Hi! I’m part of the team building AIPOCH, an open-source library of 450+ executable Agent Skills designed specifically for medical research workflows.

AIPOCH GitHub Repository
See our website here

Why We Built AIPOCH

Most medical research AI tools today are essentially a bundle of prompt engineering + fixed toolchains + a UI. They handle "published knowledge" well (like summarizing a paper), but they fall apart the moment you say: "Now validate this hypothesis using my own cohort data." Existing tools often lack a persistent research context. There is no version-controlled hypothesis tracking, no seamless link between literature evidence and actual data execution. We wanted to move beyond point-solutions to a modular, extensible protocol.

What is AIPOCH?

AIPOCH is a curated library of 450+ Medical Research Agent Skills, built to work with OpenClaw and other AI agent platforms, including OpenCode and Claude. To achieve this, we have encoded specialized medical research logic directly into our Skills.

Scientific Integrity Constraints
Study type identification
Medically Specialized Prompt Logic

A Skill is a structured capability package consisting of:

skill.md: A "contract" containing YAML metadata (trigger logic) and specific operational steps.
Python Scripts: Executable engines called directly via bash under the guidance of the skill.md.

In AIPOCH, a skill is therefore a structured capability package for professional medical research tasks: skill.md acts as the trigger contract, and the Python scripts act as the execution engine. Medical research constraints are embedded directly in skill.md, the references, and the Python scripts.
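
To make the skill.md "contract" more concrete, here is a minimal sketch of how an agent runtime might separate the YAML metadata from the operational steps. Everything specific here is an assumption for illustration: the `---` front-matter delimiters, the field names `name` and `description`, the script path, and the helper `split_skill_md` are hypothetical and are not taken from an actual AIPOCH skill. The parser handles only flat `key: value` lines; real YAML would need a proper parser such as PyYAML.

```python
def split_skill_md(text: str) -> tuple[dict[str, str], str]:
    """Split a skill.md document into (metadata, body).

    Assumes the metadata is a front-matter block delimited by '---' lines
    and contains only flat 'key: value' pairs (a simplification of YAML).
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text

    # Find the closing '---' delimiter after the first line.
    end = next(
        (i for i, line in enumerate(lines[1:], start=1) if line.strip() == "---"),
        None,
    )
    if end is None:
        return {}, text

    metadata: dict[str, str] = {}
    for line in lines[1:end]:
        key, sep, value = line.partition(":")
        if sep and key.strip():
            metadata[key.strip()] = value.strip()

    body = "\n".join(lines[end + 1:]).strip()
    return metadata, body


# Hypothetical skill.md content, used only for this illustration.
EXAMPLE_SKILL_MD = """---
name: example-skill
description: Hypothetical skill used only for this illustration
---
1. Read the user request.
2. Run scripts/example.py via bash.
"""

metadata, body = split_skill_md(EXAMPLE_SKILL_MD)
```

In this sketch the metadata is what an agent would match against to decide whether to trigger the skill, and the body holds the steps that guide which script to call.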

AIPOCH Medical Skill Auditor (in development)

What is Medical Skill Auditor?

Skill Auditor is AIPOCH’s evaluation framework under active development for scoring Medical Research Agent Skills with rigorous, multi‑dimensional quality metrics. It’s intended to go beyond static descriptions by measuring both core capability and real execution performance—giving users and developers a clearer, data‑driven understanding of skill quality.

How does it work?

🧰 Core Capability
Evaluates a skill’s design and contract against key dimensions such as Functional Suitability, Reliability, Performance & Context, Agent Usability, Human Usability, Security, Agent-Specific and Maintainability.

📊 Medical Task
Assesses actual outputs of a skill with layered criteria, weighting general competence and category‑specific behaviors to reflect real‑world execution quality.

🚫 Veto Gates
To enforce strict quality control, Skill Auditor is designed with two layers of veto mechanisms. Any failure in these checks may lead to immediate rejection of a skill.

Skill Veto
Operational Stability
Structural Consistency
Result Determinism
System Security

Research Veto
Scientific Integrity
Practice Boundaries
Methodological Ground
Code Usability

The Most Frustrating Moment

One of our biggest early mistakes was using a cheaper LLM to "vibe code" the initial batch of scripts.
On the surface, it worked. The scripts ran, and the logic seemed okay. The nightmare only surfaced during our audit: we realized the executing agent was silently correcting the script's logic on the fly. Because the agent read the intent in skill.md, it would "patch" the sloppy edge cases and vague error branches in the Python code during execution.
The result? We were burning massive amounts of extra tokens just to fix errors that shouldn't have existed. It didn't throw an error; it just showed up on the API bill.
We eventually scrapped the lot. We learned the hard way: Quantity isn't a moat; high-quality scripts are.

All questions/feedback welcome! 😎😎😎
