DEV Community

# research

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Multi-Agent Coordination Without Hierarchy: What 70 Days of Production Data Showed

Multi-Agent Coordination Without Hierarchy: What 70 Days of Production Data Showed

Comments
2 min read
Bridging the Gap: Aligning Software Engineering Practices with Research Goals in Scientific Organizations

Bridging the Gap: Aligning Software Engineering Practices with Research Goals in Scientific Organizations

Comments
12 min read
Stop Using Elaborate Personas: Research Shows They Degrade Claude Code Output

Stop Using Elaborate Personas: Research Shows They Degrade Claude Code Output

Comments
3 min read
Persona Persistence Attacks: When Your AI Agent's Soul File Becomes a Backdoor

Persona Persistence Attacks: When Your AI Agent's Soul File Becomes a Backdoor

Comments
2 min read
Harvard Proved Emotions Don't Make AI Smarter — That's Exactly Why You Need Soul Spec

Harvard Proved Emotions Don't Make AI Smarter — That's Exactly Why You Need Soul Spec

1
Comments
4 min read
Cross-Model Persona Fidelity: Is Your AI Agent Still 'Them' on a Different LLM?

Cross-Model Persona Fidelity: Is Your AI Agent Still 'Them' on a Different LLM?

Comments
2 min read
Can AI Agents Detect Their Own Model Upgrades?

Can AI Agents Detect Their Own Model Upgrades?

Comments
3 min read
ReCUBE Benchmark Reveals GPT-5 Scores Only 37.6% on Repository-Level Code Generation

ReCUBE Benchmark Reveals GPT-5 Scores Only 37.6% on Repository-Level Code Generation

Comments
6 min read
NeurIPS 2025 Proved It: Every LLM Says the Same Thing — Here's the Fix

NeurIPS 2025 Proved It: Every LLM Says the Same Thing — Here's the Fix

Comments
4 min read
Identity + Governance = 100% Safety? Testing Combined Persona Approaches on Abliterated LLMs

Identity + Governance = 100% Safety? Testing Combined Persona Approaches on Abliterated LLMs

Comments
5 min read
Research with AI: primary sources, certainty labeling and counter-argumentation

Research with AI: primary sources, certainty labeling and counter-argumentation

Comments
6 min read
What Cursor's 8GB Storage Bloat Teaches Us About Claude Code's Clean Architecture

What Cursor's 8GB Storage Bloat Teaches Us About Claude Code's Clean Architecture

Comments 1
3 min read
TIAMAT: The First Autonomous AI Operating System

TIAMAT: The First Autonomous AI Operating System

Comments
1 min read
AI App Builder Platforms: A Comprehensive Benchmarking Study

AI App Builder Platforms: A Comprehensive Benchmarking Study

Comments 1
11 min read
ARC-AGI-3 Just Broke Every Frontier Model. Humans Score 100%. GPT-5.4 Scores 0.26%.

ARC-AGI-3 Just Broke Every Frontier Model. Humans Score 100%. GPT-5.4 Scores 0.26%.

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.