DEV Community

AI Explained Series' Articles

A series by pueding
Is Grep All You Need? Grep vs Vector Retrieval for Agentic Search
2 comments
7 min read
MCP SEP-2468: RFC 9207 Iss Parameter for OAuth Mix-Up Defense
1 comment
8 min read
Camouflage Injection Paper: Camouflage Detection Gap
7 min read
OpenSCAD Pantheon Benchmark: Human-In-The-Loop vs Autonomous Coding Agents
8 min read
Boiling the Frog Paper: Multi-Turn Norm Erosion vs Single-Prompt Agent Safety
8 min read
Cursor Composer 2.5: Targeted Textual Feedback RL
8 min read
CDD Paper: Context-Driven Decomposition for RAG Knowledge Conflict
9 min read
Gemini 3.5 Flash: Agent-First Model Design
1 comment
8 min read
OmniRetrieval: Source-Native Query Dispatch
6 min read
Claude Opus 4.8: Parallel-Subagent Dynamic Workflows
6 min read
AgentDoG 1.5: Small Inline Guard Models for Agent Actions
7 min read
GrepSeek Trains a Search Agent to Use Shell Commands: GRPO-Trained Shell-Command Search
6 min read
Harness-1: State-Externalizing Search Harness
7 min read
Microsoft MAI-Code-1-Flash: Adaptive Solution-Length Control
1 comment
6 min read
Token Budgets Paper: Affine-Typed Budget Ownership
6 min read
MCP 2026-07-28 RC: Stateless Transport
9 min read
MarginGate: Margin-Gated Verification for Batch-Invariant Decoding
5 min read
MCP SEP-2106: Full JSON Schema 2020-12 in Tool I/O
1 comment
7 min read
AutoLab Benchmarks Frontier Agents on Long-Horizon R&D Tasks: Iterative Experiment-Loop Evaluation
6 min read
Agent-Harness Scaling Law: Feedback Quality Predicts Success, Not Raw Compute: Effective Feedback Compute (EFC)
7 min read
Google Releases DiffusionGemma: Parallel Block Decoding
3 comments
6 min read
MiniMax M3 Ships Open-Weight 1M Context: MiniMax Sparse Attention (MSA)
6 min read
Google Ships Gemma 4 QAT Checkpoints: Quantization-Aware Training
1 comment
6 min read
NVIDIA RTX Spark Superchip: Unified CPU–GPU Memory
1 comment
8 min read
NVIDIA Blackwell Leads AgentPerf, the First Agentic-AI Infra Benchmark: Trajectory-Replay Benchmarking
6 min read
Google Releases Gemma 4 12B: Encoder-Free Multimodal Projection
6 min read
Microsoft FastContext: a Repo-Explorer Subagent Cuts Coding-Agent Tokens 60%: Explorer-Subagent Context Offloading
7 min read
FlashMemory Cuts DeepSeek-V4's KV Cache to 13.5%: Lookahead Sparse Attention
6 min read
NVIDIA Blackwell Sweeps MLPerf Training 6.0: Strong Scaling
6 min read
AMD ATOM + ATOMesh: Prefill/decode Disaggregation on ROCm
7 min read
Agent Leaderboards Mislead Under Distribution Shift (IBM): Predictive Validity
7 min read
GLM-5.2 Becomes the Top Open-Weights Model: Active vs Total Parameters
6 min read
Baidu Unlimited OCR Holds the KV Cache Constant for 40+ Pages: Reference Sliding Window Attention
8 min read
OpenAI and Broadcom's Jalapeño, a Custom Inference ASIC: Inference ASIC vs GPU
5 comments
7 min read
Qwen-AgentWorld Trains a Language Model as a World Model for RL Agents: World Model as a Decoupled RL Simulator
6 min read
CacheWeaver Reorders RAG Evidence for Prefix-Cache Reuse: Prefix-Cache-Aware Evidence Reordering
3 comments
7 min read
SGLang v0.5.14: LPLB Expert-Parallel Load Balancing
7 min read
S-Agent: Spatial Tool-Use Makes an 8B Agent Rival GPT-5.4 on Spatial Reasoning: Spatio-Temporal Evidence Accumulation
6 min read