Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
AI Explained Series' Articles
Back to pueding's Series
Is Grep All You Need? Grep vs Vector Retrieval for Agentic Search
pueding
pueding
pueding
Follow
May 21
Is Grep All You Need? Grep vs Vector Retrieval for Agentic Search
#
agents
#
llm
#
rag
#
ai
2
reactions
Comments
Add Comment
7 min read
MCP SEP-2468: RFC 9207 Iss Parameter for OAuth Mix-Up Defense
pueding
pueding
pueding
Follow
May 22
MCP SEP-2468: RFC 9207 Iss Parameter for OAuth Mix-Up Defense
#
ai
#
agents
#
mcp
#
security
1
reaction
Comments
Add Comment
8 min read
Camouflage Injection Paper: Camouflage Detection Gap
pueding
pueding
pueding
Follow
May 23
Camouflage Injection Paper: Camouflage Detection Gap
#
ai
#
agents
#
security
#
llm
Comments
Add Comment
7 min read
OpenSCAD Pantheon Benchmark: Human-In-The-Loop vs Autonomous Coding Agents
pueding
pueding
pueding
Follow
May 24
OpenSCAD Pantheon Benchmark: Human-In-The-Loop vs Autonomous Coding Agents
#
ai
#
agents
#
productivity
#
llm
Comments
Add Comment
8 min read
Boiling the Frog Paper: Multi-Turn Norm Erosion vs Single-Prompt Agent Safety
pueding
pueding
pueding
Follow
May 25
Boiling the Frog Paper: Multi-Turn Norm Erosion vs Single-Prompt Agent Safety
#
ai
#
agents
#
llm
#
security
Comments
Add Comment
8 min read
Cursor Composer 2.5: Targeted Textual Feedback RL
pueding
pueding
pueding
Follow
May 26
Cursor Composer 2.5: Targeted Textual Feedback RL
#
ai
#
llm
#
machinelearning
Comments
Add Comment
8 min read
CDD Paper: Context-Driven Decomposition for RAG Knowledge Conflict
pueding
pueding
pueding
Follow
May 27
CDD Paper: Context-Driven Decomposition for RAG Knowledge Conflict
#
ai
#
llm
#
rag
#
agents
Comments
Add Comment
9 min read
Gemini 3.5 Flash: Agent-First Model Design
pueding
pueding
pueding
Follow
May 29
Gemini 3.5 Flash: Agent-First Model Design
#
ai
#
agents
#
llm
#
machinelearning
Comments
1
comment
8 min read
OmniRetrieval: Source-Native Query Dispatch
pueding
pueding
pueding
Follow
May 30
OmniRetrieval: Source-Native Query Dispatch
#
ai
#
rag
#
llm
#
agents
Comments
Add Comment
6 min read
Claude Opus 4.8: Parallel-Subagent Dynamic Workflows
pueding
pueding
pueding
Follow
May 31
Claude Opus 4.8: Parallel-Subagent Dynamic Workflows
#
agents
#
ai
#
llm
Comments
Add Comment
6 min read
AgentDoG 1.5: Small Inline Guard Models for Agent Actions
pueding
pueding
pueding
Follow
Jun 1
AgentDoG 1.5: Small Inline Guard Models for Agent Actions
#
ai
#
security
#
agents
#
llm
Comments
Add Comment
7 min read
GrepSeek Trains a Search Agent to Use Shell Commands: GRPO-Trained Shell-Command Search
pueding
pueding
pueding
Follow
Jun 2
GrepSeek Trains a Search Agent to Use Shell Commands: GRPO-Trained Shell-Command Search
#
ai
#
agents
#
rag
#
llm
Comments
Add Comment
6 min read
Harness-1: State-Externalizing Search Harness
pueding
pueding
pueding
Follow
Jun 3
Harness-1: State-Externalizing Search Harness
#
ai
#
agents
#
llm
Comments
Add Comment
7 min read
Microsoft MAI-Code-1-Flash: Adaptive Solution-Length Control
pueding
pueding
pueding
Follow
Jun 4
Microsoft MAI-Code-1-Flash: Adaptive Solution-Length Control
#
ai
#
llm
#
machinelearning
1
reaction
Comments
Add Comment
6 min read
Token Budgets Paper: Affine-Typed Budget Ownership
pueding
pueding
pueding
Follow
Jun 5
Token Budgets Paper: Affine-Typed Budget Ownership
#
agents
#
ai
#
llm
Comments
Add Comment
6 min read
MCP 2026-07-28 RC: Stateless Transport
pueding
pueding
pueding
Follow
Jun 6
MCP 2026-07-28 RC: Stateless Transport
#
mcp
#
ai
#
agents
#
devops
Comments
Add Comment
9 min read
MarginGate: Margin-Gated Verification for Batch-Invariant Decoding
pueding
pueding
pueding
Follow
Jun 7
MarginGate: Margin-Gated Verification for Batch-Invariant Decoding
#
llm
#
ai
#
machinelearning
#
tutorial
Comments
Add Comment
5 min read
MCP SEP-2106: Full JSON Schema 2020-12 in Tool I/O
pueding
pueding
pueding
Follow
Jun 8
MCP SEP-2106: Full JSON Schema 2020-12 in Tool I/O
#
mcp
#
ai
#
agents
#
tutorial
Comments
1
comment
7 min read
AutoLab Benchmarks Frontier Agents on Long-Horizon R&D Tasks: Iterative Experiment-Loop Evaluation
pueding
pueding
pueding
Follow
Jun 9
AutoLab Benchmarks Frontier Agents on Long-Horizon R&D Tasks: Iterative Experiment-Loop Evaluation
#
agents
#
ai
#
llm
#
machinelearning
Comments
Add Comment
6 min read
Agent-Harness Scaling Law: Feedback Quality Predicts Success, Not Raw Compute: Effective Feedback Compute (EFC)
pueding
pueding
pueding
Follow
Jun 10
Agent-Harness Scaling Law: Feedback Quality Predicts Success, Not Raw Compute: Effective Feedback Compute (EFC)
#
agents
#
ai
#
llm
#
machinelearning
Comments
Add Comment
7 min read
Google Releases DiffusionGemma: Parallel Block Decoding
pueding
pueding
pueding
Follow
Jun 11
Google Releases DiffusionGemma: Parallel Block Decoding
#
ai
#
llm
#
machinelearning
#
tutorial
3
reactions
Comments
Add Comment
6 min read
MiniMax M3 Ships Open-Weight 1M Context: MiniMax Sparse Attention (MSA)
pueding
pueding
pueding
Follow
Jun 12
MiniMax M3 Ships Open-Weight 1M Context: MiniMax Sparse Attention (MSA)
#
ai
#
llm
#
machinelearning
#
tutorial
Comments
Add Comment
6 min read
Google Ships Gemma 4 QAT Checkpoints: Quantization-Aware Training
pueding
pueding
pueding
Follow
Jun 13
Google Ships Gemma 4 QAT Checkpoints: Quantization-Aware Training
#
ai
#
llm
#
machinelearning
#
tutorial
1
reaction
Comments
Add Comment
6 min read
NVIDIA RTX Spark Superchip: Unified CPU–GPU Memory
pueding
pueding
pueding
Follow
Jun 14
NVIDIA RTX Spark Superchip: Unified CPU–GPU Memory
#
ai
#
machinelearning
#
llm
#
agents
Comments
1
comment
8 min read
NVIDIA Blackwell Leads AgentPerf, the First Agentic-AI Infra Benchmark: Trajectory-Replay Benchmarking
pueding
pueding
pueding
Follow
Jun 15
NVIDIA Blackwell Leads AgentPerf, the First Agentic-AI Infra Benchmark: Trajectory-Replay Benchmarking
#
ai
#
agents
#
llm
#
machinelearning
Comments
Add Comment
6 min read
Google Releases Gemma 4 12B: Encoder-Free Multimodal Projection
pueding
pueding
pueding
Follow
Jun 16
Google Releases Gemma 4 12B: Encoder-Free Multimodal Projection
#
ai
#
llm
#
machinelearning
#
google
Comments
Add Comment
6 min read
Microsoft FastContext: a Repo-Explorer Subagent Cuts Coding-Agent Tokens 60%: Explorer-Subagent Context Offloading
pueding
pueding
pueding
Follow
Jun 17
Microsoft FastContext: a Repo-Explorer Subagent Cuts Coding-Agent Tokens 60%: Explorer-Subagent Context Offloading
#
ai
#
agents
#
llm
#
devops
Comments
Add Comment
7 min read
FlashMemory Cuts DeepSeek-V4's KV Cache to 13.5%: Lookahead Sparse Attention
pueding
pueding
pueding
Follow
Jun 18
FlashMemory Cuts DeepSeek-V4's KV Cache to 13.5%: Lookahead Sparse Attention
#
ai
#
llm
#
machinelearning
Comments
Add Comment
6 min read
NVIDIA Blackwell Sweeps MLPerf Training 6.0: Strong Scaling
pueding
pueding
pueding
Follow
Jun 20
NVIDIA Blackwell Sweeps MLPerf Training 6.0: Strong Scaling
#
ai
#
machinelearning
#
llm
#
devops
Comments
Add Comment
6 min read
AMD ATOM + ATOMesh: Prefill/decode Disaggregation on ROCm
pueding
pueding
pueding
Follow
Jun 21
AMD ATOM + ATOMesh: Prefill/decode Disaggregation on ROCm
#
ai
#
llm
#
devops
#
machinelearning
Comments
Add Comment
7 min read
Agent Leaderboards Mislead Under Distribution Shift (IBM): Predictive Validity
pueding
pueding
pueding
Follow
Jun 22
Agent Leaderboards Mislead Under Distribution Shift (IBM): Predictive Validity
#
agents
#
ai
#
llm
#
machinelearning
Comments
Add Comment
7 min read
GLM-5.2 Becomes the Top Open-Weights Model: Active vs Total Parameters
pueding
pueding
pueding
Follow
Jun 23
GLM-5.2 Becomes the Top Open-Weights Model: Active vs Total Parameters
#
llm
#
ai
#
machinelearning
Comments
Add Comment
6 min read
Baidu Unlimited OCR Holds the KV Cache Constant for 40+ Pages: Reference Sliding Window Attention
pueding
pueding
pueding
Follow
Jun 26
Baidu Unlimited OCR Holds the KV Cache Constant for 40+ Pages: Reference Sliding Window Attention
#
ai
#
llm
#
machinelearning
Comments
Add Comment
8 min read
OpenAI and Broadcom's Jalapeño, a Custom Inference ASIC: Inference ASIC vs GPU
pueding
pueding
pueding
Follow
Jun 27
OpenAI and Broadcom's Jalapeño, a Custom Inference ASIC: Inference ASIC vs GPU
#
ai
#
llm
#
hardware
#
machinelearning
5
reactions
Comments
Add Comment
7 min read
Qwen-AgentWorld Trains a Language Model as a World Model for RL Agents: World Model as a Decoupled RL Simulator
pueding
pueding
pueding
Follow
Jun 28
Qwen-AgentWorld Trains a Language Model as a World Model for RL Agents: World Model as a Decoupled RL Simulator
#
agents
#
ai
#
llm
#
machinelearning
Comments
Add Comment
6 min read
CacheWeaver Reorders RAG Evidence for Prefix-Cache Reuse: Prefix-Cache-Aware Evidence Reordering
pueding
pueding
pueding
Follow
Jun 29
CacheWeaver Reorders RAG Evidence for Prefix-Cache Reuse: Prefix-Cache-Aware Evidence Reordering
#
rag
#
llm
#
ai
#
machinelearning
Comments
3
comments
7 min read
SGLang v0.5.14: LPLB Expert-Parallel Load Balancing
pueding
pueding
pueding
Follow
Jun 30
SGLang v0.5.14: LPLB Expert-Parallel Load Balancing
#
ai
#
llm
#
machinelearning
#
devops
Comments
Add Comment
7 min read
S-Agent: Spatial Tool-Use Makes an 8B Agent Rival GPT-5.4 on Spatial Reasoning: Spatio-Temporal Evidence Accumulation
pueding
pueding
pueding
Follow
Jul 1
S-Agent: Spatial Tool-Use Makes an 8B Agent Rival GPT-5.4 on Spatial Reasoning: Spatio-Temporal Evidence Accumulation
#
agents
#
ai
#
llm
#
machinelearning
Comments
Add Comment
6 min read
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account