DEV Community

# llm

Posts

Why JSON.parse() Fails Silently on Truncated LLM Responses (And What I Did About It)

2
Comments 2
3 min read
I built an Agent Memory System for myself and got 90.8% (end-to-end) on LongMemEval

1
Comments 1
7 min read
AI Weekly: 4/1–4/10 | Anthropic Triple Shock Sequel — Mythos Too Dangerous to Ship, Revenue Passes OpenAI, Software Stocks Crash

Comments
9 min read
Your AI agent isn't broken. It's confidently wrong. Here's the difference

Comments
4 min read
Embeddings Just Went Multimodal: What Sentence Transformers 5.4 Means for RAG

Comments
2 min read
What I Learned from Chapter 2 of AI Engineering: Why Sampling Changes Everything

1
Comments
4 min read
Anthropic Just Did Something Unprecedented: They Hid Their Best Security Model

Comments
2 min read
Marker, hosted: a scientific PDF parser API with LaTeX equations preserved

Comments
4 min read
RedSOC: Open-source framework to benchmark adversarial attacks on AI-powered SOCs — 100% detection rate across 15 attack scenarios [paper + code]

Comments
2 min read
How to Conduct an Enterprise-Scale AX Audit with megallm-Grade Rigor

Comments
3 min read
From AI Demo to Production: How to Ship Quality Agentic Applications

1
Comments
8 min read
# Your AI Agents Are Talking — But Can You Prove What They Said?

1
Comments
5 min read
AI Search Optimization for Jekyll: JSON-LD, llms.txt, and Entity Graphs

1
Comments
2 min read
AI Automation Workflows Are Redefining Enterprise Data Engineering

4
Comments
3 min read
I Spent Four Weeks Reading 200+ Sources on Context Engineering. Here's What I Built.

Comments
4 min read