DEV Community

# llm

Posts

Prompt Engineering for Test Data: Scaling Tests with Coverage and Without Duplication Using LLMs

Comments
5 min read
How much VRAM do you actually need to run Llama 3 or Gemma locally?

Comments
4 min read
consent_url is not a governance layer — what Whire got right and what comes next

Comments
3 min read
Stop Hiding the Chain of Thought: Stream Claude 4.5 Native Thinking Blocks with Spring AI and SSE

Comments
2 min read
How Retrieval‑Augmented Generation Is Revolutionizing Real‑Time, Personalized Career Coaching on AI‑Powered Talent Platforms

Comments
7 min read
AI Jailbreaks Explained: Prompt Injection, Risks, and Node.js Guardrails

Comments
2 min read
Why AI Systems Need State Management More Than Bigger Context Windows

1
Comments
3 min read
I Built the Easiest Way for Your AI Agent to Get a Phone Number (AgentLine)

1
Comments
4 min read
Tokenization under the hood: BPE, WordPiece, SentencePiece, and Unigram compared

Comments
9 min read
The Core of a Coding Agent Is 128 Lines of Python. So I Built One From Scratch.

1
Comments
4 min read
Nobody keeps the receipts for AI pricing, so I built the changelog

2
Comments
2 min read
Ollama Structured Outputs in Practice — Getting Type-Safe JSON from Local LLMs with Pydantic

Comments
6 min read
GLM-5.2 Made It Official: 9 of the Top 10 Open-Source LLMs Are Chinese

Comments
7 min read
Why Your Reranker Isn't Helping Your RAG Pipeline (And How to Prove It)

1
Comments 1
4 min read