DEV Community

# llm

Posts

# Medical RAG Architecture Overview #llmszoomcamp

Comments
5 min read
90% of Claude Apps Leak Context. Here's How to Fix It Before It Costs You Thousands

Comments
5 min read
How to serve Markdown to AI agents: Making your docs more AI-friendly

5
Comments 1
2 min read
Integrating Ollama with Python: REST API and Python Client Examples

1
Comments
4 min read
# Data Ingestion & Vector Store #llmszoomcamp

Comments
2 min read
RAG LLM: Why Your AI Costs 10x More Than It Should (And How to Fix It)

Comments
5 min read
Agentic AI: How LLMs Really Work Behind the Scenes

8
Comments
4 min read
I Tried Building a Whiteboard App with Claude 4.5 Sonnet

Comments
2 min read
Claude Sonnet 4.5 vs. GPT-5 Codex: Best model for agentic coding

8
Comments 1
11 min read
Prompt Caching Slashed My AI Bills by 90%. Here's What Nobody Tells You.

Comments
5 min read
AI-Powered Resume & Job Description Matching with RAG

Comments
1 min read
Agent Optimization: Why Context Engineering Isn’t Enough

Comments
5 min read
Train it or feed it? Teaching LLMs your data the smart way

Comments
4 min read
AI Security Tools Find Critical curl Vulnerabilities

Comments
9 min read
Why Claude Code's Unix Philosophy Beats Other AI Assistants

Comments
8 min read
Step-by-Step: Manual vLLM Setup on Google Cloud L4 (Debian)

5
Comments
2 min read
Improving Code Generation Accuracy by Automatically Searching for Similar Examples After Referencing Documentation

4
Comments
3 min read
Gemini 2.5 Flash-Lite: Speed > Scale — 887 TPS, 50% Less Verbosity, Real-World Wins

Comments
1 min read
Claude vs Humans: Anthropic’s CTF Run

5
Comments
5 min read
Amazon Bedrock AgentCore Runtime - Part 7 Using AgentCore long-term Memory with Strands Agents SDK

3
Comments
13 min read
Granite 4: IBM introduces a line of small but fast LLMs

Comments
2 min read
OpenAI's SORA 2 Release Pattern: What It Means for AI Video

Comments
9 min read
Post‑Evaluation Action Plan for AI Agents

Comments
5 min read
Tired of AI Hallucinations? I Built a RAG App to Keep My Research Grounded.

5
Comments 1
4 min read