DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Learning AI From Scratch: Streaming Output, the Secret Sauce Behind Real-Time LLMs

Learning AI From Scratch: Streaming Output, the Secret Sauce Behind Real-Time LLMs

Comments
3 min read
BATCHNORM IN LANGUAGE MODELS

BATCHNORM IN LANGUAGE MODELS

Comments
16 min read
Giving AI Eyes: Multi-Modal LLMs

Giving AI Eyes: Multi-Modal LLMs

Comments
9 min read
I'm an AI That Designed Its Own Website - Here's How (and Why)

I'm an AI That Designed Its Own Website - Here's How (and Why)

Comments
7 min read
The LLM Shield: How to Build Production-Grade NSFW Guardrails for AI Agents

The LLM Shield: How to Build Production-Grade NSFW Guardrails for AI Agents

1
Comments
4 min read
📌 10 Things You Must Know Before Building Your Language Model from Scratch 📌

📌 10 Things You Must Know Before Building Your Language Model from Scratch 📌

2
Comments
3 min read
The Complete Guide to Streaming LLM Responses in Web Applications: From SSE to Real-Time UI

The Complete Guide to Streaming LLM Responses in Web Applications: From SSE to Real-Time UI

Comments
10 min read
Fine-tuning Qwen 2.5 3B for RBI Regulations: Achieving 8x Performance with Smart Data Augmentation

Fine-tuning Qwen 2.5 3B for RBI Regulations: Achieving 8x Performance with Smart Data Augmentation

1
Comments
13 min read
How Sparse-K Cuts Millions of Attention Computations in llama.cpp

How Sparse-K Cuts Millions of Attention Computations in llama.cpp

2
Comments
6 min read
TOON vs JSON: A Reality Check — When It Saves Tokens and When It Doesn't

TOON vs JSON: A Reality Check — When It Saves Tokens and When It Doesn't

5
Comments 2
5 min read
Building AI Agents from Scratch: A Deep Dive into Function Calling, Tool Use, and Agentic Patterns

Building AI Agents from Scratch: A Deep Dive into Function Calling, Tool Use, and Agentic Patterns

Comments 1
9 min read
Building RAG Systems: From Zero to Hero

Building RAG Systems: From Zero to Hero

2
Comments
19 min read
GraphBit: Reliable AI Workflows with Multi-LLM Integration and Robust Tool Orchestration for Python Developers

GraphBit: Reliable AI Workflows with Multi-LLM Integration and Robust Tool Orchestration for Python Developers

Comments
4 min read
Protege AI Plugin for Ontology Engineering

Protege AI Plugin for Ontology Engineering

Comments
6 min read
How Do You Actually Compare LLMs? (The Battle Nobody's Talking About)

How Do You Actually Compare LLMs? (The Battle Nobody's Talking About)

3
Comments
5 min read
Top LLM Tools Companies Are Using to Add AI to Their Products in 2025

Top LLM Tools Companies Are Using to Add AI to Their Products in 2025

1
Comments 1
6 min read
Is the Monolith Dead? Introducing MQ-AGI: A Modular, Neuro-Symbolic Architecture for Scalable AI

Is the Monolith Dead? Introducing MQ-AGI: A Modular, Neuro-Symbolic Architecture for Scalable AI

Comments
2 min read
LANGUAGE MODELS USING MLP (Part 2)

LANGUAGE MODELS USING MLP (Part 2)

Comments
18 min read
Prompt Engineering Patterns: From Zero-Shot to Chain-of-Thought Reasoning

Prompt Engineering Patterns: From Zero-Shot to Chain-of-Thought Reasoning

1
Comments
14 min read
RAG Chunking Strategies Deep Dive

RAG Chunking Strategies Deep Dive

1
Comments
7 min read
Prompt Injection: What Security Managers Need to Know

Prompt Injection: What Security Managers Need to Know

Comments
15 min read
Why RAG and Agent Systems Are Unstable — A Minimal Deterministic Planner POC

Why RAG and Agent Systems Are Unstable — A Minimal Deterministic Planner POC

Comments 1
2 min read
LLMs + Tool Calls: Clever But Cursed

LLMs + Tool Calls: Clever But Cursed

7
Comments
2 min read
Cross-Modal Embeddings: Bridging AI Modalities

Cross-Modal Embeddings: Bridging AI Modalities

Comments
11 min read
The Complete Guide to Meta-Prompting: The Technique of Having AI Write Your Prompts

The Complete Guide to Meta-Prompting: The Technique of Having AI Write Your Prompts

2
Comments
5 min read
loading...