DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
🚀 Semantic Caching — The System Design Secret to Scaling LLMs 🧠💸

🚀 Semantic Caching — The System Design Secret to Scaling LLMs 🧠💸

Comments
3 min read
2025 Voice AI Guide How to Make Your Own Real-Time Voice Agent (Part-3)

2025 Voice AI Guide How to Make Your Own Real-Time Voice Agent (Part-3)

5
Comments
15 min read
[Golang] Quickly Set Up a Free Local ChatGPT with Ollama and Build a LangChainGo Application

[Golang] Quickly Set Up a Free Local ChatGPT with Ollama and Build a LangChainGo Application

Comments
4 min read
Online Course Notes: DeepLearningAI - Advanced Retrieval for AI with Chroma

Online Course Notes: DeepLearningAI - Advanced Retrieval for AI with Chroma

Comments
4 min read
TIL: Notes on Knowledge Retrieval Architecture for LLMs (2023)

TIL: Notes on Knowledge Retrieval Architecture for LLMs (2023)

Comments
3 min read
[Golang][Gemini Pro] Build a LINE Bot with Memory Using Chat Sessions

[Golang][Gemini Pro] Build a LINE Bot with Memory Using Chat Sessions

Comments
6 min read
Cloud Platform: Choosing Between Heroku and Render as an AI (LLM) Engineer

Cloud Platform: Choosing Between Heroku and Render as an AI (LLM) Engineer

Comments
4 min read
[Learning Notes][OpenAI] About OpenAI's New Function Calling Feature

[Learning Notes][OpenAI] About OpenAI's New Function Calling Feature

Comments
6 min read
LangChain on Cloud Run: Getting YouTube Info

LangChain on Cloud Run: Getting YouTube Info

Comments
4 min read
Gemini: Summarize Search Results Based on Your Keywords

Gemini: Summarize Search Results Based on Your Keywords

Comments
4 min read
Using Gemini to Call MCP Functions on Cline

Using Gemini to Call MCP Functions on Cline

Comments
6 min read
Choosing the Right LLM for the Umbraco CMS Developer MCP: An Quick Cost and Performance Analysis

Choosing the Right LLM for the Umbraco CMS Developer MCP: An Quick Cost and Performance Analysis

Comments
6 min read
NV TW LLM Developer Day 2024: Conference Notes

NV TW LLM Developer Day 2024: Conference Notes

Comments
2 min read
[LangChain] Potential Issues with LangChain Embeddings

[LangChain] Potential Issues with LangChain Embeddings

Comments
2 min read
Google Gemma2/PaliGemma: Notes on Learning and Applications

Google Gemma2/PaliGemma: Notes on Learning and Applications

Comments
3 min read
Notes on GPT-4V(ision): The Dawn of LMMs

Notes on GPT-4V(ision): The Dawn of LMMs

Comments
1 min read
[GAI Conference] Enterprise Prompt Engineering by E.SUN Bank - Notes

[GAI Conference] Enterprise Prompt Engineering by E.SUN Bank - Notes

Comments
3 min read
[Conference Notes] NV TW LLM Developer Day 2024

[Conference Notes] NV TW LLM Developer Day 2024

Comments
2 min read
[YouTube] Practical Data Considerations for Building Production-Ready LLM Applications - Summary

[YouTube] Practical Data Considerations for Building Production-Ready LLM Applications - Summary

Comments
2 min read
Google Gemma 2 Bootcamp Resources Released

Google Gemma 2 Bootcamp Resources Released

Comments
2 min read
Gemini/Firebase: Building a Tech News LINE Bot with IFTTT and LangChain

Gemini/Firebase: Building a Tech News LINE Bot with IFTTT and LangChain

Comments
1 min read
RAG AI

RAG AI

Comments
2 min read
The 2M Token Trap: Why "Context Stuffing" Kills Reasoning

The 2M Token Trap: Why "Context Stuffing" Kills Reasoning

5
Comments
5 min read
Open WebUI: Self-Hosted LLM Interface

Open WebUI: Self-Hosted LLM Interface

Comments
13 min read
Is AI Quietly Killing Open Source?

Is AI Quietly Killing Open Source?

Comments
4 min read
loading...