DEV Community

# llm

Posts

Building a Meeting Summarizer Backend with Python FastAPI, AWS Transcribe and AWS Bedrock

1
Comments
5 min read
Model Context Protocol (MCP): The USB-C for AI Applications

1
Comments
3 min read
How to run Large Language Models (LLMs) locally.

2
Comments
5 min read
GPT for Word. Use QwQ-32B in Microsoft Word Locally (100% Private).

Comments
1 min read
Overview: "InfiniRetri: Enhancing LLMs for Infinite-Length Context via Attention-Based Retrieval"

1
Comments
4 min read
How to Install German-R1 Locally: Unlock Advanced AI Reasoning in German

4
Comments
6 min read
Beginning a Series on RAG for Nordic APIs

Comments
1 min read
RAG Chatbot: Build with LangChain, Milvus, Fireworks AI 🔥Llama 3.1 8B Instruct, and Cohere embed-multilingual-v2.0

4
Comments
8 min read
AI locators for Playwright 🚀

4
Comments
2 min read
Averaging Our Way to AGI

1
Comments
7 min read
Streamlining Routine ML Tasks with LangChain: A Hacker News Comment Analysis Example

1
Comments 1
4 min read
Overview: "Agentic Retrieval-Augmented Generation: A Comprehensive Survey"

1
Comments
8 min read
Ollama Models Comparison: Compare LLM Responses Side-by-Side

5
Comments
2 min read
I Built an LLM Framework in just 100 Lines

8
Comments 1
2 min read
What is Retrieval-Augmented Generation (RAG)? A Beginner’s Guide

5
Comments 1
2 min read
Comprehensive Guide to Decoding Parameters and Hyperparameters in Large Language Models (LLMs)

2
Comments
3 min read
10 Common Vulnerabilities in Large Language Models (LLMs)

1
Comments
4 min read
DuckDB vs. ClickHouse Local: A Comparative Analysis for Analytical Workloads

1
Comments
4 min read
Getting Started with llms.txt

11
Comments 1
5 min read
Context Windows in Large Language Models

2
Comments
4 min read
Automating Tests for Multiple Generative AI APIs Using Postman (Newman) and Exporting Responses to CSV

2
Comments
3 min read
What is Vector DB?

4
Comments
51 min read
Understanding Transformer-Based Deep Learning Models: Mechanisms, Implementation, Benefits, and Future Development

Comments
5 min read
GPT-4.5 Announced: How to Access the Latest OpenAI Model Without Rate Limits

13
Comments 1
2 min read
Ollama Meets ServBay: A Match Made in Code Heaven

15
Comments 2
2 min read