DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Rewind AI + Cursor AI = screenpipe: how we built a high performance Rust frame streaming API (OSS)

Rewind AI + Cursor AI = screenpipe: how we built a high performance Rust frame streaming API (OSS)

Comments
1 min read
Understanding RAG and Long-Context LLMs: Insights from the SELF-ROUTE Hybrid Approach

Understanding RAG and Long-Context LLMs: Insights from the SELF-ROUTE Hybrid Approach

Comments
3 min read
Txt-to-SQL: Querying Databases with Nebius AI Studio and Agents (part 3)

Txt-to-SQL: Querying Databases with Nebius AI Studio and Agents (part 3)

2
Comments
6 min read
Day:30 Reformer: Efficient Transformer for Large Scale Models

Day:30 Reformer: Efficient Transformer for Large Scale Models

Comments
3 min read
Bolt.new with any LLM, you need to use it

Bolt.new with any LLM, you need to use it

16
Comments 1
2 min read
Building RAG-Powered Applications with LangChain, Pinecone, and OpenAI

Building RAG-Powered Applications with LangChain, Pinecone, and OpenAI

6
Comments 3
7 min read
Day 50: Building a REST API for LLM Inference

Day 50: Building a REST API for LLM Inference

2
Comments
2 min read
Rethinking How We Train Customer-Facing AI Agents

Rethinking How We Train Customer-Facing AI Agents

26
Comments
1 min read
Integrating LangChain with FastAPI for Asynchronous Streaming

Integrating LangChain with FastAPI for Asynchronous Streaming

9
Comments
3 min read
Self-Correcting AI Agents: How to Build AI That Learns From Its Mistakes

Self-Correcting AI Agents: How to Build AI That Learns From Its Mistakes

2
Comments
5 min read
How to Build Smarter AI Agents with Dynamic Tooling

How to Build Smarter AI Agents with Dynamic Tooling

1
Comments
5 min read
Mastering Real-Time AI: A Developer’s Guide to Building Streaming LLMs with FastAPI and Transformers

Mastering Real-Time AI: A Developer’s Guide to Building Streaming LLMs with FastAPI and Transformers

1
Comments
5 min read
Primer on Distributed Parallel Processing with Ray using KubeRay

Primer on Distributed Parallel Processing with Ray using KubeRay

Comments
10 min read
Running Phi 3 with vLLM and Ray Serve

Running Phi 3 with vLLM and Ray Serve

Comments
18 min read
Machine Learning for Software Engineers: A Comprehensive Theoretical Foundation

Machine Learning for Software Engineers: A Comprehensive Theoretical Foundation

27
Comments 4
4 min read
Universal Personal Assistant with LLMs

Universal Personal Assistant with LLMs

2
Comments
6 min read
Gemini 2.0 Released, Reminding of "AI Hitting the Wall" Talks

Gemini 2.0 Released, Reminding of "AI Hitting the Wall" Talks

13
Comments 1
2 min read
Faiss with sqlite for RAG

Faiss with sqlite for RAG

2
Comments 3
1 min read
Day 49: Serving LLMs with ONNX Runtime

Day 49: Serving LLMs with ONNX Runtime

8
Comments
2 min read
Community-driven development of LLM applications — introducing GenSphere

Community-driven development of LLM applications — introducing GenSphere

Comments
5 min read
Introduction to AI Gateway

Introduction to AI Gateway

Comments
2 min read
Building an Article Generator with LangChain and Llama3: An AI Developer's Journey

Building an Article Generator with LangChain and Llama3: An AI Developer's Journey

25
Comments 1
8 min read
7 Best Practices for LLM Testing and Debugging

7 Best Practices for LLM Testing and Debugging

Comments
11 min read
AI Engineer's Tool Review: Guardrails AI

AI Engineer's Tool Review: Guardrails AI

Comments
1 min read
AI and All Data Weekly for 09 Dec 2024

AI and All Data Weekly for 09 Dec 2024

5
Comments 1
5 min read
loading...