DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Make Your Vite Project LLM-Friendly with vite-plugin-llms

Make Your Vite Project LLM-Friendly with vite-plugin-llms

1
Comments
3 min read
Rewind AI + Cursor AI = screenpipe: how we built a high performance Rust frame streaming API (OSS)

Rewind AI + Cursor AI = screenpipe: how we built a high performance Rust frame streaming API (OSS)

Comments
1 min read
Understanding RAG and Long-Context LLMs: Insights from the SELF-ROUTE Hybrid Approach

Understanding RAG and Long-Context LLMs: Insights from the SELF-ROUTE Hybrid Approach

Comments
3 min read
Day:30 Reformer: Efficient Transformer for Large Scale Models

Day:30 Reformer: Efficient Transformer for Large Scale Models

Comments
3 min read
Bolt.new with any LLM, you need to use it

Bolt.new with any LLM, you need to use it

13
Comments
2 min read
Building RAG-Powered Applications with LangChain, Pinecone, and OpenAI

Building RAG-Powered Applications with LangChain, Pinecone, and OpenAI

3
Comments 2
7 min read
Day 50: Building a REST API for LLM Inference

Day 50: Building a REST API for LLM Inference

1
Comments
2 min read
Rethinking How We Train Customer-Facing AI Agents

Rethinking How We Train Customer-Facing AI Agents

28
Comments
1 min read
Self-Correcting AI Agents: How to Build AI That Learns From Its Mistakes

Self-Correcting AI Agents: How to Build AI That Learns From Its Mistakes

1
Comments
5 min read
How to Build Smarter AI Agents with Dynamic Tooling

How to Build Smarter AI Agents with Dynamic Tooling

1
Comments
5 min read
Integrating LangChain with FastAPI for Asynchronous Streaming

Integrating LangChain with FastAPI for Asynchronous Streaming

1
Comments
3 min read
Running Phi 3 with vLLM and Ray Serve

Running Phi 3 with vLLM and Ray Serve

Comments
18 min read
Primer on Distributed Parallel Processing with Ray using KubeRay

Primer on Distributed Parallel Processing with Ray using KubeRay

Comments
10 min read
Universal Personal Assistant with LLMs

Universal Personal Assistant with LLMs

2
Comments
6 min read
Machine Learning for Software Engineers: A Comprehensive Theoretical Foundation

Machine Learning for Software Engineers: A Comprehensive Theoretical Foundation

24
Comments 3
4 min read
Day 29: Sparse Transformers: Efficient Scaling for Large Language Models

Day 29: Sparse Transformers: Efficient Scaling for Large Language Models

Comments
3 min read
Gemini 2.0 Released, Reminding of "AI Hitting the Wall" Talks

Gemini 2.0 Released, Reminding of "AI Hitting the Wall" Talks

13
Comments 1
2 min read
Day 49: Serving LLMs with ONNX Runtime

Day 49: Serving LLMs with ONNX Runtime

5
Comments
2 min read
Community-driven development of LLM applications — introducing GenSphere

Community-driven development of LLM applications — introducing GenSphere

Comments
5 min read
Introduction to AI Gateway

Introduction to AI Gateway

Comments
2 min read
Building an Article Generator with LangChain and Llama3: An AI Developer's Journey

Building an Article Generator with LangChain and Llama3: An AI Developer's Journey

25
Comments 1
8 min read
Day 27: Regularization Techniques for Large Language Models (LLMs)

Day 27: Regularization Techniques for Large Language Models (LLMs)

Comments
2 min read
7 Best Practices for LLM Testing and Debugging

7 Best Practices for LLM Testing and Debugging

Comments
11 min read
AI Engineer's Tool Review: Guardrails AI

AI Engineer's Tool Review: Guardrails AI

Comments
1 min read
AI and All Data Weekly for 09 Dec 2024

AI and All Data Weekly for 09 Dec 2024

5
Comments 1
5 min read
loading...