Chat With Your Documents Locally Using AnythingLLM and Ollama

#ai #tutorial

A private RAG system where you drop in PDFs, Word docs, and code files and ask questions. Runs on any machine, no cloud dependency.

What You Need

Any computer (GPU optional - CPU works fine)
Ollama installed
About 10 minutes

Architecture

Component	Role
AnythingLLM	Desktop/server app with RAG, agents, built-in vector DB
Ollama	Serves local LLM for chat + embeddings
Qwen3 14B	Default model for answering questions

Setup

1. Install Ollama

# Install from ollama.com, or run with Docker:
docker run -d --gpus all -p 11434:11434 --name ollama \
  -v ollama:/root/.ollama ollama/ollama

# Pull a model:
ollama pull qwen3:14b
# Pull an embedder:
ollama pull nomic-embed-text

2. Install AnythingLLM

Desktop app (easiest): Download from anythingllm.com

Docker:

docker run -d -p 3001:3001 --name anythingllm \
  --add-host host.docker.internal:host-gateway \
  -v anythingllm:/app/server/storage \
  mintplexlabs/anythingllm