Sovereign macOS Agent · Local-first, zero-cloud intelligence · Alpha v0.0.4
Most AI tools make a trade you didn't explicitly agree to. You send a query, it travels to a remote server, gets processed alongside millions of others, and an answer comes back. The model learns. The platform profits. Your data stays theirs.
Arkhein is built on a different premise entirely: what if the intelligence lived on your machine, reported to no one, and cost you nothing beyond the hardware you already own?
What Is Arkhein?
Arkhein is a native macOS application — built with NativePHP on top of Laravel, Vue 3, and TypeScript — that runs a complete AI reasoning pipeline entirely on your local hardware. No API keys. No cloud dependency. No data ever crosses the hardware boundary.
In short, it is a Sovereign macOS Agent: local-first, zero-cloud intelligence, powered by Ollama for local inference and Vektor for vector search. The entire codebase is open source under the MIT License and available on GitHub.
The Architecture
Arkhein is composed of four interlocking systems, each designed to keep intelligence local and the operator in control.
1. The Vantage Hub — Verticalized Knowledge Silos
Rather than dumping all documents into a single shared vector pool, Arkhein organizes knowledge into Sovereign Silos — isolated, physical partitions that follow a strict hierarchy:
Silo (Folder) → Vessel (Document) → Fragment (Chunk)
This structure enables Parent-Aware RAG: before retrieving a single fragment, the agent already understands the document summary and the subfolder depth. The result is 100% topical isolation — your medical research never bleeds into your client contracts, and Project A never contaminates Project B.
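The hierarchy can be pictured as a small type sketch. This is a hypothetical illustration, not Arkhein's actual schema: the field names (`summary`, `depth`) and the `withParentContext` helper are assumptions made for clarity.

```typescript
// Hypothetical type sketch of the Silo → Vessel → Fragment hierarchy.
interface Fragment {
  id: string;
  text: string;
}

interface Vessel {
  id: string;
  title: string;
  summary: string;       // document-level summary, known before retrieval
  fragments: Fragment[];
}

interface Silo {
  id: string;
  path: string;
  depth: number;         // subfolder depth within the silo
  vessels: Vessel[];
}

// Parent-aware retrieval: a fragment is never returned bare — it carries
// its parent document's summary and the silo's subfolder depth.
function withParentContext(silo: Silo, vessel: Vessel, fragment: Fragment) {
  return {
    text: fragment.text,
    documentSummary: vessel.summary,
    subfolderDepth: silo.depth,
  };
}
```

Because a fragment can only ever be reached through its own silo, queries scoped to one silo physically cannot surface chunks from another.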
You authorize exactly which folders Arkhein can access. Everything else on your filesystem remains untouched.
2. The Mind — Local Inference via Ollama
Inference is handled entirely by Ollama, running locally on your Mac. But Arkhein doesn't simply prompt a model and relay the output — it runs a Multi-Stage Sovereign Arbiter pipeline: a JSON-based reasoning engine that classifies query intent before triggering any retrieval.
A dedicated Dispatcher module determines what kind of query you're making — retrieval, synthesis, filesystem command, or creative generation — and routes it accordingly. The agent thinks before it acts.
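The dispatcher's job can be sketched as a function from query to intent. In Arkhein the classification is done by the local model emitting JSON; the keyword heuristic below is a stand-in for illustration only, and the intent names are assumptions based on the four categories above.

```typescript
type Intent = "retrieval" | "synthesis" | "filesystem" | "creative";

// Hypothetical stand-in for the model-driven Dispatcher: route a query
// to one of the four intent categories before any retrieval happens.
function classify(query: string): Intent {
  // Slash commands map directly to filesystem actions.
  if (/^\/(create|move|organize|delete)\b/.test(query)) return "filesystem";
  if (/\b(summarize|compare|combine)\b/i.test(query)) return "synthesis";
  if (/\b(write|draft|imagine)\b/i.test(query)) return "creative";
  return "retrieval"; // default: look things up in the silos
}
```

Routing first means a creative request never triggers a needless vector search, and a filesystem command never pollutes a retrieval context.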
Two compute profiles let you match the workload to your hardware:
| Profile | Models | Recommended RAM |
|---|---|---|
| Efficient | Mistral + Nomic Embed | 8 GB – 16 GB |
| Elite | Qwen3 Suite | 32 GB+ |
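Profile selection could be expressed as a simple RAM threshold. A hedged sketch: the 32 GB cutoff comes from the table above, the model tags from the install step below, and the choice of embedding model for the Elite profile is an assumption.

```typescript
type Profile = {
  name: "Efficient" | "Elite";
  chatModel: string;
  embedModel: string;
};

// Hypothetical mapping from available RAM (GB) to a compute profile.
function selectProfile(ramGb: number): Profile {
  if (ramGb >= 32) {
    // Embedding model for the Elite profile is an assumption here.
    return { name: "Elite", chatModel: "qwen3", embedModel: "nomic-embed-text" };
  }
  return { name: "Efficient", chatModel: "mistral", embedModel: "nomic-embed-text" };
}
```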
The agent also mirrors your language automatically — write in English, Italian, French, or any other language, and the Archivist responds in kind, without any configuration required.
3. The Memory — Self-Healing SSOT
The memory layer uses a two-component architecture designed for reliability and zero-downtime operation:
- SQLite serves as the Single Source of Truth. All document chunks, metadata, and hierarchy information live here — inspectable, portable, and entirely under your control.
- Vektor acts as the binary accelerator layer — a disposable vector index built on top of SQLite. If the index becomes inconsistent (for example, due to a dimensional mismatch after switching models), Arkhein automatically detects the issue, discards the index, and rebuilds it using shadow partitions — without taking the system offline.
The archive remains accessible throughout any rebuild. No data loss, no service interruption.
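The self-healing check boils down to one invariant: the binary index must match the current embedding dimension, and if it doesn't, it is disposable. A minimal sketch, assuming a `dim` field on the index and a `rebuild` callback that reconstructs it from the SQLite source of truth (both names are hypothetical):

```typescript
interface VectorIndex {
  dim: number; // embedding dimension the index was built with
}

// Hypothetical self-healing check: on dimensional mismatch (e.g. after
// switching embedding models), drop the index and rebuild a shadow copy
// from SQLite. Reads keep hitting SQLite, so there is no downtime.
function ensureIndex(
  current: VectorIndex | null,
  modelDim: number,
  rebuild: () => VectorIndex,
): VectorIndex {
  if (current !== null && current.dim === modelDim) return current;
  return rebuild(); // shadow index, swapped in once complete
}
```

The key design choice: because SQLite holds everything, the vector index carries no unique data and can be thrown away at will.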
4. The Hand — Filesystem Control
Arkhein can issue filesystem commands — /create, /move, /organize, /delete — with surgical precision. Every action, however, requires a verified Strategic Plan that the operator explicitly approves before execution.
This Human-in-the-Loop model is a core design principle. The agent proposes; you decide; Arkhein executes. You remain the final authority at every step.
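The propose/approve/execute loop can be sketched as follows. This is an illustrative shape, not Arkhein's internals: `PlanStep`, `approve`, and `run` are hypothetical names, but the guarantee shown is the one described above: nothing executes until the operator approves.

```typescript
type Command = "create" | "move" | "organize" | "delete";

interface PlanStep {
  command: Command;
  target: string;
}

// Hypothetical sketch of the Human-in-the-Loop gate: the agent proposes a
// Strategic Plan, the operator approves or rejects it, and only approved
// plans ever reach the filesystem.
function executePlan(
  plan: PlanStep[],
  approve: (plan: PlanStep[]) => boolean, // the human in the loop
  run: (step: PlanStep) => void,          // the actual filesystem effect
): boolean {
  if (!approve(plan)) return false; // rejected: no step executes
  for (const step of plan) run(step);
  return true;
}
```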
The Sovereign Archivist
Above the individual silos sits the Sovereign Archivist: a centralized module for global RAG and system-wide intelligence. Use it to search across all authorized silos simultaneously, or to analyze the system architecture itself. It provides a high-level summary view of everything Arkhein has indexed, and routes cross-silo queries for maximum relevance.
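Conceptually, a cross-silo query is a fan-out followed by a global merge. A hedged sketch, where each silo exposes its own search function and the score-based merge is an assumption about how the Archivist ranks results:

```typescript
interface Hit {
  siloId: string;
  text: string;
  score: number; // higher is more relevant
}

// Hypothetical global RAG fan-out: query every authorized silo, then merge
// the per-silo results into one ranked list.
function searchAllSilos(
  silos: Array<(query: string) => Hit[]>,
  query: string,
  topK = 5,
): Hit[] {
  const hits = silos.flatMap((search) => search(query));
  return hits.sort((a, b) => b.score - a.score).slice(0, topK);
}
```

Silo isolation is preserved even here: the Archivist merges results, but each silo still answers only from its own partition.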
Getting Started
Prerequisites: macOS (Apple Silicon), Ollama, PHP 8.4, Node 22.
Step 1 — Pull the inference models:
```shell
# Efficient Profile (Standard — 8 GB–16 GB RAM)
ollama pull mistral
ollama pull nomic-embed-text

# Elite Profile (Optional — 32 GB+ RAM)
ollama pull qwen3
ollama pull qwen3:8b
```
Step 2 — Initialize the application:
```shell
composer install && npm install
php artisan native:migrate
php artisan native:serve
```
Step 3 — Configure:
Open Settings, authorize your knowledge silos, select a compute profile, and monitor the System Heartbeat to confirm all services are active.
The Stack
| Layer | Technology |
|---|---|
| Desktop Runtime | NativePHP |
| Backend | Laravel (PHP 8.4) |
| Frontend | Vue 3 + TypeScript |
| Local Inference | Ollama |
| Vector Search | Vektor |
| Database | SQLite |
| Build Tooling | Vite + Node 22 |
| License | MIT |
Current Status
Arkhein v0.0.4 Alpha is available now for Apple Silicon Macs. The architecture is stable and the core pipeline is fully operational. As an alpha release, rough edges remain — this is software built in the open, not after the fact.
The alpha .dmg is available for direct download from the official website, and the full source is on GitHub.
Download & Links
- Download Alpha v0.0.4 (Apple Silicon): arkhein.melasistema.com
- Source Code: github.com/melasistema/arkhein
- Documentation: docs.arkhein.melasistema.com